Gradual Metaprogramming
By: Tianyu Chen , Darshal Shetty , Jeremy G. Siek and more
Potential Business Impact:
Finds coding mistakes earlier in data programs.
Data engineers increasingly use domain-specific languages (DSLs) to generate the code for data pipelines. Such DSLs are often embedded in Python. Unfortunately, there are challenges in debugging the generation of data pipelines: an error in a Python DSL script is often detected too late, after the execution of the script, and the source code location that triggers the error is hard to pinpoint. In this paper, we focus on the scenario where a DSL embedded in Python (so it is dynamically-typed) generates data pipeline description code that is statically-typed. We propose gradual metaprogramming to (1) provide a migration path toward statically typed DSLs, (2) immediately provide earlier detection of code generation type errors, and (3) report the source code location responsible for the type error. Gradual metaprogramming accomplishes this by type checking code fragments and incrementally performing runtime checks as they are spliced together. We define MetaGTLC, a metaprogramming calculus in which a gradually-typed metalanguage manipulates a statically-typed object language, and give semantics to it by translation to the cast calculus MetaCC. We prove that successful metaevaluation always generates a well-typed object program and mechanize the proof in Agda.
Similar Papers
Guided Tensor Lifting
Software Engineering
Teaches computers to rewrite old code for new tasks.
Generating Dynamic Graph Algorithms for Multiple Backends for a Graph DSL
Distributed, Parallel, and Cluster Computing
Makes computer programs handle changing information faster.
Toward a Corpus Study of the Dynamic Gradual Type
Programming Languages
Helps programmers write code that's easier to fix.