LEGO: Layout Expression for Generating One-to-one Mapping

Published: May 12, 2025 | arXiv ID: 2505.08091v1

By: Amir Mohammad Tavakkoli, Cosmin Oancea, Mary Hall

Potential Business Impact:

Helps GPU programs run faster by making data layouts and data movement easier to optimize.

Business Areas:
Field-Programmable Gate Array (FPGA) Hardware

We describe LEGO, a new approach to optimizing data movement whereby code is expressed as a layout-independent computation and composed with layouts for data and computation. This code generator organization derives complex indexing expressions associated with hierarchical parallel code and data movement for GPUs. LEGO maps from layout specification to indexing expressions, and can be integrated into existing compilers and code templates. It facilitates the exploration of data layouts in combination with other optimizations. We demonstrate LEGO's integration with the MLIR and Triton compilers, and with CUDA templates. We show that LEGO achieves performance competitive with Triton, and that its integration with MLIR and CUDA demonstrates broad applicability.
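To make the idea of composing a layout-independent computation with a layout more concrete, here is a minimal Python sketch. It is not LEGO's notation or API; the names `row_major`, `tiled`, `is_one_to_one`, `store`, and `row_sums` are hypothetical and chosen only for illustration. It assumes a layout can be modeled as a one-to-one function from logical 2-D indices to flat memory offsets, and shows the same computation running unchanged over two different layouts.

```python
from itertools import product

def row_major(rows, cols):
    """Hypothetical layout: logical (i, j) -> flat offset, one row after another."""
    return lambda i, j: i * cols + j

def tiled(rows, cols, tile_r, tile_c):
    """Hypothetical layout: each tile_r x tile_c tile is contiguous; tiles are row-major."""
    assert rows % tile_r == 0 and cols % tile_c == 0
    tiles_per_row = cols // tile_c

    def index(i, j):
        tile_id = (i // tile_r) * tiles_per_row + (j // tile_c)
        within = (i % tile_r) * tile_c + (j % tile_c)
        return tile_id * (tile_r * tile_c) + within

    return index

def is_one_to_one(layout, rows, cols):
    """Check that the layout hits every offset in [0, rows*cols) exactly once."""
    offsets = sorted(layout(i, j) for i, j in product(range(rows), range(cols)))
    return offsets == list(range(rows * cols))

def store(matrix, layout):
    """Place a nested-list matrix into flat storage according to the layout."""
    rows, cols = len(matrix), len(matrix[0])
    flat = [None] * (rows * cols)
    for i, j in product(range(rows), range(cols)):
        flat[layout(i, j)] = matrix[i][j]
    return flat

def row_sums(flat, layout, rows, cols):
    """Layout-independent computation: only logical indices (i, j) appear here."""
    return [sum(flat[layout(i, j)] for j in range(cols)) for i in range(rows)]

if __name__ == "__main__":
    m = [[i * 4 + j for j in range(4)] for i in range(4)]
    for name, layout in [("row-major", row_major(4, 4)), ("tiled 2x2", tiled(4, 4, 2, 2))]:
        flat = store(m, layout)
        print(name, is_one_to_one(layout, 4, 4), row_sums(flat, layout, 4, 4))
```

In this sketch, swapping the layout changes where each element lives in memory but not the body of `row_sums`. A real code generator would derive and simplify such index expressions at compile time rather than evaluating them per element at run time.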

Country of Origin
🇺🇸 🇩🇰 Denmark, United States

Page Count
12 pages

Category
Computer Science:
Programming Languages