Published June 2024 | Version v1
Dissertation Open

An End-to-End Programming Model for AI Engine Architectures

Contributors

Committee member:

Description

The proliferation of deep learning in various domains has led to remarkable advancements in artificial intelligence applications, such as large-language models for scientific use cases.However, the concomitant exponential growth in computational demands, driven by the development of ever-larger deep learning models, presents significant challenges in terms of resource consumption and sustainability. This dissertation addresses these escalating computational costs by investigating how the complexity of deep learning frameworks' and their abstractions can significantly impact resource usage. In order to mitigate these growing costs, this dissertation presents novel insights into memory planning, high-level synthesis, lightweight frontend development, and end-to-end programming models for accelerator architectures. These insights culminate in the design and implementation of an embedded domain-specific language (eDSL) tailored to deep learning development on a novel accelerator, specifically the AMD AI Engine architecture. By prioritizing access to low-level APIs, leveraging compiler technologies, and rigorous mathematical models, the eDSL demonstrates the feasibility of achieving performant deep learning implementations while maintaining productivity in the design and exploration of deep learning methods.

Files

Levental_uchicago_0330D_17419.pdf

Files (3.1 MB)

Name Size Download all
md5:ac25240a54d58b532c276e53a4c273d8
3.1 MB Preview Download

Additional details

Identifiers

Other
oai:uchicago.tind.io:12382

UChicago Information

Division(s)
Physical Sciences Division
Department(s)
Computer Science