Software Verification Laboratory @ UMB

Project description

Programming Graphical Processing Units (GPUs) is an art currently reserved to a select few experts, as unlocking the full potential of these devices requires mastering an intricate hardware architecture and execution model. Scientists and domain experts need to adapt their algorithms to a programming model where simply changing the order in which data is accessed can have a 10× overhead, and off-by-one errors can silently corrupt data. The aim of this project is to help achieve correct and efficient GPU programming, with a static framework that explains sources of performance degradation and repairs concurrency bugs without loss of performance.

This project consists of three tasks.

We will develop bug prevention techniques that can prove the absence deadlocks/data-races and identify the root cause of concurrency errors. We wil introduce the first techniques that can analyze programs fully symbolically and techniques that mitigate false alarms.
We will develop static performance-profiling techniques that help identify the major sources of performance degradation, uncoalesced accesses, bank conflicts, divergence, and over-synchronization. Our technique includes an amortization analysis that de- rives a symbolic expression capturing the resource usage bounds of memory transactions, synchronizations, among others.
We will develop program repair techniques that can fix concurrency bugs without sacrificing performance.

Intellectual Merit

Our project advances the state of the art in static verification for GPUs and other accelerator architectures, both in terms of correctness and performance analysis. Existing solutions suffer from a high rate of false alarms, cannot handle large programs, or are unsound. Most importantly, existing approaches either address performance or safety, but not both; our project addresses both complementarily.

A key aspect of this project is an underlying static verification infrastructure, a collection of analysis backed by theoretical results, that enable novel and efficient analysis and tools. Such verification infrastructure is built around a novel and general intermediate representation based on behavioral type theory. Our project expands our formal understanding of GPU programming models, by introducing safety properties, semantics preserving transformations, and behavioral equivalences, fully mechanized using a proof assistant.

Research outputs. This research project produced 10 articles:

2 journal articles,
2 conference articles,
2 workshop articles, and
4 peer-reviewed extended abstracts.

Broader Impacts

This project makes GPU programming more accessible to scientists and domain experts, by furthering their understanding of parallel programming and hardware architectures. An outcome of this project is improved developer productivity and software sustainability, as our project aids in writing correct and highly efficient GPU programs without sacrificing either. We expect an interest from industry (e.g., autonomous mobility, artificial intelligence applications) as well as from academy (e.g., energy national laboratories). Our research lowers the barrier to entry of GPU programming, so we expect to widen the suitability of GPUs to more fields. The tools that result from this project can empower students to understand bugs in their code autonomously, leading to a more focused pedagogical experience between the instructor and the student.

Open Source Software

Faial: a correctness and performance verifier for GPU programs
Faial-setup: integrate Faial into GitHub CI workflows

Publications

2 Journal papers:

Sound and partially-complete static analysis of data-races in GPU programs. Dennis Liew, Tiago Cogumbreiro, Julien Lange. PACMPL, 8(OOPSLA), 2024.
Memory Access Protocols: Certified Data-Race Freedom for GPU Kernels. Tiago Cogumbreiro, Julien Lange, Dennis Liew, Hannah Zicarelli. FMSD, 2023. invited paper (CAV'21 special issue)
Slides PDF DOI

2 Conference papers:

Shelley: a framework for model checking call ordering on hierarchical systems. Carlos Mão de Ferro, Tiago Cogumbreiro, Francisco Martins. In COORDINATION. Springer, 2023.
DOI
Dynamic Determinacy Race Detection for Task-Parallel Programs with Promises. Feiyang Jin, Lechen Yu, Tiago Cogumbreiro, Vivek Sarkar, Jun Shirako. In ECOOP, volume 263 of LIPIcs. Schloss Dagstuhl, 2023.
DOI

2 Workshop papers:

Hidden assumptions in static verification of data-race free GPU programs. Tiago Cogumbreiro, Julien Lange. In VIVEKFEST. 2024.
Formalizing Model Inference of MicroPython. Carlos Mão de Ferro, Tiago Cogumbreiro, Francisco Martins. In VERDI. IEEE, 2023.
Artifact PDF DOI

4 Peer-reviewed abstracts:

RegularIMP: an imperative calculus to describe regular languages. Soroush Aghajani, Emma Kelminson, Tiago Cogumbreiro. In TFPIE. 2024.
PDF
Towards Concurrency Repair in GPU Kernels with Resource Cost Analysis. Gregory Blike, Tiago Cogumbreiro. In SERPL. 2023.
Slides PDF
Scaling data-race freedom analysis with array projections. Paul Maynard, Tiago Cogumbreiro. In SERPL. 2023.
PDF
Verifying Static Analysis Tools. Udaya Sathiyamoorthy, Tiago Cogumbreiro. In TFP. 2023.
Slides PDF