ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

ParEVO is a new AI framework that automatically generates correct and efficient parallel code for irregular data structures like sparse graphs and unbalanced trees. The system combines the Parlay-Instruct Corpus (13,820 parallel tasks), fine-tuned LLMs (DeepSeek, Qwen, Gemini), and an Evolutionary Coding Agent to achieve an average 106x speedup on the ParEval benchmark, matching expert human coder performance.

ParEVO: Synthesizing Code for Irregular Data: High-Performance Parallelism through Agentic Evolution

ParEVO: New AI Framework Bridges Critical Gap in High-Performance Parallel Programming

A new AI framework named ParEVO has been introduced to solve one of the most persistent challenges in high-performance computing: automatically generating correct and efficient parallel code for irregular data structures. This breakthrough addresses a critical bottleneck where traditional methods and current Large Language Models (LLMs) often fail, producing code with race conditions, deadlocks, and poor scaling. The system combines a novel training corpus, specialized fine-tuned models, and an evolutionary agent to achieve unprecedented performance, matching and sometimes surpassing expert human coders.

The Core Challenge: Parallelizing the "Irregular"

The shift from sequential to parallel computing is fundamental for modern applications, from scientific simulations to large-scale data analysis. However, this transition is notoriously difficult for irregular data—structures like sparse graphs, unbalanced trees, and non-uniform meshes where data dependencies are unpredictable and static scheduling fails. Current AI coding assistants frequently generate buggy, non-scalable code for these tasks, creating a significant barrier to adoption and performance.

The ParEVO Framework: A Three-Pronged Solution

To bridge this gap, the ParEVO framework employs a comprehensive, multi-stage approach designed for rigor and performance.

First, it introduces the Parlay-Instruct Corpus, a meticulously curated dataset of 13,820 parallel programming tasks. This corpus was synthesized using a "Critic-Refine" pipeline that explicitly filters for empirically performant algorithms, ensuring they correctly utilize foundational Work-Span parallel primitives. This high-quality data serves as the essential training ground.

Second, the team created specialized versions of leading models—including DeepSeek, Qwen, and Gemini—by fine-tuning them on this corpus. This process aligns the models' probabilistic code generation with the rigorous, correctness-focused semantics of the ParlayLib parallel programming library, moving beyond mere syntactic correctness.

Third, and crucially, ParEVO features an Evolutionary Coding Agent (ECA). This component tackles the "last mile" problem of correctness by iteratively repairing generated code. It uses automated feedback from compilers, dynamic race detectors, and performance profilers to evolve code until it is both functionally correct and highly optimized.

Benchmark Performance: Outpacing Models and Matching Experts

The results on the ParEval benchmark are striking. ParEVO achieves an average 106x speedup across the entire benchmark suite, with a maximum recorded speedup of 1103x. On the particularly challenging subset of complex irregular graph problems, it maintains a robust 13.6x speedup, significantly outperforming state-of-the-art commercial AI models.

Perhaps more impressively, the evolutionary approach allows ParEVO to match expert human baselines. On specific, highly-irregular computational kernels, the system achieved up to a 4.1x speedup over already-optimized human-written code, demonstrating its potential to not just assist but advance the frontier of high-performance algorithm design.

Why This Matters for the Future of Computing

  • Democratizes High-Performance Computing: By automating the most error-prone aspects of parallel programming, ParEVO lowers the steep learning curve, allowing a broader range of developers and scientists to leverage multi-core and distributed systems effectively.
  • Solves a Critical AI Shortfall: It directly addresses the well-documented failure of current LLMs in generating reliable concurrent code, providing a blueprint for integrating formal correctness and performance semantics into AI code generation.
  • Unlocks New Applications: Efficient handling of irregular data structures is key to advancements in fields like computational biology, social network analysis, and finite element modeling, where data is inherently non-uniform.
  • Establishes a New Paradigm: The combination of curated data, semantic fine-tuning, and evolutionary validation represents a powerful template for developing trustworthy AI systems for other complex, correctness-critical domains beyond parallel programming.

The source code and datasets for ParEVO have been made publicly available, fostering further research and development in this critical area. The project repository can be accessed at https://github.com/WildAlg/ParEVO.

常见问题