Path: csiph.com!eternal-september.org!feeder.eternal-september.org!news.iecc.com!.POSTED.news.iecc.com!nerds-end
From: John R Levine <johnl@taugh.com>
Newsgroups: comp.compilers
Subject: Paper: Retrofitting Control Flow Graphs in LLVM IR for Auto Vectorization
Date: Wed, 08 Oct 2025 17:07:20 -0400
Organization: Compilers Central
Sender: johnl%iecc.com
Approved: comp.compilers@iecc.com
Message-ID: <25-10-001@comp.compilers>
MIME-Version: 1.0
Content-Type: text/plain; charset="UTF-8"
Injection-Info: gal.iecc.com; posting-host="news.iecc.com:2001:470:1f07:1126:0:676f:7373:6970"; logging-data="38963"; mail-complaints-to="abuse@iecc.com"
Keywords: parallel, optimize
Posted-Date: 08 Oct 2025 17:48:39 EDT
X-submission-address: compilers@iecc.com
X-moderator-address: compilers-request@iecc.com
X-FAQ-and-archives: http://compilers.iecc.com
Xref: csiph.com comp.compilers:3700

This paper says it significantly improves vector performance in GCC and
LLVM.

https://arxiv.org/abs/2510.04890

Retrofitting Control Flow Graphs in LLVM IR for Auto Vectorization

Shihan Fang, Wenxin Zheng

Modern processors increasingly rely on SIMD instruction sets, such as AVX
and RVV, to significantly enhance parallelism and computational
performance. However, production-ready compilers like LLVM and GCC often
fail to fully exploit available vectorization opportunities due to
disjoint vectorization passes and limited extensibility. Although recent
attempts in heuristics and intermediate representation (IR) designs have
attempted to address these problems, efficiently simplifying control flow
analysis and accurately identifying vectorization opportunities remain
challenging tasks.

To address these issues, we introduce a novel vectorization pipeline
featuring two specialized IR extensions: SIR, which encodes high-level
structural information, and VIR, which explicitly represents instruction
dependencies through data dependency analysis. Leveraging the detailed
dependency information provided by VIR, we develop a flexible and
extensible vectorization framework. This approach substantially improves
interoperability across vectorization passes and expands the search space
for identifying isomorphic instructions, ultimately enhancing both the
scope and efficiency of automatic vectorization. Experimental evaluations
demonstrate that our proposed vectorization pipeline achieves significant
performance improvements, delivering speedups of up to 53% and 58%
compared to LLVM and GCC, respectively.

Regards,
John Levine, johnl@taugh.com, Taughannock Networks, Trumansburg NY
Please consider the environment before reading this e-mail. https://jl.ly