-
-
Notifications
You must be signed in to change notification settings - Fork 14.5k
Open
Labels
A-LTOArea: Link-time optimization (LTO)Area: Link-time optimization (LTO)C-optimizationCategory: An issue highlighting optimization opportunities or PRs implementing suchCategory: An issue highlighting optimization opportunities or PRs implementing suchI-slowIssue: Problems and improvements with respect to performance of generated code.Issue: Problems and improvements with respect to performance of generated code.T-compilerRelevant to the compiler team, which will review and decide on the PR/issue.Relevant to the compiler team, which will review and decide on the PR/issue.
Description
The problem:
bench_mat4_transform_point3
time: [18.883 ns 18.986 ns 19.093 ns]
change: [+4284.5% +4303.0% +4320.2%] (p = 0.00 < 0.05)
Performance has regressed.
Found 4 outliers among 100 measurements (4.00%)
3 (3.00%) high mild
1 (1.00%) high severe
I encountered this issue while working on benchmarks from nalgebra crate after recompiling it with RUSTFLAGS="-C target-cpu=native".
Here is the smaller reproducer with more details in README: https://github.com/im-0/rust-fat-lto-perf-degradation (it still uses nalgebra and criterion as dependencies). This reproducer requires codegen-units = 32, but some benchmarks from nalgebra suffer even with just lto = "fat" and default value of codegen-units (which is 16).
I expected to see this happen: no or small performance degradation
Instead, this happened: more than 4000% performance degradation
Meta
rustc --version --verbose:
rustc 1.89.0 (29483883e 2025-08-04) (Fedora 1.89.0-2.fc42)
binary: rustc
commit-hash: 29483883eed69d5fb4db01964cdf2af4d86e9cb2
commit-date: 2025-08-04
host: x86_64-unknown-linux-gnu
release: 1.89.0
LLVM version: 20.1.8
and
rustc 1.89.0 (29483883e 2025-08-04)
binary: rustc
commit-hash: 29483883eed69d5fb4db01964cdf2af4d86e9cb2
commit-date: 2025-08-04
host: x86_64-unknown-linux-gnu
release: 1.89.0
LLVM version: 20.1.7
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
A-LTOArea: Link-time optimization (LTO)Area: Link-time optimization (LTO)C-optimizationCategory: An issue highlighting optimization opportunities or PRs implementing suchCategory: An issue highlighting optimization opportunities or PRs implementing suchI-slowIssue: Problems and improvements with respect to performance of generated code.Issue: Problems and improvements with respect to performance of generated code.T-compilerRelevant to the compiler team, which will review and decide on the PR/issue.Relevant to the compiler team, which will review and decide on the PR/issue.