seq: Performance optimization by FidelSch · Pull Request #9557 · uutils/coreutils

FidelSch · 2025-12-03T20:05:15Z

While reading #6182, I noticed that most of the time spent running `cargo run seq 4e4000003 4e4000003` went to `BigUint::to_string()`.

Reading the code, I found that it is called twice: once to get the actual string representation of the first number, and again on the last number, where only the length is used and the string itself is discarded.
This is fine for fairly small numbers, but it becomes significantly more expensive for larger ones.

On my machine, this change resulted in a ~2x speedup on the case above, and a marginal but seemingly consistent improvement on smaller cases.

~/coreutils$ hyperfine -L seq target/release/coreutils,target/release/coreutils_old "{seq} seq 4e4000003 4e4000003"
Benchmark 1: target/release/coreutils seq 4e4000003 4e4000003
  Time (mean ± σ):     26.009 s ±  0.113 s    [User: 25.992 s, System: 0.015 s]
  Range (min … max):   25.909 s … 26.294 s    10 runs
 
Benchmark 2: target/release/coreutils_old seq 4e4000003 4e4000003
  Time (mean ± σ):     52.372 s ±  0.446 s    [User: 52.352 s, System: 0.017 s]
  Range (min … max):   51.815 s … 53.017 s    10 runs
 
Summary
  'target/release/coreutils seq 4e4000003 4e4000003' ran
    2.01 ± 0.02 times faster than 'target/release/coreutils_old seq 4e4000003 4e4000003'

github-actions · 2025-12-03T20:21:28Z

GNU testsuite comparison:

Skip an intermittent issue tests/misc/tee (fails in this run but passes in the 'main' branch)

ChrisDryden · 2025-12-04T01:36:00Z

Would be great to add your example to the benchmarks

sylvestre · 2025-12-04T06:52:53Z

> Would be great to add your example to the benchmarks

In a separate PR, please :)

anastygnome

Any reason for not using

n.checked_ilog10().unwrap_or(0) + 1

Which should be available?
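
For reference, on Rust's primitive unsigned integers the suggested expression yields the decimal digit count. The sketch below uses `u128` as a stand-in; whether an equivalent method is available on the big-integer type used by `seq` is not confirmed in this thread, and `decimal_digits` is a hypothetical helper name.

```rust
/// Number of decimal digits of `n`, treating 0 as one digit wide.
/// `decimal_digits` is a hypothetical helper, not code from this PR.
fn decimal_digits(n: u128) -> u32 {
    // checked_ilog10 returns None for 0, so fall back to 0 before adding 1.
    n.checked_ilog10().unwrap_or(0) + 1
}

fn main() {
    for n in [0u128, 9, 10, 999, 1000, u128::MAX] {
        // The string length is the reference the shortcut must match.
        assert_eq!(decimal_digits(n) as usize, n.to_string().len());
        println!("{n} has {} digit(s)", decimal_digits(n));
    }
}
```

The integer logarithm is exact, so no string has to be allocated just to measure its length.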

FidelSch · 2025-12-04T12:55:44Z

> Any reason for not using
> n.checked_ilog10().unwrap_or(0) + 1

Seemed unnecessary given it is just a constant. If there is any benefit to this alternative I am happy to refactor.

codspeed-hq · 2025-12-06T15:13:50Z

Merging this PR will improve performance by 58.82%

⚠️

Different runtime environments detected

Some benchmarks with significant performance changes were compared across different runtime environments,
which may affect the accuracy of the results.

Open the report in CodSpeed to investigate

⚡ 2 improved benchmarks
✅ 281 untouched benchmarks
⏩ 38 skipped benchmarks¹

Performance Changes

| | Mode | Benchmark | `BASE` | `HEAD` | Efficiency |
|---|---|---|---|---|---|
| ⚡ | Simulation | `seq_large_integers` | 2.1 ms | 1.4 ms | +58.82% |
| ⚡ | Memory | `seq_large_integers` | 52.8 KB | 46.1 KB | +14.49% |

Comparing FidelSch:seq-optimization (43b0ca5) with main (bb91a5b)

¹ 38 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.

sylvestre · 2025-12-06T15:14:35Z

Any idea why `tsort_input_parsing_heavy` regressed?

github-actions · 2025-12-06T15:17:10Z

GNU testsuite comparison:

Skip an intermittent issue tests/tail/overlay-headers (fails in this run but passes in the 'main' branch)
Congrats! The gnu test tests/tty/tty-eof is no longer failing!

github-actions · 2025-12-06T17:46:28Z

GNU testsuite comparison:

Skip an intermittent issue tests/tail/overlay-headers (fails in this run but passes in the 'main' branch)

anastygnome · 2025-12-07T13:27:31Z

> > Any reason for not using
> > n.checked_ilog10().unwrap_or(0) + 1
>
> Seemed unnecessary given it is just a constant. If there is any benefit to this alternative I am happy to refactor.

If I recall correctly, the performance is similar to your solution, and it's part of the language. Could you try sending a commit to trigger the benchmarks with this solution instead?

sylvestre · 2025-12-14T09:47:00Z

Note: upstream fails this way. What do you think we should do here?

$ LANG=C /usr/bin/seq "4e4000003" "4e4000003"
seq: invalid floating point argument: '4e4000003'
Try '/usr/bin/seq --help' for more information.

FidelSch · 2025-12-15T12:47:51Z

After some digging, the maximum value GNU seq accepts seems to be 11e4931, which is consistent with the largest finite value of an 80-bit long double (about 1.19e4932). This supports the theory that GNU uses this type in its implementation.
By using BigDecimal we significantly extend the representable range, so for huge numbers a direct comparison with GNU seq is not viable.
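
The 11e4931 figure can be sanity-checked with a little arithmetic. The sketch below assumes the x87 80-bit extended format (64-bit significand, maximum binary exponent 16383), whose largest finite value is just under 2^16384. Rust has no 80-bit float type, so the calculation is done with logarithms in `f64`; `ldbl_max_decimal` is a hypothetical helper name.

```rust
/// Approximate the largest finite 80-bit extended value, (2 - 2^-63) * 2^16383,
/// as (mantissa, decimal exponent). The 2^-63 term is far below f64 resolution
/// here, so the value is treated as 2^16384.
fn ldbl_max_decimal() -> (f64, i32) {
    let log10_max = 16384.0 * std::f64::consts::LOG10_2;
    let exponent = log10_max.floor();
    let mantissa = 10f64.powf(log10_max - exponent);
    (mantissa, exponent as i32)
}

fn main() {
    let (mantissa, exponent) = ldbl_max_decimal();
    // Roughly 1.19e4932, i.e. about 11.9e4931: 11e4931 fits, 12e4931 does not.
    println!("max ~ {mantissa:.4}e{exponent}");
}
```

Under these assumptions the limit lands between 11e4931 and 12e4931, which matches the observed cutoff.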

sylvestre · 2025-12-26T23:10:58Z

> After some digging, the maximum value seq accepts seems to be 11e4931, equivalent to the maximum value representable by an 80-bit long double; which supports the theory that GNU uses this type for their implementation. By using BigDecimal we are significantly extending the representable range, so it makes sense that for huge numbers a direct comparison to seq is not viable.

OK, could you please document this in `docs/src/extensions.md`? Thanks.

github-actions · 2026-01-21T15:24:43Z

GNU testsuite comparison:

Skipping an intermittent issue tests/shuf/shuf-reservoir (passes in this run but fails in the 'main' branch)
Skipping an intermittent issue tests/sort/sort-stale-thread-mem (passes in this run but fails in the 'main' branch)

github-actions · 2026-02-11T12:17:18Z

GNU testsuite comparison:

GNU test failed: tests/pr/bounded-memory. tests/pr/bounded-memory is passing on 'main'. Maybe you have to rebase?

sylvestre · 2026-02-11T12:25:03Z

@FidelSch a few changes seem to be unrelated, no?

FidelSch · 2026-02-11T12:42:08Z

Ah, they seem to have been introduced by my markdown formatter; I did not notice until now. Should I roll them back?

sylvestre · 2026-02-11T12:43:22Z

Yes please, keep only the relevant changes.

github-actions · 2026-02-11T14:30:10Z

GNU testsuite comparison:

GNU test failed: tests/pr/bounded-memory. tests/pr/bounded-memory is passing on 'main'. Maybe you have to rebase?

sylvestre

The optimization idea is sound — avoiding to_string() on large numbers is a real win.

However, the bits() / LOG2_10 approximation can be off by 1 for some values (e.g., exact powers of 10). Since this is used for padding width, being off by one character could produce misaligned output. Have you verified against the GNU test suite?
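
The off-by-one concern can be checked by brute force on small values. This is a minimal sketch with `u128` standing in for the big-integer type; `estimated_digits` is a hypothetical estimator (bit length divided by log2(10), rounded up) and not necessarily the formula used in this PR, so which values disagree depends on the rounding chosen.

```rust
/// Exact decimal digit count via string conversion (the slow reference).
fn exact_digits(n: u128) -> usize {
    n.to_string().len()
}

/// Hypothetical estimator: bit length divided by log2(10), rounded up.
/// This is an illustration, not necessarily the formula used in the PR.
fn estimated_digits(n: u128) -> usize {
    let bits = u128::BITS - n.leading_zeros(); // 0 for n == 0
    ((bits as f64 / std::f64::consts::LOG2_10).ceil() as usize).max(1)
}

fn main() {
    // Collect the values below 200 where the estimate disagrees.
    let mismatches: Vec<u128> = (0..200u128)
        .filter(|&n| estimated_digits(n) != exact_digits(n))
        .collect();
    println!("mismatches: {mismatches:?}");
}
```

With this particular rounding, the estimate is one digit too wide for 8, 9, and 64 through 99, because the bit length alone cannot tell 8 apart from 15. A test that compares the estimate against `to_string().len()` around powers of 2 and powers of 10 would catch this kind of drift.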

anastygnome suggested changes Dec 4, 2025

ChrisDryden mentioned this pull request Dec 4, 2025

seq: adding large integers benchmarks #9561

Merged

seq: calculate number length more efficiently

f026b7a

sylvestre force-pushed the seq-optimization branch from 88d12d3 to f026b7a Compare December 6, 2025 15:00

Merge branch 'uutils:main' into seq-optimization

7b79b27

Merge branch 'main' into seq-optimization

f1c2b37

FidelSch force-pushed the seq-optimization branch 4 times, most recently from 02ac2b2 to f1c2b37 Compare February 11, 2026 13:37

seq: Update extensions.md

43b0ca5

Clarify seq output accuracy and value range limitations compared to GNU coreutils.

sylvestre reviewed May 20, 2026
