fix: make random iteration over ComparisonIndexer fair across leaf indexers by mvanhorn · Pull Request #2431 · TimefoldAI/timefold-solver

mvanhorn · 2026-06-28T12:31:09Z

Summary

Makes random iteration over a ComparisonIndexer fair across all elements in the query range. When the comparison map holds several buckets (leaf sub-indexers), a tuple now has the same probability of being visited next regardless of which bucket it lives in.

Why this matters

ComparisonIndexer.RandomIterator extended DefaultIterator, which walks the in-range buckets sequentially and only randomizes within each leaf indexer's own randomIterator. As noted in #2325, the iterator picks buckets without regard to how many items each leaf holds, so a tuple in a small bucket is over- or under-represented relative to one in a large bucket. For random move selection this skews the distribution away from uniform-over-elements.

core/.../bavet/common/index/ComparisonIndexer.java now draws each in-range leaf indexer with probability proportional to its size(queryCompositeKey), so every element within the range is equally likely to be visited next. The ordered iterator() / forEach() / size() paths and the single-bucket and empty paths are unchanged.

For the filtered overload, selection draws from each leaf's unfiltered iterator and applies the predicate during selection, removing rejected tuples as they are drawn. This keeps the bucket weights exact, so the result is fair over the surviving elements rather than over the raw bucket sizes. remove() and its IllegalStateException semantics are preserved.

Testing

Added tests in ComparisonIndexerTest exercising the random path directly (constructing the indexer with a random-access leaf backend):

randomIteratorIsFairAcrossLeafIndexersOfDifferentSizes: one 100-element bucket plus three single-element buckets, sampled 200k times; every element's selection frequency is approximately uniform (the small-bucket tuples were never selected under the old behavior).
randomIteratorWithFilterIsFairOverSurvivingElements: a 100-tuple bucket with a single match against a one-tuple bucket; both survivors are selected about equally often.
Single-bucket delegation, out-of-range query, remove() before next(), and predicate-respecting completeness for the filtered overload.

./mvnw -pl core -am test -Dtest=ComparisonIndexerTest -> Tests run: 10, Failures: 0, Errors: 0.

Fixes #2325

…dexers

mvanhorn added 2 commits June 28, 2026 05:16

fix: make random iteration over ComparisonIndexer fair across leaf in…

9b92812

…dexers

fix: weight filtered random iteration by matching tuples for fairness

a60c450

mvanhorn requested a review from triceo as a code owner June 28, 2026 12:31

mvanhorn mentioned this pull request Jun 28, 2026

Random iteration over ComparisonIndexer is not fair #2325

Open

mvanhorn requested a deployment to external June 28, 2026 12:31 — with GitHub Actions Waiting

mvanhorn changed the title ~~Make random iteration over ComparisonIndexer fair across leaf indexers~~ fix: make random iteration over ComparisonIndexer fair across leaf indexers Jun 29, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: make random iteration over ComparisonIndexer fair across leaf indexers#2431

fix: make random iteration over ComparisonIndexer fair across leaf indexers#2431
mvanhorn wants to merge 2 commits into
TimefoldAI:mainfrom
mvanhorn:fix/2325-fix-make-random-iteration-over-compariso

mvanhorn commented Jun 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mvanhorn commented Jun 28, 2026

Summary

Why this matters

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant