Fix max number of tokens for synthetic data generator (#170)

jackcook · markurtz · web-flow · commit b85c6b963f60 · 2025-05-21T10:45:25.000-04:00
When using `prompt_tokens_max` (and not using `prompt_tokens_stdev`),
there will occasionally be one token more than the maximum number
specified. This can be tested as follows:

```
from guidellm.utils import IntegerRangeSampler

MIN_VALUE = 5
MAX_VALUE = 15

irs = IntegerRangeSampler(average=(MAX_VALUE - MIN_VALUE) // 2, variance=None, min_value=MIN_VALUE, max_value=MAX_VALUE, random_seed=None)
it = iter(irs)

for _ in range(10000):
    assert next(it) != 16
```

The assertion will fire, despite the max being set to 15. This happens
because `random.randint`, which is used by `IntegerRangeSampler`,
generates numbers up to and including the max value it is given. This PR
fixes that.

Co-authored-by: Mark Kurtz &lt;mark.j.kurtz@gmail.com&gt;
diff --git a/src/guidellm/utils/random.py b/src/guidellm/utils/random.py
@@ -37,7 +37,7 @@ def __iter__(self) -> Iterator[int]:
             if calc_min == calc_max:
                 yield calc_min
             elif not self.variance:
-                yield self.rng.randint(calc_min, calc_max + 1)
+                yield self.rng.randint(calc_min, calc_max)
             else:
                 rand = self.rng.gauss(self.average, self.variance)
                 yield round(max(calc_min, min(calc_max, rand)))