Releases · openai/gpt-oss · GitHub

15 Sep 19:30

v0.0.7 Latest

Latest

What's Changed

Evals: correctly pass temperature/max_tokens when using Responses API by @Maratyszcza in #174
Metal: move sampling to GPU by @Maratyszcza in #175
Metal: benchmark generation of 100 tokens instead of 1 by @Maratyszcza in #178
Metal: support generating multiple tokens at once by @Maratyszcza in #179
Adding prefill benchmarking for metal backend by @ibahmed-oai in #181
Metal: tune threadgroup sizes by @Maratyszcza in #180
Metal: Adding optimized dense matmul kernel to optimize prefill perf by @ibahmed-oai in #183
Metal: fused QKV projection (matmul+RoPE) kernel by @Maratyszcza in #184
[Bugfix]Capture stderr for python tool with uv as backend by @wuhang2014 in #182

New Contributors

@ibahmed-oai made their first contribution in #181
@wuhang2014 made their first contribution in #182

Full Changelog: v0.0.6...v0.0.7

Contributors

Maratyszcza, wuhang2014, and ibahmed-oai

Assets 2

03 Sep 22:37

v0.0.6

What's Changed

Metal: add end-to-end benchmarks by @Maratyszcza in #161
Metal: simplify and optimize Reponses API adapter by @Maratyszcza in #162
Metal: fix KV-cache invalidation after reset+append by @Maratyszcza in #163
Increase max output tokens in Reponses API to 131K by @Maratyszcza in #165
Remove requirement on maximum Python version by @Maratyszcza in #167
Move Lemonade to AMD section of awesome-gpt-oss by @danielholanda in #164
Added VLLM Offline Serve working code. by @hrithiksagar-tih in #150
Metal: indicate threadgroup is a multiple of simdgroup by @Maratyszcza in #168
Metal: mlock model weights in memory by @Maratyszcza in #170
Add You.com as tool for browser by @bojanbabic in #171

New Contributors

@hrithiksagar-tih made their first contribution in #150
@bojanbabic made their first contribution in #171

Full Changelog: v0.0.5...v0.0.6

Contributors

bojanbabic, Maratyszcza, and 2 other contributors

Assets 2

28 Aug 19:04

v0.0.5

What's Changed

Update awesome-gpt-oss.md with llama.cpp by @dkundel-openai in #148
Update README.md by @dkundel-openai in #154
Added Tensorfuse (AWS) guide by @samagra14 in #118
Add Lemonade to awesome-gpt-oss by @danielholanda in #117
Add uv python backend by @heheda12345 in #156

New Contributors

@samagra14 made their first contribution in #118
@danielholanda made their first contribution in #117

Full Changelog: v0.0.4...v0.0.5

Contributors

danielholanda, samagra14, and 2 other contributors

Assets 2

18 Aug 01:03

v0.0.4

What's Changed

fix streamlit & ollama demo. Add python tool by @dkundel-openai in #131
Add some links to awesome-gpt-oss.md by @hiyouga in #28
fix: fix f-string unmatched '(' bug in streamlit_chat.py by @liuzhiqi71 in #31
Fix start_q use in upper bound calculation by @peterbell10 in #136
Process tokens in Context lazily by @Maratyszcza in #138
Fix for bug where / in model name causes evals to fail by @simonw in #142
[README] Rename with_browser() to with_browser_tool() in README by @xiaohk in #140
Update attention kernel to use TensorDescriptor by @peterbell10 in #137
Parallelize SDPA across multiple simdgroups by @Maratyszcza in #144
chore: release 0.0.4 by @dkundel-openai in #145

New Contributors

@hiyouga made their first contribution in #28
@liuzhiqi71 made their first contribution in #31
@peterbell10 made their first contribution in #136
@simonw made their first contribution in #142
@xiaohk made their first contribution in #140

Full Changelog: v0.0.3...v0.0.4

Contributors

simonw, Maratyszcza, and 5 other contributors

Assets 2