Fix grok and add ToyGrok model #651

Merged 4 commits into nod-ai:main on Dec 6, 2024

Conversation

rsuderman
Contributor

The `hf` layout for rotary embedding was broken during a previous refactor. Re-adding the layout changes fixes this. This PR also includes a toy grok model for testing, which was validated to produce the same results for prefill and decode.
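
For readers unfamiliar with the layout issue, here is a minimal sketch of the two common rotary embedding conventions. It is not the code from this PR. It assumes that the `hf` layout refers to the Hugging Face `rotate_half` convention, where dimension `i` is paired with dimension `i + d/2`, as opposed to the interleaved convention, which pairs adjacent dimensions. All function names below are hypothetical and chosen for illustration only.

```python
import math


def rope_interleaved(x, pos, base=10000.0):
    """Rotate adjacent pairs (x[0], x[1]), (x[2], x[3]), ... by position-dependent angles."""
    d = len(x)
    out = [0.0] * d
    for i in range(d // 2):
        theta = pos * base ** (-2.0 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[2 * i], x[2 * i + 1]
        out[2 * i] = a * c - b * s
        out[2 * i + 1] = a * s + b * c
    return out


def rope_half_split(x, pos, base=10000.0):
    """Rotate pairs (x[i], x[i + d/2]); assumed here to match the `hf` layout."""
    d = len(x)
    half = d // 2
    out = [0.0] * d
    for i in range(half):
        theta = pos * base ** (-2.0 * i / d)
        c, s = math.cos(theta), math.sin(theta)
        a, b = x[i], x[i + half]
        out[i] = a * c - b * s
        out[i + half] = a * s + b * c
    return out


def interleaved_to_half_split(x):
    """Permute [a0, b0, a1, b1, ...] into [a0, a1, ..., b0, b1, ...]."""
    return x[0::2] + x[1::2]


if __name__ == "__main__":
    x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
    # Applying the layout permutation makes the two conventions agree.
    lhs = interleaved_to_half_split(rope_interleaved(x, pos=3))
    rhs = rope_half_split(interleaved_to_half_split(x), pos=3)
    print(all(abs(p - q) < 1e-12 for p, q in zip(lhs, rhs)))
```

The two conventions agree only once the input and output are permuted consistently. If that permutation is dropped, as a refactor could plausibly do, weights stored in one layout get rotated with the wrong dimension pairing and the outputs diverge.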

rsuderman requested a review from KyleHerndon on December 5, 2024 21:47
rsuderman merged commit 7df7ad4 into nod-ai:main on Dec 6, 2024
8 checks passed
monorimet pushed a commit that referenced this pull request Jan 8, 2025
2 participants