-
Notifications
You must be signed in to change notification settings - Fork 627
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to train this model #183
Comments
+1 |
https://github.com/kyutai-labs/moshi/blob/main/FAQ.md We will release some training / fine-tuning code, but we do not have any timeline yet. Please be patient. We don't have any other information or timeline at the moment, AFAIK |
It's too hard to implement llama version of moshi from scratch, need to work together. |
Hi! 🙂 I have some questions (maybe they don't make total sense – sorry if that's the case! – but I'm still learning about this):
These are two requirements I have for my use case. Maybe there's more that I don't know about, but these seem the two most important ones – speak in another language and perform tool calling. Thanks in advance 🙂 |
https://github.com/yangdongchao/RSTnet this repo provides code to finetune Moshi, I have used it (with some modifications) to finetune moshi. |
Would love to have a Colab of that |
Due diligence
Topic
The paper
Question
For mini model, moshi model, and tokenizer. I just wanna customize a little bit of model params of them
The text was updated successfully, but these errors were encountered: