How to train this model #183

Open
1 task done
CodingGreatEmperor opened this issue Jan 11, 2025 · 6 comments
Labels
question Further information is requested

Comments

@CodingGreatEmperor

Due diligence

  • I have done my due diligence in trying to find the answer myself.

Topic

The paper

Question

How do I train the mini model, the Moshi model, and the tokenizer? I just want to customize a few of their model parameters.

@CodingGreatEmperor CodingGreatEmperor added the question Further information is requested label Jan 11, 2025
@yukiarimo

+1

@bruno-hays

https://github.com/kyutai-labs/moshi/blob/main/FAQ.md

We will release some training / fine-tuning code, but we do not have any timeline yet. Please be patient.

We don't have any other information or timeline at the moment, AFAIK

@Airoura

Airoura commented Feb 26, 2025

It's too hard to implement a Llama version of Moshi from scratch; we need to work together.

https://github.com/Airoura/LlamaMoshi

@2010b9

2010b9 commented Feb 27, 2025

Hi! 🙂

I have some questions (maybe they don't make total sense – sorry if that's the case! – but I'm still learning about this):

  • Will the fine-tuning / training code make it possible for the model to "speak" in other languages? And to perform tool calling? If so, do you have any idea how many training examples would be needed? And for how long and on which hardware should I train the model?

These are two requirements I have for my use case. Maybe there are more that I don't know about, but these seem to be the two most important ones: speaking in another language and performing tool calling.

Thanks in advance 🙂

@davidbrowne17

https://github.com/yangdongchao/RSTnet provides code to fine-tune Moshi. I have used it (with some modifications) to fine-tune Moshi.

@yukiarimo

Would love to have a Colab of that
