Skip to content
This repository was archived by the owner on May 1, 2025. It is now read-only.

Add RWKV models #16

Open
wants to merge 2 commits into
base: main
Choose a base branch
from
Open

Add RWKV models #16

wants to merge 2 commits into from

Conversation

guangyusong
Copy link

@guangyusong guangyusong commented Jun 5, 2023

Changes:

Added support for RWKV model family.

Related links:

Paper: https://arxiv.org/abs/2305.13048
Github: https://github.com/BlinkDL/RWKV-LM

Screenshots:

Model list:
Screenshot 2023-06-05 at 2 10 45 AM

Sample generation:
Screenshot 2023-06-05 at 2 09 48 AM

@salesforce-cla
Copy link

salesforce-cla bot commented Jun 5, 2023

Thanks for the contribution! Before we can merge this, we need @guangyusong to sign the Salesforce Inc. Contributor License Agreement.

@bdqnghi
Copy link
Contributor

bdqnghi commented Jun 6, 2023

thank you. The RWKV model family looks very nice. However, they are not really code learning models, do you have the checkpoints related to coding tasks?

@guangyusong
Copy link
Author

The RWKV model family should have a similar data split as GPT-J. We anticipate releasing a model that's more adept at coding tasks in the near future.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants