Skip to content

Emersonksc/Local-Model-Evaluation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Modle Evaluation

Use commercial models to evaluate the quality of local models for roleplay or novel generation.

This evaluation simulates how SillyTavern communicates with Text Generation Web UI, which means you need to deploy both tools to evaluate the local models' text quality using commercial models like DeepSeek R1, ChatGPT, or other models.

Why this tool?

There are no datasets specifically evaluating novel text quality. You typically have to do it manually or use commercial models to evaluate the novel outputs or roleplay outputs.

For the first approach, it takes too much time to finish the evaluation. The latter one can batch generate the outputs and evaluate them automatically, saving significant time and effort. :)

About

use commercial models to evaluate the local models quality for roleplay or novel

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages