GitHub - fangz-ai/Model_EvaL: evaluator for self-attention LLMs on TPU of Sophgo

需要支持提取错误case，包括问题、标准答案、错误答案。然后需要支持单测错误case，把错误case作为数据集，只载入问题、标准答案，然后让debug后的bmodel回答，测试debug效果。

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
dataset		dataset
eval		eval
support		support
task		task
third_party		third_party
.gitattributes		.gitattributes
CMakeLists.txt		CMakeLists.txt
README.md		README.md
__main__.py		__main__.py
chat.cpp		chat.cpp

Provide feedback