We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
需要支持提取错误case,包括问题、标准答案、错误答案。 然后需要支持单测错误case,把错误case作为数据集,只载入问题、标准答案,然后让debug后的bmodel回答,测试debug效果。