Repository for analyzing how expert personas behave on ForecastBench
If you have brew:

```shell
brew install uv
uv venv                    # once
uv pip install -e .        # once
source .venv/bin/activate  # activate the environment
deactivate                 # when you are done
```
- Experiment 1: select the top-k or bottom-k forecasts and have the LLM generate from them
- Experiment 2: human evaluation of forecasts, cross-checked against an LLM judge
- Experiment 3: trying different system prompts
- Experiment 4: expert elicitation via few-shot prompting (examples chosen by an LLM judge or selected manually)
- Experiment 5: sample one question per topic and have all 7 topic experts forecast on it, to see how experts perform on topics outside their expertise
- Experiment 6: sample one question per topic and randomly select X filtered forecasts for the few-shot prompt (based on the suggestion: "Could you try random selection of filtered forecasts, instead of topic-relevant? There might be a difference")
- Experiment 7: comparison of reasonings: define a rubric, rate reasoning similarity on a 1-5 scale, and then test whether an LLM judge can apply the rubric reliably
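The per-topic sampling used in Experiments 5 and 6 can be sketched as below. The data layout (dicts with `topic` keys) and the helper names are assumptions for illustration, not the repo's actual code.

```python
import random
from collections import defaultdict

def sample_one_per_topic(questions, seed=0):
    """Experiment 5/6 setup: pick one question per topic.

    `questions` is assumed to be a list of dicts with a "topic" key.
    """
    rng = random.Random(seed)
    by_topic = defaultdict(list)
    for q in questions:
        by_topic[q["topic"]].append(q)
    return {topic: rng.choice(qs) for topic, qs in by_topic.items()}

def random_few_shot(filtered_forecasts, k, seed=0):
    """Experiment 6: randomly select k filtered forecasts for the
    few-shot prompt, instead of topic-relevant ones."""
    rng = random.Random(seed)
    return rng.sample(filtered_forecasts, min(k, len(filtered_forecasts)))
```

Seeding the RNG keeps the question/forecast selection reproducible across experiment runs.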
Analysis:
- Add mean Brier score and relative Brier score
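The two metrics above can be computed as in this sketch. Defining the relative Brier score as the difference from a reference forecaster's mean Brier score on the same questions is an assumption about the intended definition.

```python
def brier(prob, outcome):
    """Brier score for one binary forecast: (p - o)^2, lower is better."""
    return (prob - outcome) ** 2

def mean_brier(probs, outcomes):
    """Mean Brier score over a set of resolved questions."""
    return sum(brier(p, o) for p, o in zip(probs, outcomes)) / len(probs)

def relative_brier(probs, outcomes, ref_probs):
    """Mean Brier minus a reference forecaster's mean Brier on the same
    questions (assumed definition; negative means better than reference)."""
    return mean_brier(probs, outcomes) - mean_brier(ref_probs, outcomes)
```

For example, a forecaster who assigns probability 1.0 to events that happen scores 0.0, while an uninformative 0.5-everywhere reference scores 0.25, giving a relative Brier of -0.25.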
Supporting documentation for this project:
- Rationale & Methodology - Detailed explanation of experimental design and approach
- Feedback Variants - All feedback variants for all feedback types
- LLM as a Judge Framework - Rubric and evaluation methodology
- Additional Results & Analysis - Supplementary findings and visualizations