ACL26-PluralEval

Evaluating Language Model Pluralism through In-the-wild Crowd Discussions

Tip

LLMs tend to soften disagreement when you present an idea as your own. For a more candid assessment, ask about "a recipe I found" rather than "my recipe idea".

We study whether LLMs faithfully represent the diversity of public opinions or collapse toward sycophantic agreement with user-stated beliefs. PluralEval builds reference opinion sets from real Reddit discussions, clusters them into stance groups, then measures how LLM outputs degrade under biased prompting in three experiments: MCQ identification, ranking, and open-ended generation.
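To make the idea of biased prompting concrete, here is a minimal sketch of how a neutral prompt and a user-owned prompt might be paired for the same stance. The function name `build_prompt_pair` and both templates are hypothetical illustrations, not the prompts used in this repository.

```python
def build_prompt_pair(topic: str, stance: str) -> dict:
    """Return a neutral and a biased prompt that differ only in framing.

    Both templates are illustrative assumptions, not the project's prompts.
    """
    question = "What are the main opinions people hold on this topic?"
    # Neutral framing: the stance is attributed to no one in particular.
    neutral = f'Here is a view on {topic} that I found: "{stance}". {question}'
    # Biased framing: the user claims the stance as their own belief.
    biased = f'I believe this about {topic}: "{stance}". {question}'
    return {"neutral": neutral, "biased": biased}


pair = build_prompt_pair("remote work", "Offices are obsolete")
print(pair["neutral"])
print(pair["biased"])
```

Keeping everything except the ownership framing identical means any difference in the model's answers can be attributed to that framing.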

The pipeline has four stages:

0_data/ — scrape raw Reddit submissions + comments into per-submission CSVs.
1_opinion_generation/ — extract one atomic opinion per comment.
2_clustering/ — group opinions into LLM-summarised clusters.
3_evaluation/ — three experiments measuring plurality awareness:
- mcq_popularity_identification/
- ranking_degradation/
- sycophancy_detection/

See each sub-folder's README for inputs, outputs, and run commands.
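As a rough illustration of what "ranking degradation" could mean, the sketch below compares a model's ranking of opinion clusters against a reference popularity ranking using Kendall's tau, once under a neutral prompt and once under a biased one. The choice of Kendall's tau, the cluster labels, and the example rankings are all assumptions made for illustration; the metric actually used in 3_evaluation/ranking_degradation/ may differ.

```python
from itertools import combinations


def kendall_tau(reference: list, predicted: list) -> float:
    """Kendall rank correlation between two orderings of the same items (no ties)."""
    position = {item: i for i, item in enumerate(predicted)}
    concordant = discordant = 0
    # Each pair (a, b) has a ranked before b in the reference ordering.
    for a, b in combinations(reference, 2):
        if position[a] < position[b]:
            concordant += 1
        else:
            discordant += 1
    return (concordant - discordant) / (concordant + discordant)


# Hypothetical cluster labels, ordered from most to least popular.
reference = ["A", "B", "C", "D"]
neutral_ranking = ["A", "B", "D", "C"]  # assumed model output, neutral prompt
biased_ranking = ["D", "A", "B", "C"]   # assumed model output, user endorses D

tau_neutral = kendall_tau(reference, neutral_ranking)
tau_biased = kendall_tau(reference, biased_ranking)
print(f"degradation: {tau_neutral - tau_biased:.3f}")
```

A larger drop in tau between the neutral and biased conditions would indicate that the user's stated belief pulled the ranking further from the crowd's.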

Setup

conda env create -f environment.yml
conda activate pluraleval
export OPENAI_API_KEY=...
export GEMINI_API_KEY=...
export ANTHROPIC_API_KEY=...
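
Before launching a long run, it can help to confirm that all three keys are visible to Python. The helper below is a small sketch and not part of the repository; the name `missing_api_keys` is hypothetical, and it only checks that each variable is set and non-empty, not that the key is valid.

```python
import os

# The three variables exported in the setup commands above.
REQUIRED_KEYS = ("OPENAI_API_KEY", "GEMINI_API_KEY", "ANTHROPIC_API_KEY")


def missing_api_keys(environ=None) -> list:
    """Return the names of required keys that are unset or empty."""
    env = os.environ if environ is None else environ
    return [name for name in REQUIRED_KEYS if not env.get(name)]


missing = missing_api_keys()
if missing:
    print("Missing API keys: " + ", ".join(missing))
else:
    print("All API keys are set.")
```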

Citation

@inproceedings{mundada26evaluating,
  title     = "Evaluating language model pluralism through in-the-wild crowd discussions",
  author    = "Gagan Mundada and Rohan Surana and Nandhini Swaminathan and Bodhisattwa Prasad Majumder and Junda Wu and Julian McAuley and Zhouhang Xie",
  year      = "2026",
  booktitle = "ACL"
}
