Skip to content
@lmarena

lmarena

An Open Platform for Crowdsourced AI Benchmarking

Popular repositories Loading

  1. arena-hard-auto arena-hard-auto Public

    Arena-Hard-Auto: An automatic LLM benchmark.

    Python 955 134

  2. copilot-arena copilot-arena Public

    TypeScript 334 25

  3. p2l p2l Public

    Prompt-to-Leaderboard

    Python 260 23

  4. PPE PPE Public

    Jupyter Notebook 57 12

  5. search-arena search-arena Public

    ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

    Jupyter Notebook 41 6

  6. lmarena.github.io lmarena.github.io Public

    HTML 15 14

Repositories

Showing 10 of 10 repositories

Top languages

Loading…

Most used topics

Loading…