Skip to content

Actions: tatsu-lab/alpaca_eval

alpaca_eval unit tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
349 workflow runs
349 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Don't set root logger to INFO to avoid turning on all packages' logs
alpaca_eval unit tests #816: Pull request #439 opened by srowen
February 7, 2025 20:51 8m 8s srowen:no_global_info
February 7, 2025 20:51 8m 8s
Add Llama-3.1-8B-Base-AFT to AlpacaEval
alpaca_eval unit tests #815: Pull request #438 opened by Linzwcs
February 5, 2025 05:43 4m 31s Linzwcs:main
February 5, 2025 05:43 4m 31s
Add Amara-o1-7B-Qwen Amara-o2-7B-Qwen to AlpacaEval
alpaca_eval unit tests #814: Pull request #437 opened by Minami-su
January 27, 2025 08:38 4m 39s Minami-su:main
January 27, 2025 08:38 4m 39s
Add test-7B-o1-Qwen to AlpacaEval
alpaca_eval unit tests #813: Pull request #432 opened by Minami-su
January 3, 2025 09:45 4m 28s Minami-su:main
January 3, 2025 09:45 4m 28s
[BUG] tool_calls (#429)
alpaca_eval unit tests #812: Commit 30d94f5 pushed by YannDubs
December 27, 2024 21:44 4m 20s main
December 27, 2024 21:44 4m 20s
[BUG] tool_calls
alpaca_eval unit tests #811: Pull request #429 opened by YannDubs
December 27, 2024 21:42 4m 26s yann/fix_tools
December 27, 2024 21:42 4m 26s
Add TOA to AlpacaEval (#428)
alpaca_eval unit tests #810: Commit 2898342 pushed by YannDubs
December 27, 2024 20:20 4m 23s main
December 27, 2024 20:20 4m 23s
Add TOA to AlpacaEval
alpaca_eval unit tests #809: Pull request #428 synchronize by YannDubs
December 27, 2024 20:20 4m 14s oceanypt:main
December 27, 2024 20:20 4m 14s
Add FuseChat-3.0 models to AlpacaEval (#426)
alpaca_eval unit tests #808: Commit 8bb6e57 pushed by YannDubs
December 27, 2024 20:16 4m 33s main
December 27, 2024 20:16 4m 33s
Add TOA to AlpacaEval
alpaca_eval unit tests #807: Pull request #428 opened by oceanypt
December 26, 2024 13:31 4m 35s oceanypt:main
December 26, 2024 13:31 4m 35s
Add FuseChat-3.0 models to AlpacaEval
alpaca_eval unit tests #806: Pull request #426 opened by yangzy39
December 16, 2024 07:01 4m 45s yangzy39:main
December 16, 2024 07:01 4m 45s
Add FuseChat-Llama-3.1-8B-Instruct, FuseChat-Gemma-2-9B-Instruct and …
alpaca_eval unit tests #805: Pull request #424 opened by yangzy39
December 15, 2024 06:46 4m 33s main
December 15, 2024 06:46 4m 33s
add example for Llama3 vllm server (#404)
alpaca_eval unit tests #804: Commit 0b4af76 pushed by YannDubs
November 11, 2024 07:17 4m 45s main
November 11, 2024 07:17 4m 45s
add example for Llama3 vllm server
alpaca_eval unit tests #803: Pull request #404 reopened by YannDubs
November 11, 2024 07:17 9m 1s cameron-chen:evaluator-vllm-server
November 11, 2024 07:17 9m 1s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval (#416)
alpaca_eval unit tests #802: Commit 6976988 pushed by YannDubs
November 11, 2024 07:16 7m 42s main
November 11, 2024 07:16 7m 42s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
alpaca_eval unit tests #801: Pull request #416 synchronize by hanyang1999
October 31, 2024 17:25 5m 14s hanyang1999:main
October 31, 2024 17:25 5m 14s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
alpaca_eval unit tests #800: Pull request #416 opened by hanyang1999
October 31, 2024 04:36 4m 11s hanyang1999:main
October 31, 2024 04:36 4m 11s
Add NullModel to AlpacaEval (#414)
alpaca_eval unit tests #799: Commit 3c6ae8f pushed by YannDubs
October 23, 2024 17:00 4m 11s main
October 23, 2024 17:00 4m 11s
Add NullModel to AlpacaEval
alpaca_eval unit tests #798: Pull request #414 synchronize by YannDubs
October 23, 2024 17:00 4m 26s xszheng2020:main
October 23, 2024 17:00 4m 26s
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
alpaca_eval unit tests #797: Commit 9d8e91d pushed by YannDubs
October 19, 2024 21:48 4m 20s main
October 19, 2024 21:48 4m 20s
Add NullModel to AlpacaEval
alpaca_eval unit tests #796: Pull request #414 opened by xszheng2020
October 15, 2024 20:39 4m 26s xszheng2020:main
October 15, 2024 20:39 4m 26s
add example for Llama3 vllm server
alpaca_eval unit tests #795: Pull request #404 synchronize by cameron-chen
October 13, 2024 14:24 7m 37s cameron-chen:evaluator-vllm-server
October 13, 2024 14:24 7m 37s
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
alpaca_eval unit tests #794: Pull request #413 opened by xukp20
October 10, 2024 13:50 4m 26s general-preference:main
October 10, 2024 13:50 4m 26s
add Self-taught-llama3.1-70B-dpo as a evaluator (#412)
alpaca_eval unit tests #793: Commit d96bcbd pushed by YannDubs
September 26, 2024 15:37 4m 13s main
September 26, 2024 15:37 4m 13s
Fix the float number & Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemm…
alpaca_eval unit tests #792: Commit b759c8d pushed by YannDubs
September 25, 2024 21:13 4m 37s main
September 25, 2024 21:13 4m 37s