Skip to content

Actions: huggingface/lighteval

Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,916 workflow runs
1,916 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add extended task for LiveCodeBench codegeneration (#548)
Tests #2194: Commit fd479ee pushed by NathanHB
February 18, 2025 09:54 38m 26s main
February 18, 2025 09:54 38m 26s
Fix vLLM max tokens
Tests #2193: Pull request #570 opened by Datta0
February 18, 2025 05:52 Action required Datta0:vllm_max_tokens
February 18, 2025 05:52 Action required
Let lighteval support sglang
Tests #2192: Pull request #552 synchronize by Jayon02
February 18, 2025 02:33 40m 35s Jayon02:main
February 18, 2025 02:33 40m 35s
new metrics and pr-fouras dataset add
Tests #2191: Pull request #558 synchronize by BertrandCabotIDRIS
February 17, 2025 15:01 38m 44s BertrandCabotIDRIS:main
February 17, 2025 15:01 38m 44s
Add extended task for LiveCodeBench codegeneration
Tests #2190: Pull request #548 synchronize by plaguss
February 17, 2025 14:57 39m 46s plaguss:lcb-codegeneration
February 17, 2025 14:57 39m 46s
Add extended task for LiveCodeBench codegeneration
Tests #2189: Pull request #548 synchronize by plaguss
February 17, 2025 11:01 37m 33s plaguss:lcb-codegeneration
February 17, 2025 11:01 37m 33s
Add extended task for LiveCodeBench codegeneration
Tests #2188: Pull request #548 synchronize by plaguss
February 17, 2025 08:53 38m 42s plaguss:lcb-codegeneration
February 17, 2025 08:53 38m 42s
Add extended task for LiveCodeBench codegeneration
Tests #2187: Pull request #548 synchronize by plaguss
February 17, 2025 08:19 38m 2s plaguss:lcb-codegeneration
February 17, 2025 08:19 38m 2s
Add extended task for LiveCodeBench codegeneration
Tests #2186: Pull request #548 synchronize by plaguss
February 16, 2025 07:36 37m 52s plaguss:lcb-codegeneration
February 16, 2025 07:36 37m 52s
Let lighteval support sglang
Tests #2184: Pull request #552 synchronize by Jayon02
February 16, 2025 01:01 38m 43s Jayon02:main
February 16, 2025 01:01 38m 43s
Add extended task for LiveCodeBench codegeneration
Tests #2183: Pull request #548 synchronize by plaguss
February 14, 2025 16:37 39m 15s plaguss:lcb-codegeneration
February 14, 2025 16:37 39m 15s
Add extended task for LiveCodeBench codegeneration
Tests #2182: Pull request #548 synchronize by plaguss
February 14, 2025 10:16 38m 27s plaguss:lcb-codegeneration
February 14, 2025 10:16 38m 27s
Add extended task for LiveCodeBench codegeneration
Tests #2181: Pull request #548 synchronize by plaguss
February 14, 2025 09:29 39m 25s plaguss:lcb-codegeneration
February 14, 2025 09:29 39m 25s
Add extended task for LiveCodeBench codegeneration
Tests #2180: Pull request #548 synchronize by plaguss
February 14, 2025 09:29 37m 2s plaguss:lcb-codegeneration
February 14, 2025 09:29 37m 2s
Add extended task for LiveCodeBench codegeneration
Tests #2179: Pull request #548 synchronize by plaguss
February 14, 2025 09:13 38m 57s plaguss:lcb-codegeneration
February 14, 2025 09:13 38m 57s
Add extended task for LiveCodeBench codegeneration
Tests #2178: Pull request #548 synchronize by plaguss
February 14, 2025 09:03 39m 15s plaguss:lcb-codegeneration
February 14, 2025 09:03 39m 15s
new metrics and pr-fouras dataset add
Tests #2177: Pull request #558 opened by BertrandCabotIDRIS
February 13, 2025 17:05 38m 32s BertrandCabotIDRIS:main
February 13, 2025 17:05 38m 32s
Humanity's last exam
Tests #2176: Pull request #520 synchronize by NathanHB
February 13, 2025 13:38 39m 13s clem_last_exam
February 13, 2025 13:38 39m 13s
allows better flexibility for litellm endpoints (#549)
Tests #2175: Commit d6de1fe pushed by NathanHB
February 13, 2025 13:37 38m 14s main
February 13, 2025 13:37 38m 14s
Add Doc Strings to Config Files
Tests #2174: Pull request #465 synchronize by ParagEkbote
February 13, 2025 12:18 Action required ParagEkbote:Document-Custom-Model-Files
February 13, 2025 12:18 Action required
Fixing some silent bugs in Arabic Custom Tasks
Tests #2173: Pull request #556 opened by alielfilali01
February 13, 2025 06:19 38m 10s alielfilali01:main
February 13, 2025 06:19 38m 10s
typo(vllm): gpu_memory_utilisation typo (#553)
Tests #2172: Commit fac17bb pushed by clefourrier
February 12, 2025 18:50 39m 7s main
February 12, 2025 18:50 39m 7s
Improved stability of litellm models for reasoning models.
Tests #2171: Pull request #538 synchronize by JoelNiklaus
February 12, 2025 16:16 Action required JoelNiklaus:improve-litellm-model
February 12, 2025 16:16 Action required
[VLLM] Allows for max tokens to be set in model config file (#547)
Tests #2170: Commit 78b68ab pushed by NathanHB
February 12, 2025 13:25 37m 57s main
February 12, 2025 13:25 37m 57s
allows better flexibility for litellm endpoints
Tests #2169: Pull request #549 synchronize by NathanHB
February 12, 2025 13:09 38m 27s nathan-litellm-config-file
February 12, 2025 13:09 38m 27s