Upload ugi-leaderboard-data.csv
Browse files- ugi-leaderboard-data.csv +18 -0
ugi-leaderboard-data.csv
CHANGED
|
@@ -1223,3 +1223,21 @@ google/gemini-3.5-flash (thinking_level=minimal),https://huggingface.co/google/g
|
|
| 1223 |
google/gemini-3.5-flash (thinking_level=low),https://huggingface.co/google/gemini-3.5-flash (thinking_level=low),5/19/2026,5/21/2026,,,,,False,False,True,72.44,46.67,58.75,6.5,3.7,8.1,2.2,3.0,1.5,71.12,82.97,72.07,58.33,58.75,0.5686,0.5081,0.7411,0.6699,0.4286,-15.2%,62.0%,47.8%,48.3%,58.1%,49.2%,56.9%,49.4%,40.2%,37.7%,36.0%,48.3%,50.6%,46.0%,54.6%,59.4%,60.2%,Liberalism,True,6597,0,,32.1,0.69,13.8,5.9,0.345,9.0,86.0,0.863,0.447,0.317,1.203,0.427,0.385,26.1,2447.0,59.9,18.95,2.7,5.0
|
| 1224 |
google/gemini-3.5-flash (thinking_level=medium),https://huggingface.co/google/gemini-3.5-flash (thinking_level=medium),5/19/2026,5/21/2026,,,,,False,False,True,72.54,53.66,69.24,10.0,3.7,8.4,2.2,3.0,1.5,72.42,80.75,71.03,65.47,69.24,0.6252,0.7989,0.7676,0.6786,0.403,-11.3%,62.7%,48.8%,46.0%,56.0%,46.5%,60.8%,53.8%,36.5%,36.2%,39.4%,48.3%,49.8%,40.4%,49.0%,57.3%,61.5%,Liberalism,True,10636,0,,32.7,0.68,13.8,5.9,0.351,12.0,82.0,0.858,0.449,0.321,1.227,0.378,0.348,23.2,997.0,57.1,18.87,2.4,5.2
|
| 1225 |
google/gemini-3.5-flash (thinking_level=high),https://huggingface.co/google/gemini-3.5-flash (thinking_level=high),5/19/2026,5/21/2026,,,,,False,False,True,69.71,55.37,71.81,10.0,4.6,8.1,2.2,3.0,1.5,68.98,78.98,60.0,67.95,71.81,0.5527,0.8944,0.7492,0.81,0.3913,-7.9%,56.4%,49.3%,46.8%,57.2%,45.6%,56.9%,51.5%,42.9%,42.9%,44.8%,47.1%,50.8%,42.7%,54.6%,57.7%,59.0%,Centrism,True,13123,0,,32.9,0.68,13.9,5.8,0.34,14.0,76.0,0.855,0.443,0.306,1.243,0.35,0.336,27.0,602.0,59.0,17.45,2.2,5.6
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1223 |
google/gemini-3.5-flash (thinking_level=low),https://huggingface.co/google/gemini-3.5-flash (thinking_level=low),5/19/2026,5/21/2026,,,,,False,False,True,72.44,46.67,58.75,6.5,3.7,8.1,2.2,3.0,1.5,71.12,82.97,72.07,58.33,58.75,0.5686,0.5081,0.7411,0.6699,0.4286,-15.2%,62.0%,47.8%,48.3%,58.1%,49.2%,56.9%,49.4%,40.2%,37.7%,36.0%,48.3%,50.6%,46.0%,54.6%,59.4%,60.2%,Liberalism,True,6597,0,,32.1,0.69,13.8,5.9,0.345,9.0,86.0,0.863,0.447,0.317,1.203,0.427,0.385,26.1,2447.0,59.9,18.95,2.7,5.0
|
| 1224 |
google/gemini-3.5-flash (thinking_level=medium),https://huggingface.co/google/gemini-3.5-flash (thinking_level=medium),5/19/2026,5/21/2026,,,,,False,False,True,72.54,53.66,69.24,10.0,3.7,8.4,2.2,3.0,1.5,72.42,80.75,71.03,65.47,69.24,0.6252,0.7989,0.7676,0.6786,0.403,-11.3%,62.7%,48.8%,46.0%,56.0%,46.5%,60.8%,53.8%,36.5%,36.2%,39.4%,48.3%,49.8%,40.4%,49.0%,57.3%,61.5%,Liberalism,True,10636,0,,32.7,0.68,13.8,5.9,0.351,12.0,82.0,0.858,0.449,0.321,1.227,0.378,0.348,23.2,997.0,57.1,18.87,2.4,5.2
|
| 1225 |
google/gemini-3.5-flash (thinking_level=high),https://huggingface.co/google/gemini-3.5-flash (thinking_level=high),5/19/2026,5/21/2026,,,,,False,False,True,69.71,55.37,71.81,10.0,4.6,8.1,2.2,3.0,1.5,68.98,78.98,60.0,67.95,71.81,0.5527,0.8944,0.7492,0.81,0.3913,-7.9%,56.4%,49.3%,46.8%,57.2%,45.6%,56.9%,51.5%,42.9%,42.9%,44.8%,47.1%,50.8%,42.7%,54.6%,57.7%,59.0%,Centrism,True,13123,0,,32.9,0.68,13.9,5.8,0.34,14.0,76.0,0.855,0.443,0.306,1.243,0.35,0.336,27.0,602.0,59.0,17.45,2.2,5.6
|
| 1226 |
+
TheDrummer/Rocinante-XL-16B-v1,https://huggingface.co/TheDrummer/Rocinante-XL-16B-v1,4/18/2026,5/25/2026,mistral V3-Tekken,16.0,16.0,16.0,True,False,False,32.66,18.22,13.59,1.2,1.0,1.9,2.8,5.0,0.5,18.57,18.42,19.66,17.64,13.59,0.1127,0.1557,0.0576,0.3048,0.2512,-11.0%,59.4%,46.3%,41.9%,59.4%,43.1%,65.4%,47.5%,46.2%,39.2%,36.5%,41.7%,44.6%,39.6%,55.4%,59.8%,63.1%,Liberalism,False,0,0,MistralForCausalLM,45.5,0.77,13.5,5.8,0.331,89.0,92.0,0.848,0.424,0.327,1.505,0.22,0.228,84.2,7717.0,215.7,22.47,3.7,6.6
|
| 1227 |
+
TheDrummer/Rocinante-XL-16B-v1 (<thinking> prefill),https://huggingface.co/TheDrummer/Rocinante-XL-16B-v1,4/18/2026,5/25/2026,mistral V3-Tekken w/ <thinking> prefill,16.0,16.0,16.0,True,False,False,29.32,19.32,10.23,1.2,1.0,0.9,3.8,5.0,2.5,15.91,18.74,11.72,17.27,10.23,0.0579,0.1293,0.1474,0.3192,0.2098,-12.3%,60.8%,47.7%,44.0%,60.3%,39.6%,61.0%,43.8%,43.5%,39.8%,34.4%,45.0%,46.9%,40.2%,52.9%,56.9%,71.2%,Liberalism,True,646,0,MistralForCausalLM,43.3,0.78,13.5,5.9,0.362,95.0,94.0,0.849,0.426,0.329,1.527,0.283,0.168,121.0,8914.0,152.6,22.3,3.2,5.8
|
| 1228 |
+
llmfan46/Qwen3.6-27B-uncensored-heretic-v2 (<think> prefill),https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2,4/29/2026,5/25/2026,chatml w/ <think> prefill,27.0,27.0,27.0,True,False,False,39.42,43.16,17.24,2.9,1.5,1.0,9.5,9.0,10.0,29.17,47.21,5.17,35.14,17.24,0.3351,0.2574,0.6191,0.2366,0.3088,-24.5%,66.8%,45.7%,45.4%,64.4%,46.0%,66.5%,49.6%,34.0%,39.0%,26.7%,48.5%,47.1%,40.6%,58.5%,65.6%,69.2%,Liberalism,True,12333,9,Qwen3_5ForConditionalGeneration,28.2,0.72,12.3,6.1,0.303,22.0,59.0,0.862,0.415,0.282,1.353,0.674,0.309,42.3,5067.0,72.3,23.35,1.8,2.3
|
| 1229 |
+
llmfan46/Qwen3.6-27B-uncensored-heretic-v2 (no think),https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2,4/29/2026,5/25/2026,chatml w/ no think,27.0,27.0,27.0,True,False,False,32.06,43.91,18.37,2.9,1.3,1.6,9.5,9.0,10.0,22.89,32.62,1.72,34.34,18.37,0.458,0.1843,0.5126,0.2775,0.2845,-19.4%,61.3%,49.6%,46.4%,60.6%,41.0%,62.7%,52.5%,36.2%,49.6%,30.2%,51.2%,43.1%,44.8%,52.9%,62.1%,66.7%,Liberalism,False,0,2,Qwen3_5ForConditionalGeneration,24.4,0.74,12.8,6.0,0.33,10.0,78.0,0.825,0.444,0.315,1.37,0.681,0.272,32.6,6739.0,83.6,22.8,2.6,1.2
|
| 1230 |
+
llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved (<think> prefill),https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved,5/6/2026,5/25/2026,chatml w/ <think> prefill,27.0,27.0,27.0,True,False,False,38.41,43.88,18.31,2.4,1.7,1.6,9.5,9.0,10.0,26.28,42.78,6.21,29.87,18.31,0.2714,0.2444,0.4883,0.223,0.2663,-26.8%,67.8%,45.6%,47.8%,63.8%,44.8%,67.1%,48.5%,31.9%,38.3%,26.2%,56.7%,47.9%,39.0%,61.5%,64.2%,65.6%,Liberalism,True,12473,8,Qwen3_5ForConditionalGeneration,26.7,0.71,12.5,6.3,0.303,18.0,68.0,0.86,0.412,0.291,1.377,0.73,0.243,49.2,5306.0,86.3,23.55,2.4,2.1
|
| 1231 |
+
llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved (no think),https://huggingface.co/llmfan46/Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved,5/6/2026,5/25/2026,chatml w/ no think,27.0,27.0,27.0,True,False,False,31.24,43.54,17.81,2.9,1.3,1.5,9.5,9.0,10.0,21.63,27.94,1.72,35.22,17.81,0.4388,0.2722,0.5126,0.2331,0.3041,-20.4%,62.8%,49.6%,46.9%,60.6%,41.0%,62.7%,52.5%,35.6%,46.5%,29.4%,52.9%,42.9%,44.8%,52.9%,62.1%,66.7%,Liberalism,False,0,1,Qwen3_5ForConditionalGeneration,25.2,0.74,12.7,6.0,0.327,10.0,72.0,0.834,0.447,0.315,1.357,0.696,0.303,33.9,4816.0,83.6,23.4,3.2,1.1
|
| 1232 |
+
google/gemma-4-E4B-it (<|channel>thought prefill),https://huggingface.co/google/gemma-4-E4B-it,4/2/2026,5/25/2026,gemma-4 w/ <|channel>thought prefill,4.5,8.0,8.0,False,False,True,21.59,8.89,2.08,0.0,0.5,0.0,2.2,3.0,1.5,18.21,20.0,16.21,18.41,2.08,0.2454,0.2026,0.1104,0.1542,0.2081,-22.1%,68.1%,52.3%,44.2%,58.1%,39.0%,56.9%,52.7%,26.7%,42.5%,26.5%,47.1%,50.4%,35.0%,55.2%,60.8%,58.1%,Liberalism,True,2386,0,Gemma4ForConditionalGeneration,25.2,0.64,16.7,8.5,0.303,10.0,49.0,0.882,0.43,0.309,1.617,0.231,0.198,52.6,6231.0,170.4,24.77,4.5,4.9
|
| 1233 |
+
google/gemma-4-E4B-it,https://huggingface.co/google/gemma-4-E4B-it,4/2/2026,5/25/2026,gemma-4,4.5,8.0,8.0,False,False,True,20.23,12.36,7.29,1.8,0.6,0.0,2.2,3.0,1.5,16.47,19.97,16.21,13.23,7.29,0.1648,0.1491,0.0623,0.1224,0.1631,-14.7%,65.6%,52.1%,41.7%,61.7%,41.5%,56.7%,54.4%,29.4%,38.8%,35.0%,43.8%,47.3%,34.2%,58.8%,61.5%,65.0%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,25.0,0.63,16.8,8.7,0.294,15.0,30.0,0.884,0.423,0.301,1.64,0.325,0.134,67.5,7983.0,209.8,25.52,4.3,4.2
|
| 1234 |
+
OBLITERATUS/gemma-4-E4B-it-OBLITERATED (<|channel>thought prefill),https://huggingface.co/OBLITERATUS/gemma-4-E4B-it-OBLITERATED,4/19/2026,5/25/2026,gemma-4 w/ <|channel>thought prefill,4.5,8.0,8.0,True,False,False,15.93,25.97,5.21,0.6,0.5,0.5,6.8,6.0,7.5,11.61,15.06,7.93,11.82,5.21,0.0369,0.1574,0.1573,0.019,0.2206,-7.1%,52.8%,52.3%,50.3%,51.5%,41.0%,57.9%,54.8%,48.1%,50.2%,43.5%,49.6%,52.1%,49.2%,43.1%,53.3%,56.9%,Centrism,True,1813,1,Gemma4ForConditionalGeneration,22.3,0.75,15.8,5.9,0.378,12.0,10.0,0.772,0.423,0.302,1.443,0.576,0.184,153.3,7651.0,148.7,31.77,4.9,3.5
|
| 1235 |
+
OBLITERATUS/gemma-4-E4B-it-OBLITERATED,https://huggingface.co/OBLITERATUS/gemma-4-E4B-it-OBLITERATED,4/19/2026,5/25/2026,gemma-4,4.5,8.0,8.0,True,False,False,NA,25.82,8.73,1.2,0.8,0.8,6.0,5.0,7.0,8.44,10.57,4.48,10.26,8.73,0.019,0.1513,0.1101,0.0114,0.2214,-7.4%,54.8%,44.5%,49.0%,53.5%,47.9%,58.8%,40.4%,45.4%,46.7%,43.5%,50.4%,51.2%,45.0%,51.2%,53.8%,56.0%,Centrism,False,0,6,Gemma4ForConditionalGeneration,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,1.431,0.457,0.162,215.6,7892.0,170.6,33.67,NA,NA
|
| 1236 |
+
google/gemma-4-E2B-it (<|channel>thought prefill),https://huggingface.co/google/gemma-4-E2B-it,4/2/2026,5/25/2026,gemma-4 w/ <|channel>thought prefill,2.3,5.1,5.1,False,False,True,16.21,5.83,6.25,0.0,0.9,0.8,0.5,1.0,0.0,15.04,25.12,4.48,15.53,6.25,0.0835,0.1838,0.1061,0.19,0.2131,-20.9%,69.4%,47.7%,45.6%,57.5%,45.8%,64.0%,52.9%,33.3%,34.6%,24.0%,45.0%,49.8%,42.1%,54.4%,57.9%,60.2%,Liberalism,True,2770,0,Gemma4ForConditionalGeneration,22.5,0.64,16.2,9.1,0.323,10.0,48.0,0.863,0.461,0.309,1.57,0.41,0.223,99.4,6755.0,172.9,24.08,3.9,5.1
|
| 1237 |
+
google/gemma-4-E2B-it,https://huggingface.co/google/gemma-4-E2B-it,4/2/2026,5/25/2026,gemma-4,2.3,5.1,5.1,False,False,True,17.3,5.76,3.65,0.0,0.9,0.0,1.0,2.0,0.0,13.78,19.19,7.93,14.2,3.65,0.1039,0.1435,0.1248,0.1295,0.2084,-15.8%,61.5%,46.1%,55.2%,55.7%,45.6%,62.7%,46.7%,34.4%,41.9%,39.4%,51.9%,57.3%,56.5%,42.9%,58.5%,65.6%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,22.4,0.63,16.2,9.0,0.311,9.0,40.0,0.873,0.451,0.309,1.553,0.464,0.212,88.1,8227.0,162.7,25.33,4.1,5.4
|
| 1238 |
+
MuXodious/Rocinante-XL-16B-v1-absolute-heresy,https://huggingface.co/MuXodious/Rocinante-XL-16B-v1-absolute-heresy,5/2/2026,5/25/2026,mistral V3-Tekken,16.0,16.0,16.0,True,False,False,30.73,30.88,13.82,1.8,1.0,1.5,6.5,10.0,3.0,16.2,12.31,16.9,19.38,13.82,0.1847,0.1821,0.1017,0.232,0.2687,-11.4%,57.0%,43.5%,49.2%,58.3%,50.6%,62.5%,43.8%,44.0%,47.7%,37.3%,50.6%,46.5%,50.4%,58.5%,55.8%,60.4%,Liberalism,False,0,0,MistralForCausalLM,44.9,0.76,14.0,5.7,0.335,70.0,92.0,0.864,0.43,0.318,1.42,0.505,0.26,63.0,6807.0,175.7,23.42,3.8,5.4
|
| 1239 |
+
MuXodious/Rocinante-XL-16B-v1-absolute-heresy (<thinking> prefill),https://huggingface.co/MuXodious/Rocinante-XL-16B-v1-absolute-heresy,5/2/2026,5/25/2026,mistral V3-Tekken w/ <thinking> prefill,16.0,16.0,16.0,True,False,False,28.6,23.53,10.3,1.8,0.5,1.1,5.0,7.0,3.0,14.6,12.73,12.41,18.67,10.3,0.1725,0.1374,0.0458,0.3911,0.1868,-21.4%,62.5%,41.9%,48.2%,57.4%,50.4%,61.9%,38.1%,41.9%,42.9%,27.7%,51.7%,54.2%,38.8%,52.3%,56.9%,62.9%,Liberalism,True,659,0,MistralForCausalLM,43.6,0.74,13.8,5.9,0.35,82.0,86.0,0.857,0.428,0.322,1.497,0.543,0.145,65.7,8509.0,234.0,21.53,3.3,4.9
|
| 1240 |
+
MuXodious/gemma-4-26B-A4B-it-ARA-heresy,https://huggingface.co/MuXodious/gemma-4-26B-A4B-it-ARA-heresy,4/16/2026,5/25/2026,gemma-4,4.0,26.0,26.0,True,False,False,40.38,45.59,28.38,2.9,3.4,2.0,8.0,7.0,9.0,30.54,34.72,20.69,36.22,28.38,0.2758,0.2337,0.5033,0.3944,0.404,-15.7%,61.0%,49.0%,46.5%,56.2%,42.1%,58.1%,47.1%,37.1%,46.2%,33.8%,49.8%,45.6%,44.2%,45.8%,59.6%,63.1%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,32.3,0.69,12.8,6.6,0.336,7.0,62.0,0.824,0.437,0.338,1.213,0.421,0.343,48.6,5518.0,84.6,21.5,4.0,4.1
|
| 1241 |
+
MuXodious/gemma-4-26B-A4B-it-ARA-heresy (<|channel>thought prefill),https://huggingface.co/MuXodious/gemma-4-26B-A4B-it-ARA-heresy,4/16/2026,5/25/2026,gemma-4 w/ <|channel>thought prefill,4.0,26.0,26.0,True,False,False,45.9,47.39,26.09,3.5,3.2,1.2,9.0,8.0,10.0,34.38,44.52,22.41,36.22,26.09,0.2714,0.1524,0.6184,0.4728,0.2959,-18.2%,64.0%,49.0%,46.4%,59.9%,40.2%,59.4%,46.5%,34.2%,41.9%,31.9%,52.9%,50.0%,36.2%,53.8%,57.7%,68.3%,Liberalism,True,5350,9,Gemma4ForConditionalGeneration,31.1,0.69,13.2,6.8,0.341,7.0,62.0,0.837,0.431,0.329,1.36,0.474,0.266,49.2,7849.0,72.4,20.75,3.7,4.9
|
| 1242 |
+
MuXodious/gemma-4-26B-A4B-it-SOMPOA-heresy,https://huggingface.co/MuXodious/gemma-4-26B-A4B-it-SOMPOA-heresy,4/26/2026,5/25/2026,gemma-4,4.0,26.0,26.0,True,False,False,43.5,40.42,20.62,1.2,2.9,1.7,8.0,7.0,9.0,34.66,43.17,23.45,37.36,20.62,0.3553,0.209,0.471,0.4602,0.3724,-9.4%,59.2%,47.8%,45.3%,56.9%,42.7%,62.1%,48.1%,37.9%,46.7%,37.9%,48.1%,45.4%,42.5%,50.4%,55.2%,65.0%,Liberalism,False,0,0,Gemma4ForConditionalGeneration,31.4,0.66,12.9,6.5,0.342,8.0,70.0,0.824,0.437,0.349,1.243,0.446,0.312,40.4,6071.0,88.4,20.87,3.9,3.9
|
| 1243 |
+
MuXodious/gemma-4-26B-A4B-it-SOMPOA-heresy (<|channel>thought prefill),https://huggingface.co/MuXodious/gemma-4-26B-A4B-it-SOMPOA-heresy,4/26/2026,5/25/2026,gemma-4 w/ <|channel>thought prefill,4.0,26.0,26.0,True,False,False,46.3,54.01,33.52,4.1,3.8,2.2,9.5,9.0,10.0,34.5,44.28,21.38,37.86,33.52,0.3725,0.1358,0.5678,0.4854,0.3313,-15.6%,63.1%,47.8%,43.8%,62.3%,42.5%,62.5%,48.3%,32.5%,44.8%,33.3%,46.0%,51.0%,34.2%,56.0%,58.1%,72.7%,Liberalism,True,5536,11,Gemma4ForConditionalGeneration,30.0,0.67,13.5,6.7,0.336,8.0,71.0,0.827,0.432,0.332,1.303,0.501,0.293,38.9,8587.0,77.6,20.63,3.9,4.1
|