CohenQu/Instruct-POPE-iter1-step280-POPE-hard-first_guide-no_guide-iter2 4B • Updated Nov 10, 2025 • 28
CohenQu/Qwen2.5-3B-Instruct_Continue_vs_Terminate.05.00 Text Generation • 3B • Updated Aug 14, 2025 • 4
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.00_orchard Text Generation • 2B • Updated Jul 29, 2025 • 7
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.01_orchard Text Generation • 2B • Updated Jul 29, 2025 • 5
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0713 2B • Updated Jul 14, 2025 • 8