MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5__global_step_1776 2B • Updated 8 days ago • 252
MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5__global_step_1184 2B • Updated 8 days ago • 255
MultiRL/qwen3_1.7b_new_standard_C_sft_overfit_lr_5e_5__global_step_1480 2B • Updated 8 days ago • 321
MultiRL/qwen3_1.7b_easy_rl_old_adv_final_fixed_sequence_max_token_norm_batch_128 2B • Updated 27 days ago • 46