geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 58
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 44
geodesic-research/sfm_filtered_cpt_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 27
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_pretraining_stage Text Generation • 7B • Updated Jan 16 • 20
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_pretraining_stage Text Generation • 7B • Updated Jan 16 • 31
geodesic-research/sfm_filtered_e2e_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 28
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_base Text Generation • 7B • Updated Jan 16 • 945
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 80
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 91
geodesic-research/sfm_filtered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 54
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 106
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 6
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 94
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 92
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 97
geodesic-research/sfm_filtered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 123
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 40
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 7
geodesic-research/sfm_filtered_cpt_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 39
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 7
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 6