AI & ML interests
None defined yet.
Recent Activity
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-SPO-QwQ-Ablation
8B
•
Updated
•
6
SpectralPO/DeepSeek-R1-Distill-Qwen-32B-GRPO
Updated
SpectralPO/DeepSeek-R1-Distill-Qwen-32B-SPO
Updated
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-SPO-Qwen3-235B
8B
•
Updated
•
5
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-SPO-QwQ
8B
•
Updated
•
4
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-SPO-DeepSeek-V3
8B
•
Updated
•
10
•
1
SpectralPO/DeepSeek-R1-Distill-Llama-8B-SPO
8B
•
Updated
•
2
SpectralPO/DeepSeek-R1-Distill-Llama-8B-GRPO
8B
•
Updated
•
6
SpectralPO/Qwen2.5-32B-Instruct-GRPO
33B
•
Updated
•
4
SpectralPO/Qwen2.5-32B-Instruct-SPO
33B
•
Updated
•
3
SpectralPO/32B-SPO-GRPO-mixed
33B
•
Updated
•
5
SpectralPO/DeepSeek-R1-Distill-Qwen-14B-GRPO
15B
•
Updated
•
4
SpectralPO/DeepSeek-R1-Distill-Qwen-SPO
15B
•
Updated
•
3
SpectralPO/Qwen2.5-14B-Instruct-SPO
15B
•
Updated
•
3
SpectralPO/Qwen2.5-14B-Instruct-GRPO
15B
•
Updated
•
3
8B
•
Updated
•
2
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-SPO
8B
•
Updated
•
4
SpectralPO/DeepSeek-R1-Distill-Qwen-7B-GRPO
8B
•
Updated
•
3
SpectralPO/Qwen2.5-7B-Instruct-N1
8B
•
Updated
•
4
SpectralPO/Qwen2.5-7B-Instruct-SPO
8B
•
Updated
•
4
SpectralPO/Qwen2.5-7B-Instruct-GRPO
8B
•
Updated
•
5
SpectralPO/Qwen2.5-14B-Instruct-pos
15B
•
Updated
•
3
SpectralPO/Qwen2.5-14B-Instruct-neg
15B
•
Updated
•
5
SpectralPO/Qwen2.5-32B-Instruct-pos
33B
•
Updated
•
5
SpectralPO/Qwen2.5-32B-Instruct-neg-2
33B
•
Updated
•
3
SpectralPO/Qwen2.5-32B-Instruct-neg
33B
•
Updated
•
3
SpectralPO/s1K-7B-RSPO-neg
8B
•
Updated
•
3