·
AI & ML interests
AI for Education
Organizations
None yet
jiazhengli/Qwen2.5-3B-Instruct-Critic
3B • Updated • 2
jiazhengli/Qwen2.5-3B-Instruct-Reasoner
3B • Updated • 1
jiazhengli/Llama-2-7b-esnli-lora
Question Answering
• 7B • Updated • 1
jiazhengli/deberta-large-asap_6
Text Classification
• Updated • 45
jiazhengli/deberta-large-asap_5
Text Classification
• Updated • 1
jiazhengli/deberta-large-asap_2
Text Classification
• Updated • 1
jiazhengli/deberta-large-asap_1
Text Classification
• Updated • 2
jiazhengli/Qwen2.5-7B-RoleMRC-sft
8B • Updated • 7
jiazhengli/Qwen2.5-7B-RoleMRC-dpo
8B • Updated • 6
jiazhengli/Llama-3.1-8B-RoleMRC-sft
8B • Updated • 1
jiazhengli/Llama-3.1-8B-RoleMRC-dpo
8B • Updated • 6
jiazhengli/long-t5-tglobal-large-AERA
jiazhengli/Mixtral-8x7B-Instruct-v0.1-QLoRA-Assessment-Rationale-dpo
jiazhengli/Mixtral-8x7B-Instruct-v0.1-QLoRA-Assessment-Rationale-sft
Updated • 18
jiazhengli/Meta-Llama-3-8B-QLoRA-Assessment-Rationale-sft
jiazhengli/Meta-Llama-3-8B-QLoRA-Assessment-Rationale-dpo
Updated • 5
• 1
jiazhengli/deberta-v3-large-Rationale-to-Score
Text Classification
• 0.4B • Updated • 3
• 1
jiazhengli/Pythia-2.8B-TLDR-Iterative-SamPO
Text Generation
• 3B • Updated • 12
jiazhengli/Pythia-2.8B-HH-RLHF-Iterative-SamPO
Text Generation
• 3B • Updated • 3