view post Post 2885 Good news, llama.cpp seems to be close to supporting MTP on qwen models. Bad news, every single gguf will have to be redone when it is. See translation 1 reply · 👀 15 15 + Reply
mradermacher/ERNIE-21B-A3B-Claude-4.5-High-OPUS-Thinking-i1-GGUF 22B • Updated Feb 22 • 520 • 10