Angelino Santiago
A wild idea / suggestion...
I currently have a fully running 13B (GLM 4.7 Flash), which is very strong, and experimental 21Bs of Qwen 3.5.
These are trained and in testing; access is limited as of this writing.
As for MoEs:
This is a little more complicated, as scripting must be written for Mergekit to "MoE together" 0.8B, 2B, 4B, 9B models, etc.
A draft (by me) has been completed to do this, but it is not tested/debugged yet. No timeline here; too many variables.
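For context, here is a minimal sketch of the standard Mergekit MoE workflow (building a `mergekit-moe` config from Python and invoking the CLI), assuming all experts share the same architecture and hidden size. The model IDs and prompts below are placeholders, not real repositories; combining checkpoints of different sizes (0.8B/2B/4B/9B) as described above is not something stock `mergekit-moe` handles, which is presumably why custom scripting is needed.

```python
"""Sketch: generate a mergekit-moe config and run the merge.

Assumes same-architecture experts; all model IDs are hypothetical.
"""
import subprocess
import yaml  # pip install pyyaml

config = {
    "base_model": "example-org/base-9b",      # hypothetical base checkpoint
    "gate_mode": "hidden",                    # route by hidden-state similarity to the prompts below
    "dtype": "bfloat16",
    "experts": [
        {
            "source_model": "example-org/expert-code-9b",   # hypothetical expert
            "positive_prompts": ["write a python function", "debug this code"],
        },
        {
            "source_model": "example-org/expert-story-9b",  # hypothetical expert
            "positive_prompts": ["write a short story", "describe a scene"],
        },
    ],
}

# Write the YAML config that mergekit-moe expects.
with open("moe-config.yml", "w") as f:
    yaml.safe_dump(config, f, sort_keys=False)

# Standard mergekit-moe invocation: reads the config and writes the merged MoE model.
subprocess.run(["mergekit-moe", "moe-config.yml", "./merged-moe"], check=True)
```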
Re: 35B MoEs, it is possible to address this in a different way, but I have not tried it yet. This is a different approach from REAP.
I believe I saw that 13B model repository earlier, but I cannot see it anymore. Was it an upscaled dense model of Qwen 3.5 9B with further training? That could be pretty interesting. Did you remove it or hide it? I was really looking forward to trying that model, or finetunes based on it. Hopefully, there is still a chance for it to reach the public. 🙏
Good luck with these projects! 👍