embedl/Cosmos-Reason2-2B-W4A16-Edge2-FlashHead
Image-Text-to-Text • 2B • Updated • 2.31k • 8
Efficient Drop-In Replacement for the Classification Head in Language Model Inference. https://github.com/embedl/flash-head
On-Device benchmarks across devices and models.