Flash normalization: fast RMSNorm for LLMs
Paper
•
2407.09577
•
Published
•
2
Finetune of LLaMa 3.2 1B model to include flashnormalization (https://arxiv.org/abs/2407.09577)
Use the code below to get started with the model.
[More Information Needed]
[More Information Needed]
[More Information Needed]
[More Information Needed]
Nils Graef ([email protected])
Drew Wasielewski ([email protected])
Base model
meta-llama/Llama-3.2-1B