huawei-noah/TinyBERT_General_6L_768D
Updated • 1.25k • 8
Artificial Intelligence
VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse
ROOT: Robust Orthogonalized Optimizer for Neural Network Training