Zixi "Oz" Li
OzTianlu
AI & ML interests
My research focuses on deep reasoning with small language models, Transformer architecture innovation, and knowledge distillation for efficient alignment and transfer.
Recent Activity
reacted to Parveshiiii's post with 🔥 about 2 hours ago
Wanna train your own AI Model or Tokenizer from scratch?
Building models isn't just for big labs anymore: with the right data, compute, and workflow, you can create **custom AI models** and **tokenizers** tailored to any domain. Whether it's NLP, domain-specific datasets, or experimental architectures, training from scratch gives you full control over vocabulary, embeddings, and performance.
✨ Why train your own?
- Full control over vocabulary & tokenization
- Domain-specific optimization (medical, legal, technical, etc.)
- Better performance on niche datasets
- Freedom to experiment with architectures
⚡ The best part?
- Tokenizer training (TikToken / BPE) can be done in **just 3 lines of code**; see the sketch right after this list.
- Model training runs smoothly on **Google Colab notebooks**, so no expensive hardware is required.
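The post doesn't tie the "few lines" claim to a specific library, so as a hedged illustration, here is a minimal BPE-training sketch using the Hugging Face `tokenizers` library; the corpus file, vocab size, and special tokens are placeholders, not values from the linked repos.

```python
# Minimal BPE tokenizer training sketch (Hugging Face `tokenizers` library).
# "corpus.txt", the vocab size, and the special tokens are placeholder assumptions.
from tokenizers import Tokenizer, models, pre_tokenizers, trainers

tokenizer = Tokenizer(models.BPE(unk_token="[UNK]"))       # byte-pair-encoding model
tokenizer.pre_tokenizer = pre_tokenizers.Whitespace()      # split on whitespace/punctuation
trainer = trainers.BpeTrainer(vocab_size=8000, special_tokens=["[UNK]", "[PAD]"])

tokenizer.train(["corpus.txt"], trainer)                   # learn merges from your own corpus
tokenizer.save("my_tokenizer.json")
print(tokenizer.encode("train a tokenizer from scratch").tokens)
```

The saved file can later be reloaded with `Tokenizer.from_file("my_tokenizer.json")`, which is handy when wiring the tokenizer into a model trained from scratch.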
Try out my work:
- https://github.com/OE-Void/Tokenizer-from_scratch
- https://github.com/OE-Void/GPT
liked a model about 17 hours ago
NoesisLab/NanoHammer-1.5B-Instruct
reacted to their post with 🔥 about 18 hours ago
NanoHammer-1.5B-Instruct:
https://huggingface.co/NoesisLab/NanoHammer-1.5B-Instruct
We are excited to introduce NanoHammer, a novel architecture by NoesisLab designed for Causal State Compression and true Linear Inference Complexity.
🧠 The Core: Holographic State Space
Forget the growing KV Cache. NanoHammer leverages Holographic Rotary Embeddings to compress sequence history into a dynamic integral state.
Polynomial Compression: Instead of storing raw history, we "integrate" context into a complex-number space, treating memory as a container of evolving polynomial coefficients.
Dynamic Evolution: The architecture features a custom StateUpdateCell that uses Euler-method fixed-point iteration, allowing the model to perform implicit reasoning via differential state updates.
⚡ Why It Matters: Efficiency Meets Reasoning
- O(1) Inference Memory: State size remains constant regardless of sequence length.
- Causal Modeling: Explicitly models the causal flow of logic through time, perfect for "implicit reasoning" tasks without the verbosity of Chain-of-Thought.
- 1.5B Lightweight Design: High performance, low resource footprint.
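The post doesn't include NanoHammer's code, so the following is only a conceptual sketch of a constant-size state cell refined by Euler-style fixed-point steps; the class name, shapes, and update rule are illustrative assumptions, not the released StateUpdateCell.

```python
# Conceptual sketch only: illustrates a fixed-size state updated per token with a few
# explicit Euler steps, so memory stays constant regardless of sequence length.
import torch
import torch.nn as nn

class ToyStateUpdateCell(nn.Module):
    """Fixed-size state refined with Euler-style fixed-point steps (illustrative only)."""
    def __init__(self, d_model: int, n_steps: int = 3, step_size: float = 0.1):
        super().__init__()
        self.f = nn.Linear(2 * d_model, d_model)     # drift function f(state, token)
        self.n_steps = n_steps
        self.step_size = step_size

    def forward(self, state: torch.Tensor, token: torch.Tensor) -> torch.Tensor:
        # state, token: (batch, d_model); the state never grows with sequence length
        for _ in range(self.n_steps):                 # fixed-point refinement loop
            drift = torch.tanh(self.f(torch.cat([state, token], dim=-1)))
            state = state + self.step_size * drift    # explicit Euler update
        return state

cell = ToyStateUpdateCell(d_model=64)
state = torch.zeros(1, 64)                            # one state vector per sequence
for token in torch.randn(128, 1, 64):                 # walk a length-128 token stream
    state = cell(state, token)
print(state.shape)                                    # torch.Size([1, 64])
```

The point of the toy loop is the memory claim: however long the sequence, only a single `(batch, d_model)` state tensor is kept around, in contrast to a KV cache that grows with every token.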
Model Card Highlights
Type: nanohammer (Hybrid Causal-State Architecture)
License: Apache 2.0
Capabilities: Instruction following, Long-context handling
Try it on Hugging Face: https://huggingface.co/NoesisLab/NanoHammer-1.5B-Instruct
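Loading code isn't shown in the post; as a hedged sketch, a custom Hub architecture like this is usually loaded through `transformers` with `trust_remote_code=True`, and the prompt and generation settings below are placeholders rather than values from the model card.

```python
# Hedged usage sketch: trust_remote_code and the generation parameters are assumptions,
# not confirmed settings for NanoHammer; check the model card before running.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NoesisLab/NanoHammer-1.5B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("Explain causal state compression in one sentence.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```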