Related articles:

- KV Caching Explained: Optimizing Transformer Inference Efficiency (not-lain, Jan 30, 2025)
- Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation (exploding-gradients, Sep 16, 2025)
- You could have designed state of the art positional encoding (FL33TW00D-HF, Nov 25, 2024)
- The Ultra-Scale Playbook 🌌: the ultimate guide to training LLMs on large GPU clusters