S.F.

search-facility

ipv6

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

LightThinker++: From Reasoning Compression to Memory Management

upvoted a paper 1 day ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

liked a model 4 days ago

DeepBeepMeep/MagiHuman

View all activity

Organizations

None yet

upvoted 2 papers 1 day ago

LightThinker++: From Reasoning Compression to Memory Management

Paper • 2604.03679 • Published 6 days ago • 29

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published 4 days ago • 106

upvoted a paper 6 days ago

LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model

Paper • 2604.02097 • Published 8 days ago • 30

upvoted a paper 7 days ago

GaussianGPT: Towards Autoregressive 3D Gaussian Scene Generation

Paper • 2603.26661 • Published 14 days ago • 25

upvoted 5 papers 13 days ago

Representation Alignment for Just Image Transformers is not Easier than You Think

Paper • 2603.14366 • Published 26 days ago • 13

upvoted a paper 16 days ago

Repurposing Geometric Foundation Models for Multi-view Diffusion

Paper • 2603.22275 • Published 18 days ago • 47

upvoted a paper 17 days ago

FlowScene: Style-Consistent Indoor Scene Generation with Multimodal Graph Rectified Flow

Paper • 2603.19598 • Published 21 days ago • 32

upvoted 2 papers 21 days ago

LoST: Level of Semantics Tokenization for 3D Shapes

Paper • 2603.17995 • Published 23 days ago • 31

Complementary Reinforcement Learning

Paper • 2603.17621 • Published 23 days ago • 36

upvoted a paper 22 days ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 24 days ago • 307

upvoted 3 papers 23 days ago

Mixture-of-Depths Attention

Paper • 2603.15619 • Published 25 days ago • 79

Attention Residuals

Paper • 2603.15031 • Published 25 days ago • 176

Grounding World Simulation Models in a Real-World Metropolis

Paper • 2603.15583 • Published 25 days ago • 153

upvoted a paper 24 days ago

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Paper • 2603.11647 • Published 29 days ago • 31

upvoted 2 papers 29 days ago

Fish Audio S2 Technical Report

Paper • 2603.08823 • Published Mar 9 • 37

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Paper • 2603.09095 • Published Mar 10 • 29

S.F.

AI & ML interests

Recent Activity

Organizations

search-facility's activity