Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning
Paper
• 2602.08382 • Published
Dynamic Long Context Reasoning over Compressed Memory via End-to-End Reinforcement Learning
LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding