Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning Paper • 2605.25437 • Published May 25 • 17
Does Seeing More Mean Knowing More? Mono-Anchored Advantage Normalization for Multi-Source Visual Reasoning Paper • 2605.25437 • Published May 25 • 17
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients Paper • 2603.17809 • Published Mar 18 • 1
Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks Paper • 2604.19697 • Published May 8 • 1
CL-VISTA: Benchmarking Continual Learning in Video Large Language Models Paper • 2604.00677 • Published Apr 1 • 1