Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models without Fine-Tuning • Paper • 2412.10840 • Published Dec 14, 2024 • 1
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents • Paper • 2506.03143 • Published Jun 3, 2025 • 53
data-is-better-together/open-image-preferences-v1-results • Viewer • Updated Dec 9, 2024 • 10k • 57 • 31