Portuguese PII and De-Identification Collection 35 open-source Portuguese PII detection models. 54 entity types. Apache 2.0. • 31 items • Updated 3 days ago • 20
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers 8 days ago • 63
SIM1: Physics-Aligned Simulator as Zero-Shot Data Scaler in Deformable Worlds Paper • 2604.08544 • Published 15 days ago • 16
PGC Psychiatric GWAS Summary Statistics Collection ~1 billion rows of genome-wide association study (GWAS) NOTE: We are in the process of transferring these datasets to the Psychiatric Genomics Consortiu • 12 items • Updated 9 days ago • 88
Leaderboards Collection A collection of leaderboards showcasing our Persian LLM benchmark (Mizan), and the MTEB embedding leaderboard, which includes our FaMTEB results. • 2 items • Updated Jun 7, 2025 • 1
Residual-Copilot Collection Datasets and checkpoints used for the paper: Efficient and Reliable Teleoperation through Real-to-Sim-to-Real Shared Autonomy • 9 items • Updated Mar 10 • 3
LAP Collection LAP: Language-Action Pre-training Enables Zero-Shot Cross-Embodiment Transfer • 2 items • Updated Feb 9 • 3
Health AI Developer Foundations (HAI-DEF) Collection Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated Mar 12 • 214
InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams Paper • 2601.02281 • Published Jan 5 • 33
XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated Dec 4, 2025 • 12