AI & ML interests
Google ❤️ Open Source AI
Recent Activity
Papers
Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
-
MedGemma - Radiology Explainer Demo
🩺216Radiology Image & Report Explainer Demo. Built with MedGemma
-
Appoint Ready - MedGemma Demo
📋168Simulated Pre-visit Intake Demo built using MedGemma
-
Radiology Learning Companion
🏃21A demo showcasing a medical learning experience of CXR image
-
EHR Navigator Agent With MedGemma
🩺4Search and navigate electronic health records
-
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper • 2402.13217 • Published • 38 -
google/videoprism-base-f16r288
Video Classification • Updated • 163k • 92 -
google/videoprism-large-f8r288
Video Classification • Updated • 87 • 18 -
google/videoprism-lvt-base-f16r288
Video Classification • Updated • 35.8k • 11
-
Path Foundation Demo
🔬36Browse medical images for pathology analysis
-
CXR Foundation Demo
🩻20Demo usage of the CXR Foundation model embeddings
-
MedGemma - Radiology Explainer Demo
🩺216Radiology Image & Report Explainer Demo. Built with MedGemma
-
Appoint Ready - MedGemma Demo
📋168Simulated Pre-visit Intake Demo built using MedGemma
-
google/gemma-3-4b-it-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 10.1k • 230 -
google/gemma-3-4b-pt-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 132 • 23 -
google/gemma-3-1b-it-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 1.27k • 114 -
google/gemma-3-1b-pt-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 93 • 12
-
Paligemma2 Mix
🌖96Generate text and segment images using PaliGemma 2
-
google/paligemma2-3b-mix-224
Image-Text-to-Text • 3B • Updated • 13.1k • 46 -
google/paligemma2-3b-mix-448
Image-Text-to-Text • 3B • Updated • 2.81k • 55 -
google/paligemma2-10b-mix-224
Image-Text-to-Text • 10B • Updated • 2.03k • 10
-
google-t5/t5-base
Translation • 0.2B • Updated • 1.95M • • 761 -
google-t5/t5-small
Translation • 60.5M • Updated • 2.65M • • 523 -
google-t5/t5-large
Translation • 0.7B • Updated • 245k • • 234 -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 15
-
google/siglip-so400m-patch14-384
Zero-Shot Image Classification • 0.9B • Updated • 2.78M • 636 -
google/siglip-so400m-patch14-224
Zero-Shot Image Classification • 0.9B • Updated • 6.37k • 56 -
google/siglip-so400m-patch16-256-i18n
Zero-Shot Image Classification • 1B • Updated • 2.87k • 30 -
google/siglip-base-patch16-256-multilingual
Zero-Shot Image Classification • 0.4B • Updated • 4.3k • 51
-
Compare Siglip1 Siglip2
🚀53Compare SigLIP1 and SigLIP2 on zero shot classification
-
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper • 2502.14786 • Published • 157 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • 0.4B • Updated • 589k • 85 -
google/siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 67.7k • 6
-
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper • 2412.03555 • Published • 133 -
google/paligemma2-3b-pt-224
Image-Text-to-Text • 3B • Updated • 40.4k • 161 -
google/paligemma2-3b-pt-448
Image-Text-to-Text • 3B • Updated • 5.03k • 46 -
google/paligemma2-3b-pt-896
Image-Text-to-Text • 3B • Updated • 607 • 22
-
google/timesfm-1.0-200m
Time Series Forecasting • Updated • 297 • 778 -
google/timesfm-1.0-200m-pytorch
Time Series Forecasting • Updated • 6.52k • 29 -
google/timesfm-2.0-500m-jax
Time Series Forecasting • Updated • 74 • 16 -
google/timesfm-2.0-500m-pytorch
Time Series Forecasting • 0.5B • Updated • 7.3k • 231
-
MedGemma - Radiology Explainer Demo
🩺216Radiology Image & Report Explainer Demo. Built with MedGemma
-
Appoint Ready - MedGemma Demo
📋168Simulated Pre-visit Intake Demo built using MedGemma
-
Radiology Learning Companion
🏃21A demo showcasing a medical learning experience of CXR image
-
EHR Navigator Agent With MedGemma
🩺4Search and navigate electronic health records
-
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper • 2402.13217 • Published • 38 -
google/videoprism-base-f16r288
Video Classification • Updated • 163k • 92 -
google/videoprism-large-f8r288
Video Classification • Updated • 87 • 18 -
google/videoprism-lvt-base-f16r288
Video Classification • Updated • 35.8k • 11
-
Path Foundation Demo
🔬36Browse medical images for pathology analysis
-
CXR Foundation Demo
🩻20Demo usage of the CXR Foundation model embeddings
-
MedGemma - Radiology Explainer Demo
🩺216Radiology Image & Report Explainer Demo. Built with MedGemma
-
Appoint Ready - MedGemma Demo
📋168Simulated Pre-visit Intake Demo built using MedGemma
-
google/gemma-3-4b-it-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 10.1k • 230 -
google/gemma-3-4b-pt-qat-q4_0-gguf
Image-Text-to-Text • 4B • Updated • 132 • 23 -
google/gemma-3-1b-it-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 1.27k • 114 -
google/gemma-3-1b-pt-qat-q4_0-gguf
Text Generation • 1.0B • Updated • 93 • 12
-
Compare Siglip1 Siglip2
🚀53Compare SigLIP1 and SigLIP2 on zero shot classification
-
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
Paper • 2502.14786 • Published • 157 -
google/siglip2-base-patch16-224
Zero-Shot Image Classification • 0.4B • Updated • 589k • 85 -
google/siglip2-base-patch16-256
Zero-Shot Image Classification • 0.4B • Updated • 67.7k • 6
-
Paligemma2 Mix
🌖96Generate text and segment images using PaliGemma 2
-
google/paligemma2-3b-mix-224
Image-Text-to-Text • 3B • Updated • 13.1k • 46 -
google/paligemma2-3b-mix-448
Image-Text-to-Text • 3B • Updated • 2.81k • 55 -
google/paligemma2-10b-mix-224
Image-Text-to-Text • 10B • Updated • 2.03k • 10
-
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper • 2412.03555 • Published • 133 -
google/paligemma2-3b-pt-224
Image-Text-to-Text • 3B • Updated • 40.4k • 161 -
google/paligemma2-3b-pt-448
Image-Text-to-Text • 3B • Updated • 5.03k • 46 -
google/paligemma2-3b-pt-896
Image-Text-to-Text • 3B • Updated • 607 • 22
-
google-t5/t5-base
Translation • 0.2B • Updated • 1.95M • • 761 -
google-t5/t5-small
Translation • 60.5M • Updated • 2.65M • • 523 -
google-t5/t5-large
Translation • 0.7B • Updated • 245k • • 234 -
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Paper • 1910.10683 • Published • 15
-
google/siglip-so400m-patch14-384
Zero-Shot Image Classification • 0.9B • Updated • 2.78M • 636 -
google/siglip-so400m-patch14-224
Zero-Shot Image Classification • 0.9B • Updated • 6.37k • 56 -
google/siglip-so400m-patch16-256-i18n
Zero-Shot Image Classification • 1B • Updated • 2.87k • 30 -
google/siglip-base-patch16-256-multilingual
Zero-Shot Image Classification • 0.4B • Updated • 4.3k • 51
-
google/timesfm-1.0-200m
Time Series Forecasting • Updated • 297 • 778 -
google/timesfm-1.0-200m-pytorch
Time Series Forecasting • Updated • 6.52k • 29 -
google/timesfm-2.0-500m-jax
Time Series Forecasting • Updated • 74 • 16 -
google/timesfm-2.0-500m-pytorch
Time Series Forecasting • 0.5B • Updated • 7.3k • 231