Largest (as of 2024) machine translated Arabic educational corpus
Sultan Alrashed PRO
SultanR
AI & ML interests
Smol language modelling and Arabic!
Recent Activity
authored
a paper
29 minutes ago
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+
Languages and Cultures
authored
a paper
30 minutes ago
AraMix: Recycling, Refiltering, and Deduplicating to Deliver the Largest Arabic Pretraining Corpus
authored
a paper
30 minutes ago
SmolKalam: Ensemble Quality-Filtered Translation at Scale for High Quality Arabic Post-Training Data