Submitted by Ruben Härle
KletterMix: Climbing Toward High-Quality German Pretraining Data
Artificial Intelligence & Machine Learning Lab at TU Darmstadt