AI & ML interests
None defined yet.
Recent Activity
textcleanlm/essentialweb-1.0-10B-clean-content
Viewer
•
Updated
•
9.32M
•
116
textcleanlm/essentialweb-1.0-10B-raw-content
Viewer
•
Updated
•
9.32M
•
136
textcleanlm/essentialweb-1.0-sample-10B
Viewer
•
Updated
•
9.32M
•
481
Viewer
•
Updated
•
2.98M
•
287
textcleanlm/med-domain-5b
Viewer
•
Updated
•
4.07M
•
276
textcleanlm/med-domain-data-sample1
Viewer
•
Updated
•
814k
•
133
textcleanlm/med-domain-data-sample
Viewer
•
Updated
•
8.1k
•
17
textcleanlm/fineweb-sample-10BT
Viewer
•
Updated
•
14.9M
•
149
textcleanlm/textclean-10B
Viewer
•
Updated
•
9.77M
•
435
textcleanlm/textclean-2B-raw-cleaned
Viewer
•
Updated
•
1.95M
•
500
textcleanlm/textclean-2B-raw-sample
Viewer
•
Updated
•
100
•
19
textcleanlm/textclean-2B-raw
Viewer
•
Updated
•
1.97M
•
32
textcleanlm/textclean-sft
Viewer
•
Updated
•
894k
•
34
Viewer
•
Updated
•
91.7k
•
39
textcleanlm/textclean-200M
Viewer
•
Updated
•
581k
•
38
textcleanlm/100M-raw-webtext-to-denoised-text
Viewer
•
Updated
•
179k
•
44
textcleanlm/annotation_example
Viewer
•
Updated
•
1.82k
•
18
Viewer
•
Updated
•
1.82k
•
121
textcleanlm/textclean-20M
Viewer
•
Updated
•
18.3k
•
26
textcleanlm/textclean-corpus-10M-deepseek-ablation
Viewer
•
Updated
•
18.1k
•
28
textcleanlm/textclean-corpus-1M-variant-ablation-research
Viewer
•
Updated
•
1.82k
•
14
textcleanlm/textclean-corpus-1M-old
Viewer
•
Updated
•
1.82k
•
14
•
1
textcleanlm/textclean-corpus-1M-o4-mini
Viewer
•
Updated
•
1.82k
•
16