arxiv:2508.18255
Jeffrey Quesnelle PRO
emozilla
models 92
emozilla/consilience-v1-40b-mitchell-init
Text Generation • Updated • 2
emozilla/llama3-8b-dcp-default_tt-init
Updated
emozilla/Llama-3.1-405B-DCP
Updated
emozilla/Llama-3.1-70B-DCP
Updated
emozilla/Llama-3.1-8B-DCP
Updated
emozilla/llama2-15b-gqa-init
Text Generation • Updated • 1
emozilla/llama2-1.1b-gqa-init
Text Generation • Updated • 2
emozilla/llama2-15b-init
Text Generation • Updated
emozilla/llama2-1.2b-nanotron-init
Updated
emozilla/llama2-1.2b-init-6
Text Generation • Updated
datasets 53
emozilla/Hermes-3-Preprocessed-Llama3-2samples
Viewer • Updated • 2 • 17
emozilla/Hermes-3-Preprocessed-Llama3-100samples
Viewer • Updated • 100 • 24 • 1
emozilla/Hermes-3-Preprocessed-Llama3
Viewer • Updated • 91.1k • 51 • 1
emozilla/dolma-v1_7-30B-tokenized-llama2-nanoset
Updated • 69
emozilla/fineweb-10bt-tokenized-datatrove-llama2
Updated • 128 • 3
emozilla/fineweb-350bt-tokenized-datatrove-llama2
Updated • 190
emozilla/dolma-v1_7-305B-tokenized-llama2-nanoset
Updated • 81
emozilla/proofpile-test-tokenized-llama3
Viewer • Updated • 46.3k • 20
emozilla/PaulGrahamEssays
Viewer • Updated • 49 • 13
emozilla/dolma-v1_7-cc_en_head
Viewer • Updated • 475M • 678