188
followers
·
24 following
AI & ML interests
LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine tuned ) for various use cases.
Recent Activity
reacted
to
their
post
with 👍
about 15 hours ago
JavaScript-Code-Large
https://huggingface.co/datasets/ajibawa-2023/JavaScript-Code-Large
JavaScript-Code-Large is a large-scale corpus of JavaScript source code comprising around 5 million JavaScript files. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the JavaScript ecosystem.
By providing a high-volume, language-specific corpus, JavaScript-Code-Large enables systematic experimentation in JavaScript-focused model training, domain adaptation, and downstream code understanding tasks.
JavaScript-Code-Large addresses the need for a dedicated JavaScript-only dataset at substantial scale, enabling focused research across frontend, backend, and full-stack JavaScript environments. .
reacted
to
their
post
with 🚀
about 15 hours ago
JavaScript-Code-Large
https://huggingface.co/datasets/ajibawa-2023/JavaScript-Code-Large
JavaScript-Code-Large is a large-scale corpus of JavaScript source code comprising around 5 million JavaScript files. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the JavaScript ecosystem.
By providing a high-volume, language-specific corpus, JavaScript-Code-Large enables systematic experimentation in JavaScript-focused model training, domain adaptation, and downstream code understanding tasks.
JavaScript-Code-Large addresses the need for a dedicated JavaScript-only dataset at substantial scale, enabling focused research across frontend, backend, and full-stack JavaScript environments. .
reacted
to
their
post
with 🔥
about 15 hours ago
JavaScript-Code-Large
https://huggingface.co/datasets/ajibawa-2023/JavaScript-Code-Large
JavaScript-Code-Large is a large-scale corpus of JavaScript source code comprising around 5 million JavaScript files. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the JavaScript ecosystem.
By providing a high-volume, language-specific corpus, JavaScript-Code-Large enables systematic experimentation in JavaScript-focused model training, domain adaptation, and downstream code understanding tasks.
JavaScript-Code-Large addresses the need for a dedicated JavaScript-only dataset at substantial scale, enabling focused research across frontend, backend, and full-stack JavaScript environments. .
View all activity
Organizations
view post
JavaScript-Code-Large
ajibawa-2023/JavaScript-Code-Large JavaScript-Code-Large is a large-scale corpus of JavaScript source code comprising around 5 million JavaScript files. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis for the JavaScript ecosystem. By providing a high-volume, language-specific corpus, JavaScript-Code-Large enables systematic experimentation in JavaScript-focused model training, domain adaptation, and downstream code understanding tasks. JavaScript-Code-Large addresses the need for a dedicated JavaScript-only dataset at substantial scale, enabling focused research across frontend, backend, and full-stack JavaScript environments. .
See translation
view post
Java-Code-Large (
ajibawa-2023/Java-Code-Large ) Java-Code-Large is a large-scale corpus of publicly available Java source code comprising more than 15 million java codes. The dataset is designed to support research in large language model (LLM) pretraining, code intelligence, software engineering automation, and program analysis. By providing a high-volume, language-specific corpus, Java-Code-Large enables systematic experimentation in Java-focused model training, domain adaptation, and downstream code understanding tasks.
See translation
models
32
ajibawa-2023/Python-Code-13B
Text Generation
•
13B
•
Updated
Nov 9, 2024
•
706
•
7
ajibawa-2023/Young-Children-Storyteller-Mistral-7B
Text Generation
•
7B
•
Updated
Jun 26, 2024
•
21
•
23
ajibawa-2023/SlimOrca-Llama-3-8B
Text Generation
•
8B
•
Updated
May 27, 2024
•
11
•
•
4
ajibawa-2023/Code-Llama-3-8B
Text Generation
•
8B
•
Updated
May 8, 2024
•
474
•
31
ajibawa-2023/Uncensored-Frank-Llama-3-8B
Text Generation
•
8B
•
Updated
May 8, 2024
•
23
•
•
13
ajibawa-2023/Scarlett-Llama-3-8B-v1.0
Text Generation
•
Updated
May 7, 2024
•
5
•
5
ajibawa-2023/Scarlett-Llama-3-8B
Text Generation
•
Updated
Apr 26, 2024
•
5
•
8
ajibawa-2023/Code-Mistral-7B
Text Generation
•
7B
•
Updated
Apr 26, 2024
•
36
•
15
ajibawa-2023/General-Stories-Mistral-7B
Text Generation
•
Updated
Apr 23, 2024
•
19
•
5
ajibawa-2023/Code-Jamba-v0.1
Text Generation
•
52B
•
Updated
Apr 12, 2024
•
11
•
7