Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing Paper • 2509.08721 • Published Sep 10, 2025
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs Paper • 2507.07996 • Published Jul 10, 2025
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models Paper • 2506.19697 • Published Jun 24, 2025
What Matters in Transformers? Not All Attention is Needed Paper • 2406.15786 • Published Jun 22, 2024