- Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation (paper, arXiv:2604.10098, published 15 days ago)
- MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens (paper, arXiv:2603.23516, published Mar 6)
- Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context (paper, arXiv:2603.15653, published Mar 7)
- Qwen3.5-Abliterated-Opus-4.6-Distilled (collection by Qwen3.5-Abliterated, 3 items, updated Mar 8)