Activity Feed

AI & ML interests

None defined yet.

Recent Activity

ericmsmithΒ 
authored 14 papers 5 months ago
MolbapΒ 
posted an update 5 months ago
view post
Post
3420
πŸš€ New blog: Maintain the unmaintainable – 1M+ Python LOC, 400+ models

How do you stop a million-line library built by thousands of contributors from collapsing under its own weight?
At πŸ€— Transformers, we do it with explicit software-engineering tenets, principles that make the codebase hackable at scale.

πŸ” Inside the post:
– One Model, One File: readability first β€” you can still open a modeling file and see the full logic, top to bottom.
– Modular Transformers: visible inheritance that cuts maintenance cost by ~15Γ— while keeping models readable.
– Config-Driven Performance: FlashAttention, tensor parallelism, and attention scheduling are config-level features, not rewrites.

Written with @lysandre ,@pcuenq and @yonigozlan , this is a deep dive into how Transformers stays fast, open, and maintainable.

Read it here β†’ transformers-community/Transformers-tenets
lysandreΒ 
posted an update 6 months ago
view post
Post
8032
We're kick-starting the process of Transformers v5, with @ArthurZ and @cyrilvallez !

v5 should be significant: we're using it as a milestone for performance optimizations, saner defaults, and a much cleaner code base worthy of 2025.

Fun fact: v4.0.0-rc-1 came out on Nov 19, 2020, nearly five years ago!
  • 6 replies
Β·
ariG23498Β 
posted an update 6 months ago
view post
Post
1751
New post is live!

This time we cover some major updates to transformers.

πŸ€—
  • 2 replies
Β·