Article: Tokenization in Transformers v5: Simpler, Clearer, and More Modular (17 days ago)
Collection: ARC-Encoders. Pretrained ARC-Encoders and a fine-tuning dataset for context compression with unmodified LLMs (7 items, updated 10 days ago)
Article: Luth: Efficient French Specialization for Small Language Models (Aug 11, 2025)
Article: Should We Still Pretrain Encoders with Masked Language Modeling? (Jul 2, 2025)
Paper: Reducing the Footprint of Multi-Vector Retrieval with Minimal Performance Impact via Token Pooling (arXiv:2409.14683, Sep 23, 2024)