👋 Open to Work
0.6 TFLOPS
Joseph [open/acc] Pollack
PRO
Tonic
516
473
2306
621 followers · 1,955 following
https://discord.gg/qdfnvSPcqP
josephpollack
Tonic-AI
josephpollack
AI & ML interests
🤖Making robots to help people learn things quicker 👩🏻🚀🚀
Recent Activity
Reacted to RDTvlokip's post with 👍 about 16 hours ago
I finally changed the architecture of my 15M French LLM. It worked. Then I almost fooled myself about how much, and catching that was the real win.

After proving last time that architecture is a threshold, not a lever, I got stubborn: could I change how the model learns? Four honest attempts: Lion, a sharper AdamW β2, multi-token prediction, LayerScale. Four failures. The bottleneck wasn't the learning rule either.

So I changed the shape of the computation instead: loop the same transformer blocks 4×, deeper reasoning, zero added parameters. It beat the baseline on perplexity, the first thing in the whole project to move that number. Then I added my own twist: let each token decide how deep to think, halting on its own entropy.

My first evaluation was spectacular. Coherence up 65%. Hallucinated names down 62%. It was noise. Eight prompts, one seed. I re-ran on 50 prompts × 200 tokens and watched the gains shrink to "modest", and on out-of-domain prompts, recurrence actually made things worse. No universal winner. And none of it is new: it's Adaptive Computation Time (2016), the Universal Transformer (2018), and LoopViT (2026), recombined and measured honestly.

The real lesson: a number from 8 prompts is a rumor. The eval harness that kills your own best result is worth more than the result it kills. Cite your lineage. Stay preliminary until multiple seeds say otherwise.

The three models are live. The write-up is honest about every caveat 👇
🔗 https://huggingface.co/blog/RDTvlokip/teaching-a-15m-french-llm-to-think-deeper
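The idea in the post of looping the same blocks and letting each token halt on its own entropy can be illustrated with a toy sketch. This is not the author's implementation: `recurrent_depth`, `sharpen`, and the `threshold` value are hypothetical names and numbers chosen for illustration, and the "state" is treated directly as logits, which a real model would not do.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    # Shannon entropy in nats; zero-probability terms contribute nothing.
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def recurrent_depth(step, state, max_loops=4, threshold=0.5):
    """Apply the same `step` up to `max_loops` times (shared weights,
    so no added parameters). Stop early once the entropy of the
    predicted distribution falls below `threshold`.
    Returns (final_state, loops_used)."""
    loops_used = 0
    for loops_used in range(1, max_loops + 1):
        state = step(state)
        if entropy(softmax(state)) < threshold:
            break
    return state, loops_used

def sharpen(logits):
    # Hypothetical stand-in for the shared transformer blocks:
    # each pass makes the distribution more peaked.
    return [2.0 * x for x in logits]

# A confident token halts early; an uncertain one uses the full budget.
_, confident_loops = recurrent_depth(sharpen, [4.0, 0.0, 0.0])
_, uncertain_loops = recurrent_depth(sharpen, [0.2, 0.1, 0.0])
print(confident_loops, uncertain_loops)
```

In this toy setup, the token whose distribution is already peaked stops after one pass, while the near-uniform one runs all four loops without ever crossing the threshold.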
Liked a Space 3 days ago
krea/Krea-2
Liked a Space 6 days ago
julien-c/caliceo
Organizations
Tonic's activity
New activity in
laion/tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps-swes-r2eg-32b__Qwen3-32B
2 months ago
fantastic name for this one !
2
#1 opened 2 months ago by
Tonic
New activity in
NuTonic/sat-bbox-metadata-sft-v1
2 months ago
[bot] Conversion to Parquet
#1 opened 2 months ago by
parquet-converter
Dataset Viewer issue: JobManagerCrashedError
1
#2 opened 2 months ago by
Tonic
New activity in
ReubenDataLab/README
2 months ago
This is an Important Organisation
🤗
1
#6 opened 2 months ago by
Tonic
New activity in
NuTonic/sat-image-boundingbox-sft-large
2 months ago
[bot] Conversion to Parquet
#1 opened 2 months ago by
parquet-converter
New activity in
le-leadboard/OpenLLMFrenchLeaderboard
2 months ago
Update app.py
#4 opened 2 months ago by
Tonic
New activity in
Tonic/GeneReviews
3 months ago
[bot] Conversion to Parquet
#1 opened 10 months ago by
parquet-converter
New activity in
Tonic/fr-on-device
3 months ago
ZeroGPU allocates 6 GPUs instead of 1
2
#2 opened 4 months ago by
Tonic
New activity in
galsenai/WaxalNLP
3 months ago
Absolutely fantastic work
❤️
1
1
#2 opened 3 months ago by
Tonic
New activity in
Tonic/hugging-claw
3 months ago
Update setup-hf-config.mjs
1
#2 opened 4 months ago by
ubix
New activity in
google/WaxalNLP
3 months ago
wolof text does not match audio
9
#16 opened 4 months ago by
Tonic
New activity in
loleg/fastapi-apertus
3 months ago
if you make this @gpu.zero decorator on the correct methods you can make this a free zero.gpu demo
#1 opened 3 months ago by
Tonic
New activity in
nvidia/HiLiftAeroML
4 months ago
i just cant wait for this one
1
#1 opened 4 months ago by
Tonic
New activity in
FireRedTeam/FireRed-Image-Edit-1.1
4 months ago
JS Decode errors
6
#1 opened 5 months ago by
povgeek37
New activity in
unsloth-jobs/LFM2.5-1.2B-Instruct-mobile-actions
4 months ago
can i please try with any lfm ? lfm-vl ? that would be nice.
1
#2 opened 4 months ago by
Tonic
New activity in
Tonic/MiniF2F
4 months ago
[bot] Conversion to Parquet
#1 opened over 1 year ago by
parquet-converter
New activity in
kurakurai/Luth-LFM2-700M-GGUF
4 months ago
adds Q4 KM gguf file for mobile compatibility
1
#1 opened 4 months ago by
Tonic
New activity in
kurakurai/Luth-LFM2-350M-GGUF
4 months ago
adds Q4_KM for mobile compatible
#1 opened 4 months ago by
Tonic
New activity in
Tonic/fr-on-device
4 months ago
Apply for a GPU community grant: Personal project
🔥
3
1
#1 opened 4 months ago by
Tonic
New activity in
Tonic/hugging-claw
5 months ago
use case
1
#1 opened 5 months ago by
rahul7star