👋 Open to Work

Joseph [open/acc] Pollack PRO

Tonic

hugging-science

https://discord.gg/qdfnvSPcqP

AI & ML interests

🤖Making robots to help people learn things quicker 👩🏻‍🚀🚀

Recent Activity

reacted to RDTvlokip's post with 👍 about 16 hours ago

I finally changed the architecture of my 15M French LLM. It worked. Then I almost fooled myself about how much, and catching that was the real win.

After proving last time that architecture is a threshold, not a lever, I got stubborn: could I change how the model learns? Four honest attempts: Lion, a sharper AdamW β2, multi-token prediction, LayerScale. Four failures. The bottleneck wasn't the learning rule either.

So I changed the shape of the computation instead: loop the same transformer blocks 4× for deeper reasoning with zero added parameters. It beat the baseline on perplexity, the first thing in the whole project to move that number. Then I added my own twist: let each token decide how deep to think, halting on its own entropy.

My first evaluation was spectacular. Coherence up 65%. Hallucinated names down 62%. It was noise. Eight prompts, one seed. I re-ran on 50 prompts × 200 tokens and watched the gains shrink to "modest", and on out-of-domain prompts, recurrence actually made things worse. No universal winner. And none of it is new: it's Adaptive Computation Time (2016), the Universal Transformer (2018), and LoopViT (2026), recombined and measured honestly.

The real lesson: a number from 8 prompts is a rumor. The eval harness that kills your own best result is worth more than the result it kills. Cite your lineage. Stay preliminary until multiple seeds say otherwise.

The three models are live. The write-up is honest about every caveat 👇
🔗 https://huggingface.co/blog/RDTvlokip/teaching-a-15m-french-llm-to-think-deeper
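The idea in the post (reuse one set of blocks up to 4 times, and let each token stop early once its prediction entropy is low) can be illustrated with a toy sketch. This is not the post author's implementation: `shared_block` is a hypothetical stand-in that merely sharpens logits, and the `threshold` value and function names are assumptions chosen for illustration.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def shared_block(logits, sharpen=1.5):
    """Hypothetical stand-in for the weight-shared transformer stack.

    A real model would transform hidden states; here we only scale the
    logits so that repeated passes make a non-uniform prediction sharper.
    """
    return [sharpen * x for x in logits]


def adaptive_depth(token_logits, max_loops=4, threshold=0.5):
    """Apply the same block up to max_loops times per token.

    A token halts as soon as the entropy of its predicted distribution
    drops below threshold. Returns the number of passes each token used.
    """
    depths = []
    for logits in token_logits:
        steps = 0
        while steps < max_loops:
            logits = shared_block(logits)  # same parameters on every pass
            steps += 1
            if entropy(softmax(logits)) < threshold:
                break  # this token is confident enough; stop looping
        depths.append(steps)
    return depths
```

Under these assumptions, calling `adaptive_depth` on a confident token such as `[4.0, 0.0, 0.0]` halts it after a single pass, while a maximally uncertain token such as `[0.0, 0.0, 0.0]` uses all four passes, which is the "each token decides how deep to think" behaviour the post describes.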

liked a Space 3 days ago

liked a Space 6 days ago

julien-c/caliceo

New activity in laion/tasrep-a1mfc-gfistaqc-dev1-scaff-maxeps-swes-r2eg-32b__Qwen3-32B 2 months ago

Fantastic name for this one!

#1 opened 2 months ago by

New activity in NuTonic/sat-bbox-metadata-sft-v1 2 months ago

[bot] Conversion to Parquet

#1 opened 2 months ago by

parquet-converter

Dataset Viewer issue: JobManagerCrashedError

#2 opened 2 months ago by

New activity in ReubenDataLab/README 2 months ago

This is an Important Organisation

#6 opened 2 months ago by

New activity in NuTonic/sat-image-boundingbox-sft-large 2 months ago

[bot] Conversion to Parquet

#1 opened 2 months ago by

parquet-converter

New activity in le-leadboard/OpenLLMFrenchLeaderboard 2 months ago

Update app.py

#4 opened 2 months ago by

New activity in Tonic/GeneReviews 3 months ago

[bot] Conversion to Parquet

#1 opened 10 months ago by

parquet-converter

New activity in Tonic/fr-on-device 3 months ago

ZeroGPU allocates 6 GPUs instead of 1

#2 opened 4 months ago by

New activity in galsenai/WaxalNLP 3 months ago

Absolutely fantastic work

#2 opened 3 months ago by

New activity in Tonic/hugging-claw 3 months ago

Update setup-hf-config.mjs

#2 opened 4 months ago by

New activity in google/WaxalNLP 3 months ago

Wolof text does not match audio

#16 opened 4 months ago by

New activity in loleg/fastapi-apertus 3 months ago

If you put this @gpu.zero decorator on the correct methods, you can make this a free zero.gpu demo

#1 opened 3 months ago by

New activity in nvidia/HiLiftAeroML 4 months ago

I just can't wait for this one

#1 opened 4 months ago by

New activity in FireRedTeam/FireRed-Image-Edit-1.1 4 months ago

JS Decode errors

#1 opened 5 months ago by

New activity in unsloth-jobs/LFM2.5-1.2B-Instruct-mobile-actions 4 months ago

Can I please try with any LFM? LFM-VL? That would be nice.

#2 opened 4 months ago by

New activity in Tonic/MiniF2F 4 months ago

[bot] Conversion to Parquet

#1 opened over 1 year ago by

parquet-converter

New activity in kurakurai/Luth-LFM2-700M-GGUF 4 months ago

Adds Q4_K_M GGUF file for mobile compatibility

#1 opened 4 months ago by

New activity in kurakurai/Luth-LFM2-350M-GGUF 4 months ago

Adds Q4_K_M for mobile compatibility

#1 opened 4 months ago by

New activity in Tonic/fr-on-device 4 months ago

Apply for a GPU community grant: Personal project

#1 opened 4 months ago by

New activity in Tonic/hugging-claw 5 months ago

use case

#1 opened 5 months ago by