Eni Grand

Enigrand

AI & ML interests

None yet

Recent Activity

upvoted a paper about 13 hours ago

Self-Distillation Enables Continual Learning

liked a model 1 day ago

Qwen/Qwen3-ForcedAligner-0.6B

liked a model 1 day ago

Qwen/Qwen3-ASR-1.7B

Organizations

upvoted a paper about 13 hours ago

Self-Distillation Enables Continual Learning

Paper • 2601.19897 • Published 3 days ago • 18

upvoted a collection 1 day ago

Qwen3-ASR

4 items • Updated 1 day ago • 30

upvoted a collection 3 days ago

Trinity-Large

5 items • Updated 2 days ago • 32

upvoted a paper 3 days ago

Qwen3-TTS Technical Report

Paper • 2601.15621 • Published 9 days ago • 54

upvoted a collection 4 days ago

Jan-v3

2 items • Updated 4 days ago • 3

upvoted a paper 8 days ago

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Paper • 2510.13999 • Published Oct 15, 2025 • 10

upvoted a collection 8 days ago

Qwen3-TTS

7 items • Updated 8 days ago • 255

upvoted a collection 13 days ago

Qwen3-VL

37 items • Updated about 1 month ago • 613

upvoted a paper 28 days ago

Evaluating Parameter Efficient Methods for RLVR

Paper • 2512.23165 • Published Dec 29, 2025 • 26

upvoted a paper 29 days ago

Scaling Open-Ended Reasoning to Predict the Future

Paper • 2512.25070 • Published about 1 month ago • 16

upvoted a collection 30 days ago

IQuest-Coder

13 items • Updated about 1 month ago • 90

upvoted a paper about 1 month ago

TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times

Paper • 2512.16093 • Published Dec 18, 2025 • 95

upvoted a collection about 1 month ago

Openhands Trajectories

Dataset of 67,074 OpenHands trajectories collected with Qwen3-Coder-480B-A35B-Instruct and two RFT checkpoints trained on the data • 3 items • Updated Dec 23, 2025 • 6

upvoted 4 papers about 1 month ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 85

VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse

Paper • 2512.14531 • Published Dec 16, 2025 • 14

Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics

Paper • 2512.12602 • Published Dec 14, 2025 • 44

QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management

Paper • 2512.12967 • Published Dec 15, 2025 • 108

upvoted a collection about 1 month ago

Molmo2

Artifacts for the Molmo2 release • 6 items • Updated Dec 23, 2025 • 31

upvoted 2 collections about 2 months ago

Bolmo

Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated Dec 23, 2025 • 12

Olmo 3.1

The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated Dec 23, 2025 • 47