Dazhi Jiang

thuzhizhi

jiangzizi

AI & ML interests

None yet

Recent Activity

liked a dataset about 2 months ago

qi6776/Recflow

upvoted a paper 2 months ago

Data-Efficient RLVR via Off-Policy Influence Guidance

liked a Space 2 months ago

HuggingFaceTB/smol-training-playbook

Organizations

None yet

liked a dataset about 2 months ago

qi6776/Recflow

Updated Jul 11, 2025 • 103 • 1

upvoted a paper 2 months ago

Data-Efficient RLVR via Off-Policy Influence Guidance

Paper • 2510.26491 • Published Oct 30, 2025 • 10

liked a Space 2 months ago

The Smol Training Playbook

📚

2.78k

The secrets to building world-class LLMs

liked 2 models 2 months ago

inclusionAI/LLaDA-MoE-7B-A1B-Instruct

7B • Updated Oct 28, 2025 • 1.63k • 61

inclusionAI/LLaDA2.0-mini-preview

Text Generation • 16B • Updated 16 days ago • 3.9k • 86

upvoted a collection 2 months ago

LLaDA 2.0

Collection

7 items • Updated 11 days ago • 39

updated a Space 3 months ago

MorningMind NewsCards 🌱

🐳

Flip through news flashcards to stay informed

published a Space 3 months ago

MorningMind NewsCards 🌱

🐳

Flip through news flashcards to stay informed

liked a Space 3 months ago

DeepSite v3

🐳

16.2k

Generate any application by Vibe Coding

liked a model 4 months ago

SJTU-Deng-Lab/D2F_LLaDA_Instruct_8B_Lora

Text Generation • Updated Aug 14, 2025 • 5

liked a Space 4 months ago

Qwen Image Edit

✒

800

Edit and enhance images based on descriptive instructions

New activity in GSAI-ML/LLaDA-1.5 4 months ago

Looking forward to a demo

#1 opened 7 months ago by zzzgry

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated Sep 5, 2025 • 49.9k • 810

liked a model 5 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26, 2025 • 4.95k • 1k

authored a paper 5 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 195

liked a model 5 months ago

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 33.5k • 700

upvoted a paper 5 months ago

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Paper • 2508.06471 • Published Aug 8, 2025 • 195

liked 2 models 5 months ago

Qwen/Qwen-Image

Text-to-Image • Updated Aug 18, 2025 • 232k • 2.32k

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 20.8k • 1.39k

liked a model 6 months ago

Qwen/Qwen3-Coder-480B-A35B-Instruct

Text Generation • 480B • Updated Aug 21, 2025 • 19.4k • 1.26k
