ByteDance-Seed/Stable-DiffCoder-8B-Base • Text Generation • 8B • 90 • 11
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation