Penghui Qi

QPHutu

AI & ML interests

None yet

QPHutu's collections (4)

STP: Self-play LLM Theorem Provers with Iterative Conjecturing and Proving

Paper • 2502.00212 • Published Jan 31, 2025 • 3
Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12, 2025 • 45

LLM Pretraining

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14, 2025 • 300
Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published May 15, 2025 • 83
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 321

DeepResearchGym: A Free, Transparent, and Reproducible Evaluation Sandbox for Deep Research

Paper • 2505.19253 • Published May 25, 2025 • 32
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Paper • 2505.20411 • Published May 26, 2025 • 92
Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27, 2025 • 109
Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

Pipeline Parallelism

Zero Bubble Pipeline Parallelism

Paper • 2401.10241 • Published Nov 30, 2023 • 25
Pipeline Parallelism with Controllable Memory

Paper • 2405.15362 • Published May 24, 2024 • 3
Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8, 2024 • 20
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization

Paper • 2503.01328 • Published Mar 3, 2025 • 16
