tokyotech-llm

university

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

s-mizuki-nlp updated a model 5 days ago

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

s-mizuki-nlp updated a model 5 days ago

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

s-mizuki-nlp updated a model 5 days ago

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

View all activity

Organization Card

Community About org cards

Swallow LLM

Research and development of large language models conducted by the members mainly in Okazaki Laboratory and Yokota Laboratory at Institute of Science Tokyo (formerly known as Tokyo Institute of Technology)

From Okazaki Laboratory, Institute of Science Tokyo, the following members:
- Naoaki Okazaki
- Sakae Mizuki
- Youmi Ma
- Sangwhan Moon
- Koki Maeda
- Masanari Ohi
- Hinari Shimada
- Taihei Shiotani
- Koshiro Saito
- Tatsuya Ichinose
- Naoya Matsushita
- Sora Miyamoto
- Nguyen Tien Dung
- Yuta Katayama
From YOKOTA Laboratory, Institute of Science Tokyo, the following members:
- Rio Yokota
- Kazuki Fujii
- Taishi Nakamura
- Takumi Okamoto
- Ishida Shigeki
- Masaki Kawamura
- Yukito Tajima
From Artificial Intelligence Research Center, AIST, Japan, the following members:
- Hiroya Takamura

Collections 16

View 16 collections

models 132

tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2

Text Generation • 8B • Updated 5 days ago • 7.28k • 5

tokyotech-llm/Qwen3-Swallow-32B-CPT-v0.2

Text Generation • 33B • Updated 5 days ago • 222 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-CPT-v0.2

Text Generation • 31B • Updated 5 days ago • 407

tokyotech-llm/Qwen3-Swallow-8B-CPT-v0.2

Text Generation • 8B • Updated 5 days ago • 554 • 1

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2-AWQ-INT4

Text Generation • 33B • Updated 5 days ago • 481 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2-AWQ-INT4

Text Generation • 31B • Updated 5 days ago • 516

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2-AWQ-INT4

Text Generation • 8B • Updated 5 days ago • 807

tokyotech-llm/Qwen3-Swallow-32B-RL-v0.2

Text Generation • 33B • Updated 5 days ago • 718 • 1

tokyotech-llm/Qwen3-Swallow-30B-A3B-RL-v0.2

Text Generation • 31B • Updated 5 days ago • 807 • 5

tokyotech-llm/Qwen3-Swallow-8B-RL-v0.2

Text Generation • 8B • Updated 5 days ago • 2.46k • 2

View 132 models

datasets 19

tokyotech-llm/Swallow-Nemotron-Post-Training-Dataset-v1

Viewer • Updated 7 days ago • 8.84M • 510 • 3

tokyotech-llm/lmsys-chat-1m-synth

Updated 9 days ago • 822 • 20

tokyotech-llm/s1-test-time-scaling-synth-public

Viewer • Updated 9 days ago • 59k • 16

tokyotech-llm/swallow-code-v2

Viewer • Updated Nov 8, 2025 • 147M • 174k • 32

tokyotech-llm/swallow-math-v2

Viewer • Updated Nov 6, 2025 • 17.4M • 5.28k • 27

tokyotech-llm/swallow_english_mt_bench

Viewer • Updated Aug 18, 2025 • 80 • 219

tokyotech-llm/MMLU-ProX-English

Updated Aug 18, 2025 • 334

tokyotech-llm/MMLU-Pro-English

Updated Aug 18, 2025 • 520

tokyotech-llm/MMLU-ProX-Japanese

Updated Aug 18, 2025 • 627

tokyotech-llm/JEMHopQA

Viewer • Updated Aug 8, 2025 • 3.78k • 256

View 19 datasets