8 tps on nVidia H200
4
#17 opened about 4 hours ago
by
svilen333
๐ llama.cpp Support Now Available!
๐
๐ฅ
4
1
#16 opened 2 days ago
by
nologik
IQuest-Coder-V1
#15 opened 3 days ago
by
AlphaOrionis9527
Using "IQuest-Coder-V1-40B-Loop-Instruct" on Cursor
2
#14 opened 6 days ago
by
Maub69
What vLLM version should I use to deploy this model?
3
#13 opened 6 days ago
by
yyg201708
Benchmaxxed
๐ง
๐
9
2
#12 opened 7 days ago
by
Tom-Neverwinter
Availability of 7B and 14B models mentioned in the paper
๐
2
#11 opened 8 days ago
by
Sopelllka
need official fp8 weights
#7 opened 8 days ago
by
wangruiai2023
LM Studio Support with Q4_K_S please?
โค๏ธ
3
1
#6 opened 9 days ago
by
Cagannn
่ฟๅๆฏ่ฐๅฎถ้ซๆไบ๏ผ๏ผ
๐ค
1
1
#3 opened 9 days ago
by
shangyue2333
smaller + thinking models
๐
๐
14
#2 opened 9 days ago
by
Fizzarolli
can you share benchmarks for all models you released
#1 opened 10 days ago
by
Narutoouz