lvkaokao
AI & ML interests
None yet
Recent Activity
updated a dataset about 9 hours ago
Intel/ld_requests
updated a dataset about 13 hours ago
Intel/dynamic_model_information
updated a model 1 day ago
lvkaokao/Qwen3-32B-autoround-MXFP4
Organizations
Data contamination with GSM8k?
👍 1
1
#4 opened over 2 years ago by kno10
Comprehensive overhaul of README.md for better documentation of the model
#4 opened over 2 years ago by bconsolvo
Update README.md
#8 opened over 2 years ago by bconsolvo
Update README.md
#7 opened over 2 years ago by bconsolvo
Update README.md
#6 opened over 2 years ago by bconsolvo
Update README.md
#19 opened over 2 years ago by bconsolvo
update README.md
#18 opened over 2 years ago by bconsolvo
Contaminated?
7
#6 opened over 2 years ago by kno10
Minor updates to README
#1 opened over 2 years ago by bconsolvo
v3-2 vs v3-1
👍 13
5
#1 opened over 2 years ago by bartowski
testing calculations
2
#2 opened over 2 years ago by eramax
Prompt Template?
👍 4
13
#1 opened over 2 years ago by fakezeta
Other benchmarks as MT-Bench and/or AlpacaEval
2
#14 opened over 2 years ago by alvarobartt
About DROP results within the `lm-eval-harness`
4
#13 opened over 2 years ago by alvarobartt
Adding `safetensors` variant of this model
👍 1
2
#10 opened over 2 years ago by SFconvertbot
Potential ways to reduce inference latency on CPU cluster?
2
#11 opened over 2 years ago by TheBacteria
Adding Evaluation Results
#8 opened over 2 years ago by leaderboard-pr-bot
remove model
3
#388 opened over 2 years ago by lvkaokao
Added demo code according to the prompt format
🤝🤗 1
1
#5 opened over 2 years ago by macadeliccc
Is the context length same as Mistral (8k)?
2
#1 opened over 2 years ago by krumeto