Nexesenex/Llama_3.1_8b_Smarteaz_0.2_R1

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged with the Model Stock merge method, with huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated as the base model.
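To make the merge method concrete, here is a minimal NumPy sketch of the per-layer interpolation described in the Model Stock paper, as far as I understand it: the fine-tuned weights are averaged, then pulled back toward the base by a ratio t derived from the angle between the fine-tuned deltas. The function name `model_stock_layer` and the toy arrays are illustrative assumptions; this is not mergekit's code, which may differ in details.

```python
import numpy as np

def model_stock_layer(base, finetuned):
    """Merge one layer (hypothetical helper, not part of mergekit).

    base:      array of base-model weights for the layer
    finetuned: list of arrays (same shape) from the fine-tuned models
    Returns (merged_weights, t).
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned models")

    # Deltas of each fine-tuned model relative to the base.
    w0 = base.ravel()
    deltas = [w.ravel() - w0 for w in finetuned]

    # Average pairwise cosine between the deltas.
    cosines = []
    for i in range(k):
        for j in range(i + 1, k):
            denom = np.linalg.norm(deltas[i]) * np.linalg.norm(deltas[j])
            cosines.append(float(deltas[i] @ deltas[j]) / denom if denom > 0 else 0.0)
    cos_theta = sum(cosines) / len(cosines)

    # Interpolation ratio: t = k*cos / (1 + (k-1)*cos).
    # Raises ZeroDivisionError in the degenerate case 1 + (k-1)*cos == 0.
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    w_avg = np.mean(finetuned, axis=0)
    return t * w_avg + (1 - t) * base, t

# Toy check: orthogonal deltas give t = 0, so the merge falls back to the base.
base = np.zeros(2)
merged, t = model_stock_layer(base, [np.array([1.0, 0.0]), np.array([0.0, 1.0])])
print(t, merged)
```

When the fine-tuned deltas point in the same direction, t approaches 1 and the result approaches the plain average; when they disagree, the result stays closer to the base.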

Models Merged

The following models were included in the merge:

meditsolutions/Llama-3.1-MedIT-SUN-8B
allenai/Llama-3.1-Tulu-3.1-8B

Configuration

The following YAML configuration was used to produce this model:

merge_method: model_stock
models:
  - model: meditsolutions/Llama-3.1-MedIT-SUN-8B
    parameters:
      weight: 1.0
  - model: allenai/Llama-3.1-Tulu-3.1-8B
    parameters:
      weight: 1.0
base_model: huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated
dtype: bfloat16
normalize: true
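
To reproduce the merge, the configuration above can be saved to a file and passed to mergekit's `mergekit-yaml` command, which takes a config path and an output directory. The sketch below only writes the file and builds the command; the helper names, `config.yml`, and `./merged-model` are hypothetical, and actually running the merge requires mergekit to be installed and downloads all three models.

```python
from pathlib import Path

# The configuration from this model card, verbatim.
CONFIG_YAML = """\
merge_method: model_stock
models:
  - model: meditsolutions/Llama-3.1-MedIT-SUN-8B
    parameters:
      weight: 1.0
  - model: allenai/Llama-3.1-Tulu-3.1-8B
    parameters:
      weight: 1.0
base_model: huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated
dtype: bfloat16
normalize: true
"""

def write_config(path):
    """Write the merge configuration to `path` (hypothetical helper)."""
    Path(path).write_text(CONFIG_YAML)

def build_merge_command(config_path, output_dir):
    """Return the argument list for mergekit's `mergekit-yaml` CLI."""
    return ["mergekit-yaml", str(config_path), str(output_dir)]

# Usage (not executed here, since it downloads several 8B models):
#   import subprocess
#   write_config("config.yml")
#   subprocess.run(build_merge_command("config.yml", "./merged-model"), check=True)
```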

Open LLM Leaderboard Evaluation Results

Detailed results are published on the Open LLM Leaderboard.

Metric	Value
Avg.	28.11
IFEval (0-Shot)	63.46
BBH (3-Shot)	30.70
MATH Lvl 5 (4-Shot)	26.06
GPQA (0-shot)	6.71
MuSR (0-shot)	12.32
MMLU-PRO (5-shot)	29.39
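
As a quick sanity check, the reported average appears to be the unweighted mean of the six benchmark scores in the table. The short sketch below recomputes it; the dictionary is simply the table transcribed by hand.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 63.46,
    "BBH (3-Shot)": 30.70,
    "MATH Lvl 5 (4-Shot)": 26.06,
    "GPQA (0-shot)": 6.71,
    "MuSR (0-shot)": 12.32,
    "MMLU-PRO (5-shot)": 29.39,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)
print(f"{average:.2f}")  # prints 28.11
```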

Safetensors

Model size: 8B params
Tensor type: BF16

Model tree for Nexesenex/Llama_3.1_8b_Smarteaz_0.2_R1

allenai/Llama-3.1-Tulu-3.1-8B

huihui-ai/DeepSeek-R1-Distill-Llama-8B-abliterated

meditsolutions/Llama-3.1-MedIT-SUN-8B

Paper for Nexesenex/Llama_3.1_8b_Smarteaz_0.2_R1

Model Stock: All we need is just a few fine-tuned models

arXiv 2403.19522, published Mar 28, 2024

Evaluation results

Source: Open LLM Leaderboard

Benchmark	Metric	Value
IFEval (0-Shot)	strict accuracy	63.460
BBH (3-Shot)	normalized accuracy	30.700
MATH Lvl 5 (4-Shot)	exact match	26.060
GPQA (0-shot)	acc_norm	6.710
MuSR (0-shot)	acc_norm	12.320
MMLU-PRO (5-shot), test set	accuracy	29.390