Cross-lingual Transfer of Reward Models
Collection
This is the collection of synthetic preference data and trained reward models in "Cross-lingual Transfer of Reward Models in Multilingual Alignment". • 5 items • Updated
This model is a fine-tuned version of Qwen/Qwen2.5-3B-Instruct on the iqwiki-kor/MP-86k dataset.
| Model | Chat | Chat-Hard | Safety | Reasoning | Avg. |
|---|---|---|---|---|---|
| iqwiki-kor/Qwen2.5-3B-MP-RM | 89.1 | 75.2 | 87.3 | 95.4 | 86.8 |
| RLHFlow/ArmoRM-Llama3-8B-v0.1 | 96.9 | 76.8 | 90.5 | 97.3 | 90.4 |
The following hyperparameters were used during training: