Model Card for FIPO-IPL-IPO-Tulu2-70B

Our repository: https://github.com/LuJunru/FIPO_Project.

Our paper: https://arxiv.org/abs/2402.11811.

Input Format

The model is trained to use the following format (note the newlines):

<|user|>
Your message here!
<|assistant|>

For best results, format all inputs in this manner. Make sure to include a newline after <|assistant|>; omitting it can noticeably degrade generation quality.
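
The format above can be produced programmatically. The following is a minimal sketch, not part of the original project: the helper name `build_prompt` is hypothetical, and it assumes a single-turn conversation with one user message.

```python
def build_prompt(message: str) -> str:
    """Wrap one user message in the <|user|> / <|assistant|> format.

    Hypothetical helper for illustration; assumes a single-turn prompt.
    """
    # Strip surrounding whitespace so the only newlines are the ones
    # the format itself requires.
    body = message.strip()
    # The trailing "\n" after <|assistant|> is deliberate: the model card
    # notes that leaving it out can hurt generation quality.
    return f"<|user|>\n{body}\n<|assistant|>\n"


prompt = build_prompt("Your message here!")
print(repr(prompt))
```

The resulting string can then be tokenized and passed to the model as-is; printing it with `repr` makes the newlines visible for checking.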

Format: Safetensors

Model size: 69B params

Tensor type: F16

Model tree for Junrulu/FIPO-IPL-IPO-Tulu2-70B

Base model: meta-llama/Llama-2-70b-hf

Finetuned: allenai/tulu-2-dpo-70b

Finetuned: this model

Paper for Junrulu/FIPO-IPL-IPO-Tulu2-70B

FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema (arXiv 2402.11811, published Feb 19, 2024)