Datasets for DPO

Updated Sep 19, 2025

A collection of DPO (Direct Preference Optimization) datasets for development. The data come from clembench v0.9 and v1.0 for all games except referencegame, which uses v1.6.

  • clembench-playpen/DPO_dialogue

    Viewer • Updated Jul 11, 2025 • 10.1k • 12

  • clembench-playpen/DPO_turn

    Viewer • Updated Aug 28, 2025 • 58.9k • 24
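
The two repositories listed above can presumably be pulled with the Hugging Face `datasets` library. The following is a minimal sketch, not a documented recipe from the collection authors: the helper name `load_dpo_dataset` and the constant `DPO_DATASETS` are hypothetical, and it assumes the repositories are publicly readable and need no configuration name. Split and column names are not stated on this page, so none are assumed here.

```python
# Repository IDs exactly as listed in this collection.
DPO_DATASETS = [
    "clembench-playpen/DPO_dialogue",
    "clembench-playpen/DPO_turn",
]


def load_dpo_dataset(repo_id: str):
    """Download one of the collection's datasets from the Hugging Face Hub.

    Assumption: the `datasets` package is installed and the repository is
    public; a gated or private repository would need an access token.
    """
    if repo_id not in DPO_DATASETS:
        raise ValueError(f"{repo_id!r} is not part of this collection")

    # Imported lazily so the ID check above works without the dependency.
    from datasets import load_dataset

    # No split or config name is passed because this page does not state
    # them; if the repository defines several configurations, the call
    # may ask for one explicitly.
    return load_dataset(repo_id)
```

Calling `load_dpo_dataset("clembench-playpen/DPO_turn")` and printing the result should show whichever splits, columns, and row counts the repository actually defines, which is a reasonable first check before passing the data to a preference-optimization trainer.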