Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop 🔄
1
3
38
Michał Wiliński
MWilinski
Follow
adamm-hf's profile picture
arjunpushpik's profile picture
alxtrtw's profile picture
17 followers
·
26 following
https://michal-wilinski.com
inverse_hessian
JanekDev
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated
a collection
1 day ago
irl-alignment-5.1-expert
updated
a dataset
1 day ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
published
a dataset
1 day ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
View all activity
Organizations
MWilinski
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a collection
1 day ago
irl-alignment-5.1-expert
Collection
4 items
•
Updated
1 day ago
updated
a dataset
1 day ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer
•
Updated
1 day ago
•
1k
•
4
published
a dataset
1 day ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer
•
Updated
1 day ago
•
1k
•
4
updated
a collection
1 day ago
irl-alignment-5.1-expert
Collection
4 items
•
Updated
1 day ago
updated
a dataset
1 day ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer
•
Updated
1 day ago
•
1k
•
1
published
a dataset
1 day ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer
•
Updated
1 day ago
•
1k
•
1
updated
a dataset
2 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
2 days ago
•
1k
•
2
published
a dataset
2 days ago
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
2 days ago
•
1k
•
2
updated
a dataset
2 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
2 days ago
•
1k
•
3
published
a dataset
2 days ago
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer
•
Updated
2 days ago
•
1k
•
3
liked
a model
2 months ago
PleIAs/Baguettotron
Text Generation
•
0.3B
•
Updated
Dec 14, 2025
•
2.47k
•
218
updated
4 datasets
3 months ago
MWilinski/hh-rlhf-harmless-base
Viewer
•
Updated
Nov 5, 2025
•
44.8k
•
26
MWilinski/hh-rlhf-helpful-base
Viewer
•
Updated
Nov 5, 2025
•
46.2k
•
33
MWilinski/hh-rlhf-helpful-online
Viewer
•
Updated
Nov 5, 2025
•
23.1k
•
22
MWilinski/hh-rlhf-helpful-rejection-sampled
Viewer
•
Updated
Nov 5, 2025
•
55.2k
•
16
updated
a collection
3 months ago
hh-rlhf-TRL
Collection
4 items
•
Updated
Nov 5, 2025
Load more