Models & datasets from Characterizing Narrative Content in Web-Scale LLM Pretraining Data (NarraDolma & NarraBERT)
Teagan Johnson
teagrjohnson
AI & ML interests
Computational Narratology, Data Curation, LLM Behavior
Recent Activity
submitted a paper about 22 hours ago
Characterizing Narrative Content in Web-scale LLM Pretraining Data authored a paper 1 day ago
Characterizing Narrative Content in Web-scale LLM Pretraining Data updated a collection 4 days ago
Narratives in LLM Pretraining Data