Domain Specific Corpora Collection Collection of corpora prepared from specific domains mainly in Galician language. • 5 items • Updated 7 days ago
Text Datasets for Fine-tuning and Instruction tuning Collection Collection of datasets in Galician for fine-tuning, instruction tuning or training purposes. • 22 items • Updated 15 days ago
Text Datasets for Evaluation Collection Collection of datasets in Galician for LLM evaluation. It includes translations from already existing datasets as well as datasets created by us. • 21 items • Updated 15 days ago