Colin Raffel
157 Arbeiten29.243 Zitationen
Relevante Arbeiten
Meistzitierte Publikationen im Bereich Gesundheit & MedTech
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
2022 · 548 Zit. · arXiv (Cornell University)
Improving Few-Shot Generalization by Exploring and Exploiting Auxiliary Data
2023 · 1 Zit. · arXiv (Cornell University)
Efficiently Estimating Data Efficiency for Language Model Fine-tuning
2025 · 0 Zit. · arXiv (Cornell University)
The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text
2025 · 0 Zit. · ArXiv.org
Efficiently Estimating Data Efficiency for Language Model Fine-tuning
2025 · 0 Zit. · ArXiv.org
Position: The Most Expensive Part of an LLM should be its Training Data
2025 · 0 Zit. · ArXiv.org