PaperAdvancedTechnical
Scaling Laws for Neural Language Models
Kaplan J, McCandlish S, Henighan T, et al.
What to Read Next
PaperAdvanced
Attention Is All You Need
The seminal paper that introduced the 'Transformer' architecture, which forms the basis of all modern LLMs (GPT, BERT, Claude).
GuideBeginner
A Very Gentle Introduction to Large Language Models without the Hype
Clear, accessible overview of what LLMs actually are and how they work -- no jargon.
ToolIntermediate
Hugging Face
The hub for open-source AI models, datasets, and demos.
Scaling Laws for Neural Language Models
