Collections ToolkitScaling Laws for Neural Language Models

PaperAdvancedTechnical

Scaling Laws for Neural Language Models

Kaplan J, McCandlish S, Henighan T, et al.

What to Read Next

Attention Is All You Need

The seminal paper that introduced the 'Transformer' architecture, which forms the basis of all modern LLMs (GPT, BERT, Claude).

A Very Gentle Introduction to Large Language Models without the Hype

Clear, accessible overview of what LLMs actually are and how they work — no jargon.

ToolIntermediate

Hugging Face

The hub for open-source AI models, datasets, and demos.

Scaling Laws for Neural Language Models