Deep Dive into LLMs like ChatGPT
by andrej-karpathy
A comprehensive introduction to how large language models work, covering the entire pipeline from internet data collection through tokenization, neural network training, and inference.
by andrej-karpathy
A comprehensive introduction to how large language models work, covering the entire pipeline from internet data collection through tokenization, neural network training, and inference.
by demis-hassabis
Documentary tracing DeepMind's journey from a London startup to defeating world champions at Go and StarCraft, revealing the team's pursuit of artificial general intelligence.