Get the eBook free when you register your print book at Manning.

"An exceptional deep dive into the next frontier of AI." - Aman Chadha, Google

This book is a practical guide to understanding how modern reasoning-oriented LLMs work by building their core methods step by step. The book tells a clear engineering story: start with a conventional pre-trained LLM, learn how text generation works, build reliable evaluation tools, improve reasoning through inference-time methods, then move into training-based approaches such as reinforcement learning and distillation.

The progression is deliberate. Early chapters establish the baseline model and explain text generation, KV caching, and evaluation with math verifiers. The middle chapters show how reasoning can be improved without changing model weights, using chain-of-thought prompting, sampling, self-consistency, response scoring, and self-refinement. Later chapters move to changing the model itself through reinforcement learning with verifiable rewards, GRPO improvements, format rewards, and finally distillation from stronger reasoning models into smaller ones.

The book is especially useful because it implements the core methods from scratch rather than treating them as black-box library calls. Readers see how self-consistency, self-refinement, Best-of-N, and training-based methods actually work, including their cost and latency trade-offs. It also discusses common failure modes, including cases where refinement can make answers worse. Difficult concepts such as softmax, temperature, and top-p sampling are clarified with code-linked explanations and diagrams, and visual workflows make pipelines and scoring methods easier to follow.

Reading the book feels like following a guided technical build rather than a loose survey of AI topics. Each concept is introduced because the project now needs it.
Diagrams, roadmaps, code listings, exercises, and repeated workflow summaries help readers stay oriented through advanced material. This structure reflects Sebastian Raschka’s professional strength: explaining complex machine learning topics by making every detail concrete and showing exactly where each section fits in the larger story. He does not treat mechanisms like evaluation, log-probabilities, KL regularization, or distillation as isolated abstractions; he connects them to the goal of making reasoning models understandable and implementable.

Physically and organizationally, the book has eight chapters and seven substantial appendixes. That keeps the main narrative focused while moving supporting material like references, exercise solutions, model source code, larger models, batching, evaluation alternatives, and chat interfaces into ordered appendixes. The result is a logically flowing book that remains hands-on, navigable, and technically deep without constantly interrupting the central build.

What's inside
From-scratch implementations of core LLM reasoning improvements
Verifier-based evaluation methods
RL with automatic verifiers for mathematics tasks

About the reader
For readers who know Python and have some knowledge of machine learning.

About the author
Sebastian Raschka is an LLM Research Engineer with over a decade of experience. He is the author of the bestselling book Build a Large Language Model (From Scratch).

Table of Contents
1 Understanding reasoning models
2 Generating text with a pretrained LLM
3 Evaluating reasoning models
4 Improving reasoning with inference-time scaling
5 Inference-time scaling via self-refinement
6 Training reasoning models with reinforcement learning
7 Improving GRPO for reinforcement learning
8 Distilling reasoning models for efficient reasoning
A References and further reading
B Exercise solutions
C Qwen3 LLM source code
D Using larger LLMs
E Batching and throughput-oriented execution
F Common approaches to model evaluation
G Building a chat interface