From 40c302040d8ee36aba925b4cd52979f77567605d Mon Sep 17 00:00:00 2001 From: Shayan Rais Date: Thu, 7 May 2026 12:35:03 +0500 Subject: [PATCH] reformat slide 10 to match slide 17 pattern (heading + separator + content) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Slide 10 ("How an LLM generates text") restructured to use the deck's canonical heading-with-separator pattern modeled after slide 17: -

uses default styling (no inline overrides) → gets the default border-bottom that acts as the separator - Outer flex-centering wrapper dropped, matching slide 17's flat layout Heading text changed from "One token at a time" → "How an LLM generates text (autoregressive)" — promoted out of the SVG asset (which has been trimmed to diagram-only in the paired commit). Caption simplified to a single bold line: "Each predicted token is appended to the input, then fed back into the LLM." (was previously the SVG's subtitle). The secondary streaming/cost line is removed for focus. Figure max-width increased 860px → 1100px (~28% larger) for projector legibility — combined with the SVG's diagram-shifted-up restructuring, the on-screen diagram is now roughly 2x the previous size. Co-Authored-By: Claude --- .../claude-code-best-practice/index.html | 30 +++++++------------ 1 file changed, 11 insertions(+), 19 deletions(-) diff --git a/presentation/claude-code-best-practice/index.html b/presentation/claude-code-best-practice/index.html index 50d087b..521c0b2 100644 --- a/presentation/claude-code-best-practice/index.html +++ b/presentation/claude-code-best-practice/index.html @@ -480,25 +480,17 @@
-
- - -

One token at a time

- - -
- Animated diagram showing autoregressive generation: prompt feeds into LLM, which predicts one token, feeds it back, and repeats until the full answer is produced. -
- The model produces one token per inference, feeding each result back as new input.
- This is why streaming feels gradual — and why longer outputs cost more in both latency and API spend. -
-
- -
+

How an LLM generates text (autoregressive)

+
+ Animated diagram showing autoregressive generation: prompt feeds into LLM, which predicts one token, feeds it back, and repeats until the full answer is produced. +
+ Each predicted token is appended to the input, then fed back into the LLM. +
+