Commit Graph

5 Commits

Author SHA1 Message Date
Shayan Rais 13f7ca9e48 restructure llm-basic.svg for projector legibility
Title and subtitle removed from the SVG (now promoted to the slide-level
heading and caption). Iteration counter relocated from the top of the SVG
(y=85) to the bottom (y=480), just above the feedback caption, with font
size increased from 13 to 20 for projector readability.

The whole diagram (panels, LLM box, arrows, animated circles, feedback
path) shifted upward by 80px to reclaim the space freed by the removed
title and subtitle — the diagram now fills more of its viewBox rather
than leaving empty space at the top.

ViewBox extended 500 → 530 at the bottom to accommodate the relocated
iteration counter and pushed-down feedback caption (y=488 → y=510).
Background rect height bumped to 530 to match (otherwise the new bottom
strip would render transparent).

All 7 <animate> blocks on iteration elements preserved verbatim — only
the parent <text> y and font-size attributes changed.

Co-Authored-By: Claude <noreply@anthropic.com>
2026-05-07 12:34:51 +05:00
Shayan Rais 9a2657c431 add llm-animation-tokenids.svg — advanced tokenization view with integer IDs
Animated SVG showing what the LLM actually receives: integer token IDs (one
layer deeper than llm-advanced.svg). Each of the 32 input tokens displays the
ID prominently with the token text in small italic underneath (e.g., 28133
"Does", 17554 " Chat", 162016 "GPT", 97481 " Claude", 29683 " Anth", 71571
"ropic"). Same 7-iteration autoregressive loop; generated tokens also shown
as IDs. Vocab size labeled V ≈ 200,000. Title formula: f: ℤᵏ → ℝⱽ;
next_id = argmax(f(ids)). ViewBox 1360×600 (wider than the other LLM SVGs).

Co-Authored-By: Claude <noreply@anthropic.com>
2026-05-07 11:58:57 +05:00
Shayan Rais a05c791c41 add llm-advanced.svg — combined BPE tokenization + autoregressive diagram
Animated SVG showing the same BPE-tokenized prompt from tokens.jpg (32 colored
subword tiles, e.g., "Anthropic" → "Anth"+"ropic", "Perplexity" →
"Per"+"plex"+"ity") feeding into the LLM and generating "Yes, they all use
BPE." token-by-token across 7 iterations. Combines tokenization and
autoregressive generation into one view.

Co-Authored-By: Claude <noreply@anthropic.com>
2026-05-07 11:45:16 +05:00
Shayan Rais faa82716b0 add tokens.jpg — OpenAI tokenizer screenshot for tokenization slide
Screenshot of platform.openai.com/tokenizer showing the sentence "Does ChatGPT,
Claude, Anthropic, Llama, Mistral, Gemini, and Perplexity all use Byte-Pair
Encoding (BPE)?" tokenizing to 32 tokens / 105 characters. Visible tabs:
GPT-5.x & O1/3, GPT-4 & GPT-3.5 (legacy), GPT-3 (legacy) — illustrates that
different model generations use different tokenizers.

Co-Authored-By: Claude <noreply@anthropic.com>
2026-05-07 11:45:09 +05:00
Shayan Rais e53739367a add llm-basic.svg — animated autoregressive generation diagram
Three-panel SVG (input context, LLM black box, predicted next token) with
7-iteration loop generating "The capital of Japan is Tokyo." from the prompt
"What is the capital of Japan?". Includes purple feedback loop showing each
predicted token appended back into the input.

Co-Authored-By: Claude <noreply@anthropic.com>
2026-05-07 11:45:00 +05:00