Syllabus

Week-by-week schedule covering hardware fundamentals through speculative decoding.

Week  Topic                                      Reading
W1    Introduction to Computer Architecture      Patterson & Hennessy Ch. 1
W2    Number Systems & Data Representation       P&H Ch. 2
W3    CPU Design & Instruction Sets              P&H Ch. 3-4
W4    Memory Hierarchy & Caching                 P&H Ch. 5
W5    GPU Architecture & Parallel Computing      Kirk & Hwu Ch. 1-3
W6    Introduction to Neural Networks            Goodfellow et al. Ch. 6
W7    Transformer Architecture Deep Dive         Vaswani et al. (2017)
W8    LLM Inference & KV Cache                   Selected papers
W9    Quantization Techniques (GGUF, GPTQ, AWQ)  Selected papers
W10   Speculative Decoding Theory                Leviathan et al. (2023)
W11   Speculative Decoding Lab                   llama.cpp documentation
W12   Benchmarking & Performance Analysis        Course materials
W13   Advanced Topics & Future Directions        Selected papers
W14   Final Project Presentations