The 42 Weekly Readings, Annotated
Three readings a week for fourteen weeks. The list below is the complete canon, organized by part, with each paper's one-line reading instruction carried over from its week page. Two readings appear in two different weeks each — the 2005 What Goes Around Comes Around and the Anthropic self-service analytics post — so the deduplicated list runs to forty entries. Where a reading is assigned twice, both week numbers are shown: the repetition is the assignment.
Part I — Foundations Under New Workloads (Weeks 1–4)
Part II — New Access Methods & Engines (Weeks 5–8)
Part III — Semantics, Agents, Governance (Weeks 9–12)
Part IV — Frontier & Futures (Weeks 13–14)
Petrov, and the Alternate On-Ramp
Only one book is required: Alex Petrov’s Database Internals (O’Reilly, 2019; databass.dev). It is the course’s implementation-depth backstop — when a lecture asserts something about page layouts, recovery, or consensus and you want to see the actual mechanics, Petrov is where you go. Only chapters 2–4 and 7 are formally assigned (Week 2), but the rest of the book shadows the syllabus:
| Chapters | Topic | Where it lands in DATA 2027 |
|---|---|---|
| ch. 1 | Introduction; storage-engine taxonomy | Background for Week 1 |
| ch. 2–4 | B-tree basics, file formats, implementing B-trees | Assigned, Week 2; foundation for Lab 1 |
| ch. 5 | Transaction processing & recovery (WAL, ARIES-style thinking) | Weeks 2 & 8; Lab 2’s WAL design |
| ch. 6 | B-tree variants (B-link, copy-on-write trees) | Week 2 skim; copy-on-write returns in Week 8 |
| ch. 7 | Log-structured storage (LSM-trees, compaction) | Assigned, Week 2; the spec for Lab 1 |
| ch. 8–10 | Distributed systems primer; failure detection; leader election | Background for Week 4 |
| ch. 11–12 | Replication, consistency, anti-entropy | Weeks 4 & 8 |
| ch. 13–14 | Distributed transactions; consensus | Week 8, alongside Calvin and Spanner |
The alternate on-ramp is Martin Kleppmann’s Designing Data-Intensive Applications (O’Reilly, 2017; dataintensive.net). It covers much of the same territory one level of abstraction up — system properties and trade-offs rather than page formats and cell layouts. If Petrov’s chapter 4 feels like reading a disassembly, start with DDIA’s chapters 3 (storage and retrieval) and 5–9 (replication through consistency), then come back. Students who arrive from an applications background consistently report that DDIA-first, Petrov-second is the gentler path; students who have written a storage engine before can skip DDIA entirely.
How Progress Gets Measured — and Mismeasured
Week 9 and Lab 3 lean on two text-to-SQL benchmarks, and you should know both as artifacts, not just as numbers.
- Spider 2.0 (Lei et al., ICLR 2025) — enterprise-scale workflows over real warehouses: thousand-column schemas, dialect quirks, multi-step tasks that require reading project documentation. This is the benchmark where leaderboard scores fell off the famous cliff relative to its predecessor, and the one Lab 3’s eval harness imitates in miniature.
- BIRD (Li et al., NeurIPS 2023) — large-scale, database-grounded text-to-SQL with “external knowledge” attached to questions: the first major benchmark to admit that natural-language questions underdetermine SQL.
What to know about eval quality
Treat every leaderboard number as a measurement made with an imperfect instrument. Independent audits of BIRD’s gold annotations have found a substantial fraction of reference queries that are arguably wrong — ambiguous questions, gold SQL that doesn’t match the stated intent, schema values that contradict the “external knowledge.” This matters in both directions: systems get penalized for correct answers and rewarded for reproducing annotation mistakes. When a model “beats human performance” on a benchmark whose human-written gold labels contain errors, ask what is actually being measured. Lab 3 makes this concrete: part of your grade is auditing your own eval set and reporting the annotation defects you find. The habit generalizes — Week 2’s benchmark skepticism (RUM corners, hardware assumptions) and Week 9’s annotation skepticism are the same skill applied at different layers.
The Lab Toolchain
Four labs, four primary tools. Install all of them in Week 1; nothing here takes more than a few minutes to set up, and Lab 1’s toolchain check is due before Week 2.
| Tool | Lab | Role | Link |
|---|---|---|---|
| Rust + cargo | Lab 1 (Weeks 2–4) | Implementation language for VLSM, your LSM-tree with vector segments — you’ll own memtables, SSTables, compaction, and a vector access method. | rust-lang.org |
| MinIO | Lab 2 (Weeks 5–8) | S3-compatible object store run locally; the disaggregated storage layer under Mini-Neon’s copy-on-write pages and branches. | min.io |
| DuckDB | Lab 3 (Weeks 9–10) | The analytical engine your text-to-SQL agent targets and your eval harness queries — in-process, zero-ops, full SQL. | duckdb.org |
| Python 3.12+ | Lab 4 (Weeks 10–12) | Host language for the semantic-operator optimizer: logical plans, model-tier physical operators, and cascade calibration. | python.org |
Secondary dependencies (an LLM API key for Lab 3 — Lab 4’s simulator makes no real API calls — and plotting libraries for lab reports) are listed on each lab page: Lab 1, Lab 2, Lab 3, Lab 4.
After Week 14
This course will be stale in places within a year — Part III especially. The fix is a short list of feeds that have stayed reliable while everything else churned.
- Andy Pavlo’s annual “Databases in 20XX” retrospectives — cs.cmu.edu/~pavlo/blog. One post a year, every January, scoring the previous year’s acquisitions, funding, failures, and fashion cycles. The single highest signal-to-noise database publication in existence; read the whole back catalog.
- The three venues — CIDR (January; where systems people publish opinions and architectures — Lakehouse, SageDB, Palimpzest, and self-driving DBMSs all debuted here), SIGMOD (June), and VLDB (August/September). Skim the industrial tracks first: that is where the production retrospectives live.
- CMU Database Group seminar videos — youtube.com/@CMUDatabaseGroup. Weekly talks from the people building the systems in this syllabus, including the “quarantine tech talks” archive; the fastest way to hear how a paper’s claims sound when its authors defend them live.
- The two companion essays on this site — Essay № 01, the argument that the database client has changed, and Essay № 02, the design rationale for this course. Reread both after the final exam; they were written to be disagreed with, and by Week 14 you will be equipped to.