Resources
Chapter 1: The Anatomy of PostgreSQL: Architecture and Process Model
Chapter 2: The Journey of a Query: Lexing, Parsing, and the Traffic Cop
Chapter 3: The PostgreSQL Rule System and Query Rewriting
Chapter 4: The Query Planner Part I: Statistics and Cost Estimation
Chapter 5: The Query Planner Part II: Path Generation and GEQO
Chapter 6: The Executor: Processing the Plan Tree
Chapter 7: Advanced Indexing Under the Hood
Chapter 8: Multiversion Concurrency Control (MVCC) and Vacuuming
Chapter 9: Memory Management and Caching Strategies
Chapter 10: The Write-Ahead Log (WAL) and Crash Recovery
Chapter 11: Replication: Physical and Logical
Chapter 12: Distributed PostgreSQL and Sharding
Chapter 13: Extending the Engine
Project Based Assignments
Project 3: Architecting for Scale with Partition Pruning and FDW
You are the lead database architect for an IoT company that generates millions of sensor readings daily. Querying this massive, monolithic table has become unacceptably slow.
Your task is to implement declarative table partitioning by range (e.g., partitioning by month) to physically divide the data while maintaining a single logical table interface. Once partitioned, you must simulate a "hot/cold" storage tier architecture. You will spin up a second PostgreSQL instance, configure postgres_fdw (Foreign Data Wrapper), and move all partitions older than one year to this remote "cold" server.
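The setup above can be sketched in DDL. This is a minimal illustration, not the required solution: the table name sensor_readings, the partition key reading_ts, and the server/credential names are all assumptions for the example; PostgreSQL allows a foreign table to serve directly as a partition, which is one way to realize the hot/cold split.

```sql
-- Hypothetical schema: a parent table range-partitioned by reading time.
CREATE TABLE sensor_readings (
    sensor_id   bigint       NOT NULL,
    reading_ts  timestamptz  NOT NULL,
    value       double precision
) PARTITION BY RANGE (reading_ts);

-- One "hot" partition per month; a DEFAULT partition catches out-of-range rows.
CREATE TABLE sensor_readings_2024_01 PARTITION OF sensor_readings
    FOR VALUES FROM ('2024-01-01') TO ('2024-02-01');
CREATE TABLE sensor_readings_default PARTITION OF sensor_readings DEFAULT;

-- "Cold" tier: connect to the second instance through postgres_fdw.
CREATE EXTENSION IF NOT EXISTS postgres_fdw;
CREATE SERVER cold_server FOREIGN DATA WRAPPER postgres_fdw
    OPTIONS (host 'cold-host', port '5432', dbname 'archive');
CREATE USER MAPPING FOR CURRENT_USER SERVER cold_server
    OPTIONS (user 'archive_user', password 'secret');

-- An old month attached as a foreign partition: the data lives on the
-- remote server but remains queryable through the single logical parent.
CREATE FOREIGN TABLE sensor_readings_2023_01
    PARTITION OF sensor_readings
    FOR VALUES FROM ('2023-01-01') TO ('2023-02-01')
    SERVER cold_server
    OPTIONS (schema_name 'public', table_name 'sensor_readings_2023_01');
```

An alternative design keeps remote data behind a plain foreign table queried via a view; the foreign-partition approach shown here is what lets pruning and pushdown interact in a single plan.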
You must write a series of time-series queries spanning both recent and historical data. Using EXPLAIN, you must prove two things: first, that the query planner is successfully performing partition pruning (skipping irrelevant local partitions), and second, that it is pushing query conditions down to the remote server via the FDW rather than pulling the entire remote dataset into local memory for filtering.
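One way to structure the proof, assuming the hypothetical sensor_readings schema used for illustration above: plain EXPLAIN shows which partitions survive pruning, and EXPLAIN VERBOSE exposes the Remote SQL string that postgres_fdw actually ships to the cold server.

```sql
-- Pruning check: with a bounded predicate on the partition key, the plan
-- should scan only the matching partition(s), not every child table.
EXPLAIN
SELECT avg(value)
FROM sensor_readings
WHERE reading_ts >= '2024-01-10' AND reading_ts < '2024-01-20';
-- Expect only sensor_readings_2024_01 to appear in the plan.

-- Pushdown check: EXPLAIN VERBOSE prints a "Remote SQL:" line under the
-- Foreign Scan node; the WHERE conditions should appear inside it, proving
-- the filtering happens on the cold server rather than locally.
EXPLAIN VERBOSE
SELECT count(*)
FROM sensor_readings
WHERE reading_ts >= '2023-01-01' AND reading_ts < '2023-02-01'
  AND sensor_id = 42;
-- Expect a Foreign Scan whose Remote SQL carries both conditions.
```

If a condition shows up as a local Filter on the Foreign Scan instead of inside Remote SQL, it was not shippable and is being evaluated after pulling rows across the wire, which is exactly the failure mode the assignment asks you to rule out.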
Rubric:
| Criteria | Excellent | Proficient | Needs Improvement |
|---|---|---|---|
| Partitioning Implementation | Flawlessly implements range partitioning; data routes correctly; handles edge cases and default partitions. | Implements partitioning but misses a default partition or uses an inefficient partition key. | Fails to successfully partition the data or breaks the logical schema structure. |
| FDW Configuration | Successfully establishes the foreign server, user mappings, and remote tables with secure and correct configurations. | Establishes the FDW but struggles with user mappings or remote execution permissions. | Fails to connect the two instances or successfully move the historical data. |
| Partition Pruning Proof | Provides conclusive EXPLAIN output proving the planner skips irrelevant local partitions for bounded queries. | Shows an EXPLAIN plan but the planner is still scanning more partitions than strictly necessary. | Planner performs full sequential scans across all local partitions. |
| Query Pushdown Verification | Proves via EXPLAIN VERBOSE that WHERE clauses and aggregates are pushed down to the remote server. | Proves basic FDW connection but pulls raw data locally to perform filtering or aggregation. | Does not understand or demonstrate query pushdown mechanics. |