Learning path: Data Engineering¶
This page explains Learning path: Data Engineering and how it fits into the RecSys suite.
Goals¶
- Understand event schemas and file layouts
- Run backfills safely
- Operate data quality gates
- Define evolution rules for new fields
Read in this order¶
1) reference/schemas-events.md 1) explanation/data-lifecycle.md 1) how-to/run-backfill.md 1) how-to/add-event-field.md 1) reference/output-layout.md
Key practical advice¶
- Treat canonical events as the contract boundary.
- Keep schema evolution backwards compatible.
- Always validate before publishing.
Read next¶
- Start here: Start here
- Event schemas: Event schemas
- Backfill safely: How-to: Backfill pipelines safely
- Output layout: Output layout (local filesystem)