
backfill how-to ops recsys-pipelines

How-to: Backfill pipelines safely

This guide shows how to backfill pipelines safely in a reliable, repeatable way.

Who this is for

Data engineers running historical reprocessing
SRE / on-call handling late data, broken windows, or schema changes

Goal

Recompute historical windows without breaking “current” artifacts, while staying within guardrails.
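
One common way to keep "current" intact during a backfill is to write results to a candidate location first and move the pointer only after validation passes. The sketch below illustrates that general pattern under stated assumptions: the `current.json` file name, the manifest shape, and the `has_artifacts` check are hypothetical and are not taken from recsys-pipelines. See the Output layout reference for the real paths and manifest format.

```python
import json
import os
import tempfile
from pathlib import Path
from typing import Callable


def promote_manifest(candidate: dict, current_path: Path,
                     validate: Callable[[dict], bool]) -> bool:
    """Replace current_path with candidate only if validate(candidate) is true.

    The candidate is written to a temporary file in the same directory and
    then swapped in with os.replace, so readers see either the old manifest
    or the new one, never a partially written file.
    """
    if not validate(candidate):
        return False  # leave "current" untouched

    current_path.parent.mkdir(parents=True, exist_ok=True)
    fd, tmp_name = tempfile.mkstemp(dir=current_path.parent, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as fh:
            json.dump(candidate, fh, sort_keys=True)
            fh.flush()
            os.fsync(fh.fileno())
        os.replace(tmp_name, current_path)  # atomic swap on the same filesystem
    except BaseException:
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise
    return True


def has_artifacts(manifest: dict) -> bool:
    """Hypothetical guardrail: refuse to publish an empty manifest."""
    return bool(manifest.get("artifacts"))


with tempfile.TemporaryDirectory() as workdir:
    current = Path(workdir) / "current.json"  # hypothetical pointer file
    current.write_text(json.dumps({"artifacts": ["v1"]}), encoding="utf-8")

    # A backfill that produced nothing must not overwrite "current".
    rejected = promote_manifest({"artifacts": []}, current, has_artifacts)
    after_reject = json.loads(current.read_text(encoding="utf-8"))

    # A validated backfill result replaces "current" in one step.
    accepted = promote_manifest({"artifacts": ["v2"]}, current, has_artifacts)
    after_accept = json.loads(current.read_text(encoding="utf-8"))
```

The design choice here is that a failed validation is a no-op rather than an error, which keeps the previous "current" serving while the failure is investigated.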

Quick paths

Run a backfill: How-to: Run a backfill safely
Windows and backfills (concepts): Windows and backfills
Validation and guardrails: Validation and guardrails
Output layout (verify results): Output layout (local filesystem)

Checklist (safe default)

- Define the backfill window and why you need it
  - Start small (1–3 days) to validate assumptions.
- Run the backfill
  - Follow the canonical command patterns: How-to: Run a backfill safely
- Verify before publishing “current”
  - Inspect output locations and manifest pointers: Output layout (local filesystem)
- Watch guardrails and resource limits
  - Validation failures are designed to stop bad publishes: Validation and guardrails
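
The "start small" step in the checklist above can be planned ahead by splitting a long backfill range into short sub-windows and running them one at a time. The following is a minimal sketch of that planning step only: the `split_window` helper, the half-open `[start, end)` convention, and the 3-day default are assumptions made for illustration and are not part of the recsys-pipelines CLI. For the actual commands, follow How-to: Run a backfill safely.

```python
from datetime import date, timedelta
from typing import Iterator, Tuple


def split_window(start: date, end: date,
                 max_days: int = 3) -> Iterator[Tuple[date, date]]:
    """Yield consecutive [chunk_start, chunk_end) windows of at most max_days.

    `end` is exclusive (an assumption of this sketch), so the chunks cover
    the range exactly once with no gaps or overlaps.
    """
    if max_days < 1:
        raise ValueError("max_days must be at least 1")
    if end <= start:
        raise ValueError("end must be after start")

    cursor = start
    while cursor < end:
        chunk_end = min(cursor + timedelta(days=max_days), end)
        yield cursor, chunk_end
        cursor = chunk_end


# Plan a 7-day backfill as chunks of at most 3 days each.
chunks = list(split_window(date(2026, 1, 1), date(2026, 1, 8), max_days=3))
for chunk_start, chunk_end in chunks:
    print(f"backfill {chunk_start.isoformat()} -> {chunk_end.isoformat()} (exclusive)")
```

Running the first chunk alone and verifying its output before continuing limits how much work is wasted if an assumption about the data turns out to be wrong.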

Read next

Roll back safely: How-to: Roll back artifacts safely
Validation failed runbook: Runbook: Validation failed
Limit exceeded runbook: Runbook: Limit exceeded

2026-02-08

Aatu Harju
