Build and run a pipeline
Data pipelines transform raw data into analysis-ready datasets. In traditional workflows, this means manually writing transformation code, testing it, and carefully deploying to production - a process that can take days.
With Bauplan and AI agents, you can build and run a complete pipeline in minutes. The agent writes the transformation code, executes it on a safe development branch, and helps you iterate until it's ready for production.
main data branch (production)
│
├─── alice.pipeline-dev (development branch)
│ │
│ ├── Iteration 1: Agent builds and run pipeline, see results
│ ├── Iteration 2: Agent fixes bug, run again
│ └── Iteration 3: Add new model, verify
│
└─── (merge when ready) → main updated with new pipeline outputs
Bauplan's data branches make AI-assisted pipeline development safe. Just like Git branches for code, data branches isolate your pipeline outputs from production:
- Development branch: Agent builds and runs pipelines here - test freely without risk
mainbranch: Your production data - only updated when you're ready to merge- Iteration cycle: Generate code → Run pipeline → Review results → Refine → Repeat
- Atomic merges: When satisfied, merge everything to
mainin one operation
This is a very powerful capability of Bauplan: AI agents can write new data to your lakehouse safely and securely through branch isolation.
What you'll build: Code-first pipelines (Python or SQL) that live in Git, execute on specific branches, and automatically handle dependencies, parallelism, and data lineage.
Learn more: