Examples

Serverless Data Product
Serverless data product with built-in quality checks using Lambda and Bauplan.

RAG system with Pinecone
Build a RAG system with Pinecone and OpenAI over StackOverflow data.

Medallion Architecture + WAP Pattern
End-to-end data engineering repo using Mage & the medallion architecture.

From unstructured to structured data with LLMs
Convert PDFs into structured, analyzable tables using LLMs.

Playlist recommendations with MongoDB
Embedding-based recommender system for music playlists.

Iceberg Lakehouse Pipeline
Orchestrated WAP pattern for ingesting parquet files to Iceberg tables.

PDF analysis with bauplan and OpenAI
Analyze PDFs using Bauplan for data preparation and OpenAI's GPT for text analysis

ML Model Training and Deployment Pipeline
End-to-end ML pipeline for predicting taxi trip tips.

Entity Matching with OpenAI
Product matching across e-commerce catalogs using LLMs.

Near Real-time Analytics
Build near real-time analytics pipeline with WAP pattern and metrics visualization.

Interactive Data Dashboard
Build an interactive dashboard to visualize taxi pickup locations in NYC.