ExamplesΒΆ

Browse our collection of self-contained examples demonstrating various use cases with pre-built pipelines.

ML Pipeline Example

ML Model Training and Deployment Pipeline

End-to-end ML pipeline for predicting taxi trip tips.

Scikit-Learn Pandas Notebooks Streamlit
Entity Matching Example

Entity Matching with OpenAI

Product matching across e-commerce catalogs using LLMs.

OpenAI Streamlit Pandas DuckDB
Data Quality Example

Data Quality and Expectations

Implement data quality checks using expectations.

PyArrow Pandas DuckDB
Iceberg Lakehouse Example

Iceberg Lakehouse Pipeline

Orchestrated WAP pattern for ingesting parquet files to Iceberg tables.

Prefect Pandas Iceberg
Data Dashboard Example

Interactive Data Dashboard

Build an interactive dashboard to visualize taxi pickup locations in NYC.

Streamlit Pandas
Real-time Analytics Example

Near Real-time Analytics

Build near real-time analytics pipeline with WAP pattern and metrics visualization.

Prefect Streamlit DuckDB
Playlist recomendations with MongoDB

Playlist recomendations with MongoDB

Embedding-based recommender system for music playlists.

MongoDB Vector Search Recs