Courses 1, 3 & 4 of the Databricks Specialization on Coursera
Master Databricks from the ground up: start with Lakehouse Fundamentals (architecture, Spark, Delta Lake), then advance to MLOps and GenAI engineering through hands-on labs on Databricks Free Edition, with deep understanding gained by building equivalent systems from scratch using the Sovereign AI Stack (Rust).
| Course | Title | Duration | Focus |
|---|---|---|---|
| 1 | Lakehouse Fundamentals | 3 weeks | Lakehouse Architecture, Spark, Delta Lake |
| 3 | MLOps Engineering | 3 weeks | MLflow, Feature Store, Model Serving |
| 4 | GenAI Engineering | 3 weeks | Vector Search, RAG, Fine-Tuning |
- Basic SQL and Python familiarity
- Databricks Free Edition account
- For Courses 3 & 4: Basic Rust familiarity (ownership, cargo)
# Clone repository
git clone https://github.com/paiml/DB-mlops-genai.git
cd DB-mlops-genai
# Install Python dependencies
pip install uv && uv sync --all-extras
# Install Rust toolchain (for Sovereign AI Stack)
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
# Install Sovereign AI Stack tools
cargo install batuta realizar pmat# Setup environment
make setup
# Run quality checks
make check
# Run demos
make demos
# Run specific course tests
make test-course1
make test-course3
make test-course4Dual-layer pedagogy:
- Databricks layer (~80%) — Use production MLOps and GenAI tools effectively
- Sovereign AI Stack layer (~20%) — Build equivalent systems in Rust to understand internals
"Understand by building" creates deeper retention than tutorials alone.
demos/
course1/ # Lakehouse Fundamentals demos (weeks 1-3)
course3/ # MLOps demos (weeks 1-3)
course4/ # GenAI demos (weeks 1-3)
labs/
course1/ # Lakehouse hands-on labs
course3/ # MLOps hands-on labs
course4/ # GenAI hands-on labs
examples/
databricks/ # Databricks notebook examples
sovereign/ # Sovereign AI Stack (Rust) examples
docs/
outline.md # Course outline
| Week | Topic | Key Concepts |
|---|---|---|
| 1 | Lakehouse Architecture & Platform | Data architecture evolution, Unity Catalog, compute |
| 2 | Spark Fundamentals | Notebooks, DataFrames, select/filter/groupBy/join |
| 3 | Delta Lake & Workflows | ACID transactions, MERGE, time travel, Jobs |
Note: Course 1 is Databricks-only — no Sovereign AI Stack component.
| Week | Topic | Databricks | Sovereign AI |
|---|---|---|---|
| 1 | Tracking & Features | MLflow, Feature Store | trueno, alimentar |
| 2 | Training & Serving | AutoML, Model Serving | aprender, realizar |
| 3 | Production | Workflows, Monitoring | batuta, pmat |
| Week | Topic | Databricks | Sovereign AI |
|---|---|---|---|
| 1 | LLM & Prompts | Foundation Models | realizar |
| 2 | Vectors & RAG | Vector Search, RAG | trueno-rag |
| 3 | Fine-tune & Production | Fine-tuning, Serving | entrenar, batuta |
| Skill | Databricks Certification |
|---|---|
| MLflow Tracking & Registry | ML Associate |
| Feature Engineering | ML Associate |
| Model Serving | ML Associate |
| Foundation Model APIs | GenAI Engineer |
| Vector Search & RAG | GenAI Engineer |
| Fine-Tuning | GenAI Engineer |
make check # Lint + test
make format # Auto-format code
make test-course1 # Course 1 tests only (syntax)
make test-course3 # Course 3 tests only
make test-course4 # Course 4 tests only- Fork the repository
- Create a feature branch (
git checkout -b feature/my-feature) - Run quality checks (
make check) - Commit changes (
git commit -m 'Add feature') - Push to branch (
git push origin feature/my-feature) - Open a Pull Request
See CONTRIBUTING.md for detailed guidelines.
MIT