$ whoami
> Jacopo Sardellini — Data Engineer & Python developer
$ uptime
> been running since 19xx
> load average: coffee dependent
$ cat /etc/motto
> "It Ain’t What You Don’t Know That Gets You Into Trouble. It’s What You Know for Sure That Just Ain’t So."
{
  "data_engineering": {
    "orchestration": ["Apache Airflow"],
    "transformation": ["Dataform", "dbt"],
    "processing": ["PySpark", "Pandas", "Polars"],
    "messaging": ["Apache Kafka", "Pub/Sub"],
    "data_quality": ["Soda", "ODCS"],
    "cloud": ["GCP", "AWS"]
  },
  "software_engineering": {
    "languages": ["Python", "C", "SQL"],
    "databases": ["PostgreSQL", "SQLite", "MongoDB", "Firestore"],
    "infra": ["Docker", "Terraform"],
    "devops": ["Git", "GitHub Actions", "Linux"]
  },
  "currently_learning": ["Apache Iceberg", "Flink"]
}

| Project | Stack | Description |
|---|---|---|
| 🔧 iceberg-lakehouse | Apache Iceberg · Trino · Nessie · Docker · MinIO | Apache Iceberg Lakehouse using MinIO, Trino and Nessie. |
| 📊 zillow-pipeline | Python · Apache Airflow · AWS | End-to-end ETL pipeline for Zillow real estate data. |
| 🤖 design-patterns | Python | A curated collection of design patterns in Python with clear examples and explanations. |
PRINCIPLES TOWARD ZEN
inspired by The Zen of Python, but slightly more caffeinated.
[1] The best code is the one you didn't write yourself.
[2] Every pipeline has an owner. If you don't know who, it's you.
[3] Pipelines need to be tested as well.
[4] The best backfill is the one you never have to run.
[5] If it isn't monitored, it isn't running — it's just hoping.
[6] Production is not the place to discover your assumptions.
$ shutdown -h now
> status: still debugging life
> warnings: ignored
> Connection closed by remote host.