Skip to content
View shsiddhant's full-sized avatar

Block or report shsiddhant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shsiddhant/README.md

Hi, I'm Siddhant

I build end-to-end data pipelines and analytical systems using Python and SQL, with a focus on data modeling and orchestration.

I particularly enjoy analytics pipelines and tools around things I find worth tracking, viz. cricket and music, mainly.


Projects

Cricket Warehouse

End-to-end ELT data pipeline for ball-by-ball cricket match data using Python, PostgreSQL, dbt, and Airflow.

Includes ingestion, incremental loading, layered data modeling, and fully orchestrated transformations via Airflow (Astronomer Cosmos).

🔗 https://github.com/shsiddhant/cricket-warehouse


memory.fm

Python library, CLI tool, and dashboard for exploring music listening history from Last.fm and Spotify.

Focuses on temporal patterns such as attachment, repetition, and listening streaks.

🔗 https://github.com/shsiddhant/memory.fm


Women’s Cricket World Cup Prediction

Machine learning project predicting match outcomes using features engineered from historical match data.

🔗 https://github.com/shsiddhant/womens-wc


memory.journal

Lightweight offline journaling application with password protection and Markdown support.

🔗 https://github.com/shsiddhant/memory.journal

Stack

Python Postgres dbt Apache Airflow Docker

Python • SQL • PostgreSQL • dbt • Airflow • Docker • Pandas • NumPy • Git


Interests

  • Data engineering
  • Data warehouses and analytics pipelines
  • Sports analytics
  • Personal data exploration tools
  • Python-based CLI tools

Currently Exploring

  • Improving pipeline design and orchestration patterns.
  • Performance optimization for data processing workflows.
  • T20 Cricket analytics

Connect with me

GitHub Email LinkedIn

Pinned Loading

  1. cricket-warehouse cricket-warehouse Public

    An ELT pipeline to build data warehouse for ball-by-ball cricket match data, designed for analytics and modeling.

    Python 1

  2. memory.fm memory.fm Public

    A Python library, CLI tool, and web-based dashboard for exploring music listening history from Last.fm and Spotify.

    Python 1

  3. womens-wc womens-wc Public

    ML project to predict match outcomes for Women's Cricket World Cup 2025.

    Jupyter Notebook 1

  4. memory.journal memory.journal Public

    A simple and lightweight journaling app.

    HTML