SQL-powered analysis of SpaceX launch data using SQLite in a Python environment. Explore payloads, landing outcomes, and booster performance through SQL queries and pandas.
π Interactive dashboard built in Power BI showcasing payload mass, landing outcome, booster version comparison with Launch site.
This project demonstrates SQL-powered exploration of SpaceX launch data using a local SQLite database and Jupyter Notebook. It combines SQL querying with Python's pandas and SQLite to uncover insights into SpaceXβs missions, including launch sites, booster performance, payloads, and landing outcomes.
| Attribute | Details |
|---|---|
| Dataset | SpaceX Launch Dataset (IBM Capstone) |
| Source | CSV Link |
| Tools | SQL (via ipython-sql), SQLite, pandas |
| Environment | Jupyter Notebook |
| Focus | SQL Queries & Data Analysis |
- Connect to a SQLite database and load launch data
- Clean and prepare the dataset using pandas
- Create SQL tables and run queries inside Jupyter using
%sql - Perform exploratory analysis using SQL:
- Launch sites
- Booster versions
- Payload statistics
- Landing outcomes
- Mission trends over time
| Task | Description |
|---|---|
| π Task 1 | List all distinct launch sites |
| π Task 2 | Filter launches starting with "CCA" |
| π Task 3 | Total payload mass by NASA (CRS) |
| π Task 4 | Average payload for F9 v1.1 |
| π Task 5 | First successful ground pad landing date |
| π Task 6 | Boosters with 4000β6000kg payloads & drone ship landings |
| π Task 7 | Count of missions by outcome |
| π Task 8 | Boosters carrying the maximum payload |
| π Task 9 | 2015 landing outcome by month |
| π Task 10 | Landing outcomes between specific dates |
- ποΈ SQL query writing & filtering
- π Aggregations, sorting, and conditional grouping
- π§©
CASE WHENlogic for labeling and categorization - π Date parsing and filtering by time ranges
- π Jupyter Notebook integration for SQL workflows using
%load_ext sqland%sqlmagic - π§Ή Basic data cleaning and preprocessing with pandas
- π SQL database setup with Python using
sqlite3and pandas.to_sql()method for seamless transition from CSV to SQL
Rahul Arya
π B.Sc. Physics | π IBM & Stanford ML Certified | π‘ Data Science Enthusiast
