EcoPredict AI is an advanced web-based application for predicting greenhouse gas (GHG) emissions in supply chains using artificial intelligence and machine learning technologies. Built with Streamlit and powered by a Linear Regression model trained on EPA's official supply chain emission data from 2010β2016, the application provides intelligent predictions for emission factors across various US industries and commodities.
The system features a modern, interactive GUI with glassmorphism design, animated gradients, and professional styling. It allows users to input parameters such as:
-
Greenhouse gas type (carbon dioxide, methane, nitrous oxide, or others)
-
Measurement units (kg, tons, COβe, etc.)
-
Emission factors
-
Comprehensive data quality metrics:
- Reliability
- Temporal correlation
- Geographic correlation
- Technological correlation
- Data collection quality
User inputs are processed through a sophisticated preprocessing pipeline, standardized using scaling, and sent to the ML model for real-time predictions. Visual analytics include interactive Plotly charts such as gauges, bar charts, and radar plots. The platform also offers AI-powered sustainability recommendations based on input trends.
It serves as both an educational tool and practical solution for environmental impact analysisβideal for researchers, analysts, and organizations aiming to optimize their carbon footprint.
πΏ Climate change is one of the most pressing global challenges. Accurate emission forecasting can support better decisions for a greener future. EcoPredict AI helps:
- π Predict GHG emissions by sector or activity
- π§ Visualize trends & outcomes
- β‘ Deliver real-time results via a beautiful interface
- π Encourage data-driven climate action
- β AI-powered emission predictions using EPA data
- π Glassmorphism UI with responsive layout
- π Real-time charts: Gauge, Bar, Radar (Plotly-powered)
- π Data quality scoring system (5 metrics)
- π§ AI suggestions for emission reduction
- π Upload and compare industry datasets
- π‘οΈ Error handling and validation checks
- π Deployable via Streamlit Cloud or localhost
EcoPredict AI provides graphical output that enhances interpretation and decision-making. Users receive:
- π Line Graphs: Track emission trends across inputs
- π Bar Charts: Compare emissions between industries
- π― Gauge Charts: Assess overall impact severity
- π Radar Charts: Visualize data quality profiles
All visualizations are interactive and rendered using Plotly.
The AI model was built and trained using the following pipeline:
-
Core Data Processing:
pandas,numpy
-
Visualization:
matplotlib.pyplot,seaborn,plotly
-
Machine Learning:
-
sklearn.model_selection:train_test_split,GridSearchCV
-
sklearn.preprocessing:StandardScaler
-
sklearn.linear_model:LinearRegression
-
sklearn.ensemble:RandomForestRegressor
-
sklearn.metrics:mean_squared_error,r2_score
-
-
Model Persistence:
joblib
- Preprocessing: null value handling, scaling
- Feature engineering
- Model training using Linear Regression (final model)
- Evaluation using MSE and RΒ²
- Hyperparameter tuning with GridSearchCV
- Models saved as
LR_model.pklandscaler.pkl
| Layer | Tools Used |
|---|---|
| Backend | Python, Pandas, Scikit-learn, LinearRegression |
| Frontend/UI | Streamlit, Plotly |
| Deployment | Streamlit Cloud / Local Host |
| Visualization | Plotly, Seaborn, Matplotlib |
git clone https://github.com/yourusername/EcoPredict-AI.git
cd EcoPredict-AIpip install -r requirements.txtstreamlit run app.pyCheck out the app live here π EcoPredict AI - Streamlit App
EcoPredict-AI/
βββ app.py # Streamlit App Logic
βββ model/
β βββ LR_model.pkl # Trained Linear Regression Model
βββ scaler/
β βββ scaler.pkl # StandardScaler instance
βββ data/
β βββ emissions.csv # Industry Emissions Data
βββ assets/
β βββ ecopredict_banner.png # Project banner image
βββ utils/
β βββ preprocess.py # Data Cleaning & Helper Functions
βββ requirements.txt
βββ README.md
Input: Industry: "Steel Manufacturing", GHG: "Methane", Unit: "Ton" Prediction: ~4.82 metric tons COβe emitted per ton produced.
Gauge, radar, and bar charts will help interpret the output visually.
We welcome all contributions, ideas, and suggestions!
- Fork the repository
- Create your feature branch (
git checkout -b feature/AmazingFeature) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/AmazingFeature) - Open a Pull Request
Licensed under the MIT License.
Copyright (c) 2025 Aayush Kumar
Permission is hereby granted to use, copy, modify, and distribute this software for any purpose with or without fee, provided that the above copyright notice appears in all copies.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND.
- EPA Supply Chain GHG Emissions Data
- IBM Greenhouse Gas Project
- Streamlit Community
- Scikit-learn Contributors
- OpenAI Copilot
Aayush Kumar π« Email: [aayush05.af@gmail.com] π LinkedIn: linkedin.com/in/aayush-kumar-146252314
"The best way to predict the future is to design it sustainably." π







