Skip to content

Ra638/Movie_Revenue_Prediction-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

11 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🎬 Movie Revenue Prediction πŸ’°

This project is a Machine Learning-based application that predicts a movie's gross revenue based on budget, number of votes, runtime, and other important factors.

The project applies Linear Regression, Polynomial Regression, Random Forest, and XGBoost models to find the best predictions. We use GridSearchCV to fine-tune hyperparameters and deploy an interactive Streamlit web app for real-time predictions.


πŸš€ Features

βœ” Data Analysis & Visualization – Correlation heatmaps, bar plots, scatter plots, etc.
βœ” Machine Learning Models – Linear Regression, Polynomial Regression, Random Forest, and XGBoost.
βœ” Hyperparameter Optimization – Uses GridSearchCV for the best model settings.
βœ” Feature Importance Analysis – Identifies the most important factors affecting revenue.
βœ” Interactive Web App – A Streamlit-based UI where users can enter movie details and get predictions.
βœ” Model Persistence – Saves the best-trained model for future use.


πŸ“‚ Project Structure

Movie_Revenue_Prediction/
│── πŸ“ data/ # Contains raw dataset
β”‚ β”œβ”€β”€ movies.csv # Raw movie dataset
β”‚
│── πŸ“ models/ # Contains scripts for ML training
β”‚ β”œβ”€β”€ analysis_and_charts.py # Data analysis & visualization (correlation heatmaps, graphs)
β”‚ β”œβ”€β”€ model.py # Machine Learning model implementation
β”‚ β”œβ”€β”€ trained_model.py # Script for model training & tuning
β”‚
│── πŸ“„ app.py # Streamlit web app for predictions
│── πŸ“„ requirements.txt # Dependencies needed for the project
│── πŸ“„ README.md # Documentation

πŸ“Š Machine Learning Models Used

  • Linear Regression πŸ“‰
  • Polynomial Regression πŸ”΅
  • Random Forest 🌳
  • XGBoost ⚑ (Best performing model)

We applied hyperparameter tuning using GridSearchCV to optimize performance.


βš™οΈ Installation & Setup

1️⃣ Clone the Repository

git clone https://github.com/Ra638/Movie_Revenue_Prediction.git  
cd Movie_Revenue_Prediction  
pip install -r requirements.txt  
python models/analysis_and_charts.py  
python models/trained_model.py  
python models/trained_model.py

πŸ”₯ Model Performance

Model                           MAE   MSE        RΒ² Score  
---------------------------------------------------------  
Linear Regression               70M   1.77e+16   0.50  
Polynomial Regression (Degree 2) 49M   1.16e+16   0.67  
Polynomial Regression (Degree 3) 49M   1.16e+16   0.67  
Random Forest                   49M   1.09e+16   0.69  
XGBoost (Tuned)                 49M   9.69e+15   0.77  

βœ… XGBoost performs the best with RΒ² = 0.77!

🎨 Web App Preview

The project includes a Streamlit Web App where users can enter details and get predictions.

πŸ›  Future Improvements

πŸš€ Enhance Feature Engineering – Add more relevant features πŸ“ˆ Try Deep Learning Models – Test Neural Networks for better accuracy 🌐 Deploy Online – Host the app on AWS/GCP/Heroku

🀝 Contributing

Want to improve this project? Feel free to fork and submit a Pull Request 🎯

About

πŸ“Š Movie Revenue Prediction – A data-driven project leveraging EDA, feature engineering, and ML models (XGBoost, Random Forest, Regression) to predict movie revenue. Includes data visualization, model tuning, and an interactive Streamlit app! πŸš€πŸŽ¬

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors