Credit Scoring Model

This project develops a Credit Scoring Model to evaluate the creditworthiness of clients using statistical and data science techniques.
It includes data preprocessing, exploratory data analysis (EDA), feature engineering, and implementation of predictive models using machine learning.

🧠 Project Overview

The purpose of this project is to design and implement a credit risk scoring system capable of classifying clients according to their probability of default.
The workflow follows the full data science pipeline, from data ingestion and cleaning to exploratory analysis and model construction.

🎯 Objectives

Analyze and preprocess client financial data.
Identify key risk factors affecting credit performance.
Build classification models to predict credit scores using multiple machine learning algorithms.
Visualize results to support business interpretation.

🏗️ Project Structure

Analyze and preprocess client financial data.
Identify key risk factors affecting credit performance.
Build a classification model to predict credit scores.
Visualize results to support business interpretation.

🏗️ Project Structure

Modelo_Puntuacion_Crediticia/
│
├── Data/ # Raw and cleaned datasets
│ └── clean_data.xlsx
├── analysis/ # EDA scripts and notebooks
├── Models/ # Python scripts for ML models
│ ├── NN.py # Neural Network implementation
│ ├── XGBoost.py # XGBoost implementation
│ ├── benchmarkmodel.py # Logistic Regression benchmark
│ ├── decision_tree.py # Decision Tree implementation
│ ├── random_forest.py # Random Forest implementation
│ ├── data.py # Data preprocessing functions
│ └── functions.py # Helper functions
├── save_models/ # Trained model files
│ ├── DecisionTree.pkl
│ ├── LogisticRegression_benchmark.pkl
│ ├── XGBoost.pkl
│ └── credit_score_nn_model.h5
├── notebooks/ # Jupyter notebooks
│ └── Modelo_puntuacion_crediticia.ipynb
├── documentation/ # Exported reports
│ ├── Modelo_puntuacion_crediticia.docx
│ ├── Modelo_puntuacion_crediticia.html
│ └── Modelo_puntuacion_crediticia.pdf
├── .gitattributes
├── .gitignore
├── LICENSE
├── README.md
└── requirements.txt

⚙️ Methodology

Data Loading & Exploration

Import datasets and perform initial checks on data quality, structure, and distribution.

Data Cleaning

Handle missing values, outliers, and inconsistencies in the financial data.

Feature Engineering

Encode categorical variables with LabelEncoder and create new features based on financial indicators.

Exploratory Data Analysis (EDA)

Use pandas, matplotlib, and seaborn to visualize correlations, distributions, and patterns.

Modeling

Implement multiple classification models to predict creditworthiness:
- Logistic Regression (benchmark)
- Decision Tree
- Random Forest
- XGBoost
- Neural Network
Evaluate models using appropriate metrics (accuracy, AUC, confusion matrix, etc.)

🧩 Technologies Used

Library	Purpose
NumPy	Numerical operations
Pandas	Data manipulation and analysis
Matplotlib / Seaborn	Data visualization
SciPy	Statistical analysis
scikit-learn	Preprocessing and machine learning models
TensorFlow / Keras	Neural Network modeling
os	File and directory management

🚀 Installation & Usage

Clone the Repository

git clone https://github.com/diegotita4/MC_P2.git

2. Install Dependencies

pip install -r requirements.txt

📊 Results (Summary)

Full EDA identifying main drivers of credit performance.
Feature transformations improving model interpretability.
Multiple machine learning models with saved predictions and trained models for future use.
Reports and visualizations available in documentation/.

Detailed metrics and model evaluation can be found in the notebook.

👥 Authors

Project: Credit Scoring Model
Team:

Mugica Liparoli Juan Antonio
Enríquez Nares Diego Emilio
Brizuela Casarín Ana Sofía
Course: Credit Models
Professor: Rodolfo Slay Ramos
Date: September 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Credit Scoring Model

🧠 Project Overview

🎯 Objectives

🏗️ Project Structure

🏗️ Project Structure

⚙️ Methodology

Data Loading & Exploration

Data Cleaning

Feature Engineering

Exploratory Data Analysis (EDA)

Modeling

🧩 Technologies Used

🚀 Installation & Usage

2. Install Dependencies

📊 Results (Summary)

👥 Authors

🪪 License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Data		Data
analysis		analysis
save_models		save_models
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
Modelo_puntuacion_crediticia.docx		Modelo_puntuacion_crediticia.docx
Modelo_puntuacion_crediticia.html		Modelo_puntuacion_crediticia.html
Modelo_puntuacion_crediticia.ipynb		Modelo_puntuacion_crediticia.ipynb
Modelo_puntuacion_crediticia.pdf		Modelo_puntuacion_crediticia.pdf
README.md		README.md
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Credit Scoring Model

🧠 Project Overview

🎯 Objectives

🏗️ Project Structure

🏗️ Project Structure

⚙️ Methodology

Data Loading & Exploration

Data Cleaning

Feature Engineering

Exploratory Data Analysis (EDA)

Modeling

🧩 Technologies Used

🚀 Installation & Usage

2. Install Dependencies

📊 Results (Summary)

👥 Authors

🪪 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages