🧠 English to Italian Translation using Generative Transformers

This repository contains a custom-built Generatively Pretrained Transformer (GPT) model from scratch for English to Italian machine translation, based on the seminal paper “Attention Is All You Need”.

🚀 Built with PyTorch | Hugging Face | Google Colab | Streamlit

🖥️ Demo

📌 Features

✅ Fully custom implementation of Transformer architecture
⚙️ Encoder–Decoder blocks, Multi-Head Attention, Positional Encoding, Layer Normalization
🔍 Beam Search decoding for improved translation fluency
🌐 Attention heatmap visualizations (encoder, decoder, and cross-attention)
💻 Interactive Streamlit app for real-time translation and visualization
☁️ Trained on NVIDIA A100 GPU via Google Colab

🔧 Technologies Used

Frameworks: PyTorch, Hugging Face Datasets
Training Tools: Google Colab (A100), OneCycleLR scheduler, Label Smoothing
Visualization: Matplotlib, Seaborn, Streamlit
Languages: Python
Dataset: Helsinki-NLP / Opus Books (en-it)

🧠 Model Architecture

6 Encoder + 6 Decoder layers
8 Attention Heads
Positional Encoding (sinusoidal)
Masked Multi-Head Attention for autoregressive decoding
Feed-forward sublayers per token
Residual connections + LayerNorm

⚙️ How to Run

🔁 Clone and Install

git clone https://github.com/Mehardeep79/Transformer-Translator.git
cd english-to-italian-transformer
pip install -r requirements.txt

🛠️ Train the Model

If you have strong GPUs, then go ahead by running the train.py file or Train.ipynb notebook.
An alternative to not having strong GPUs is to train the model using Google Colab's paid version which gives you access to Nvidia's A100 GPU along with 100 compute units. You can access the google colab notebooks in the Colab Notebooks directory where you can find the Final_Training.ipynb colab notebook.
Your models will be saved in the opus_books_weights directory for all 30 epochs and the best model will be named with the highest BLEU score.

🗣️ Inference and Custom Inference

For inference, you can either run the Inference.ipynb or run it on colab by running Final_Inference.ipynb in the Colab Notebooks directory.
For custom inference, you can either run the Translate.py file or the same colab notebook Final_Inference.ipynb.

📊 Attention Visualization

You can either visualize the attention by running attention_visualization.py file or attention.ipynb notebook.
For visualizing the attention on google colab, you can run Attention_Visualization.ipynb in the Colab Notebooks directory.

💻 Streamlit Application

You can have an interactive interface for this translation model along with the visualizations by running the app2.py file which can be done directly in the terminal using:

streamlit run app2.py

🧑‍💻 Authors

Mehardeep Singh Sandhu

BTech Electronics & Communication

Shiv Nadar University

LinkedIn | GitHub
Pasumarthy Akshat Naidhruv

BTech Electronics & Communication

Shiv Nadar University

LinkedIn | GitHub

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.devcontainer		.devcontainer
.github/workflows		.github/workflows
Colab Notebooks		Colab Notebooks
__pycache__		__pycache__
assets		assets
vocab		vocab
Inference.ipynb		Inference.ipynb
README.md		README.md
Train.ipynb		Train.ipynb
Translate.py		Translate.py
app2.py		app2.py
attention.ipynb		attention.ipynb
attention_visualization.py		attention_visualization.py
config.py		config.py
dataset.py		dataset.py
model.py		model.py
requirements.txt		requirements.txt
runtime.txt		runtime.txt
tokenizer_en.json		tokenizer_en.json
tokenizer_it.json		tokenizer_it.json
train.py		train.py
upload_model.py		upload_model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 English to Italian Translation using Generative Transformers

🖥️ Demo

📌 Features

🔧 Technologies Used

🧠 Model Architecture

⚙️ How to Run

🔁 Clone and Install

🛠️ Train the Model

🗣️ Inference and Custom Inference

📊 Attention Visualization

💻 Streamlit Application

🧑‍💻 Authors

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 English to Italian Translation using Generative Transformers

🖥️ Demo

📌 Features

🔧 Technologies Used

🧠 Model Architecture

⚙️ How to Run

🔁 Clone and Install

🛠️ Train the Model

🗣️ Inference and Custom Inference

📊 Attention Visualization

💻 Streamlit Application

🧑‍💻 Authors

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages