AirTrace: Gesture-Controlled Spatial Sketching

AirTrace is a real-time computer vision application that transforms your hand into a digital paintbrush. By leveraging Google MediaPipe’s Tasks API and OpenCV, the system tracks hand landmarks with high precision, allowing users to draw in a 3D-like digital space through simple gestures.

Features

Pinch-to-Draw: Uses a natural "pinch" gesture (connecting thumb and index finger) to activate the digital ink.
Persistent Canvas: Drawings are maintained on a dedicated transparent layer overlaid on the live camera feed.
Spatial UI: Features an on-screen "RESET" button at the bottom-right corner for clearing the canvas.
High-Quality Rendering: Uses anti-aliasing to ensure smooth, professional-looking strokes even at high speeds.
Optimized Performance: Built with the MediaPipe 2026 Tasks API for low-latency tracking on modern Python environments.

Installation

1. Clone the Repository

git clone https://github.com/NKumar-B/VisionSketch_OpenCV.git
cd VisionSketch_OpenCV

2. Set Up a Virtual Environment (Recommended)

python -m venv .venv
.\.venv\Scripts\activate

3. Install Dependencies

pip install -r requirements.txt  #see the requirements.txt file to install the required libraries

4. Download the Model File

You must download the Hand Landmarker model from Google and place it in the root directory:

Model Name: hand_landmarker.task
Download Link: Google MediaPipe Model Garden

How to Use

Run the script: python AirTrace<img width="798" height="632" alt="AirTrace" src="https://github.com/user-attachments/assets/c1fd0d2f-8598-4c1f-acc2-d39997aee9e4" /> .py
Drawing: Bring your Index Finger and Thumb together (pinch) to begin drawing in green ink.
Moving: Release the pinch to move your hand without drawing.
Resetting: Pinch your fingers while hovering over the red RESET box in the bottom-right corner to clear the screen.
Exit: Press the 'q' key on your keyboard to close the application.

Technical Overview

The application follows a modular computer vision pipeline:

Preprocessing: The input frame is flipped horizontally to provide a "mirror-like" intuitive experience for the user.
Detection: MediaPipe's HandLandmarker identifies 21 unique landmarks. We specifically track Landmark 8 (Index Tip) and Landmark 4 (Thumb Tip).
Distance Logic: We calculate the Euclidean distance between these two points. If the distance falls below a specific threshold (e.g., 6% of the screen width), the "pinch" is triggered.
Layer Merging: Drawing occurs on a separate black canvas. We create a binary mask of this canvas and use bitwise operations to overlay only the non-black pixels onto the live camera feed.

License

Distributed under the MIT License. See LICENSE for more information.

Acknowledgments

Google MediaPipe: For providing the robust Face Landmarker Tasks API and pre-trained .task models.
OpenCV (Open Source Computer Vision Library): For the powerful real-time image processing and visualization tools.
The COCO Dataset Team: For their foundational work in standardizing computer vision training data.
NumPy: For the efficient numerical processing required for coordinate mapping.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.gitignore		.gitignore
AirTrace.py		AirTrace.py
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AirTrace: Gesture-Controlled Spatial Sketching

Features

Installation

1. Clone the Repository

2. Set Up a Virtual Environment (Recommended)

3. Install Dependencies

4. Download the Model File

How to Use

Technical Overview

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AirTrace: Gesture-Controlled Spatial Sketching

Features

Installation

1. Clone the Repository

2. Set Up a Virtual Environment (Recommended)

3. Install Dependencies

4. Download the Model File

How to Use

Technical Overview

License

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages