For Deaf and non-verbal individuals, digital communication often strips away the nuance of human connection. Standard text-to-speech tools are "robotic" and lose the urgency, joy, or hesitation behind the words.
We didn't want to build just another translator. We wanted to build a communication co-pilot—one that understands not just what you are saying, but how you are feeling, restoring the emotional bridge that is often lost in translation.
Try it out live here!
VitalSign is a web-based accessibility dashboard that:
- Observes hand gestures in real time using your webcam.
- Detects emotional context (facial expressions) alongside the signs.
- Refines disjointed sign glosses (e.g., "I doctor scared pain chest") into fluent, natural English ("I'm scared—my chest hurts and I need a doctor") using Google Gemini.
- Speaks the message using ElevenLabs, dynamically adjusting the voice's stability and tone to match the user's detected emotion (e.g., calm vs. urgent).
We built VitalSign as a responsive Next.js application to ensure it works on any device with a browser and camera.
- The Eye (Computer Vision) 👁️
- MediaPipe Tasks Vision: We implemented HandLandmarker to track 21 distinct hand points directly in the browser.
- Gesture Logic: We built a custom geometric analyzer in HandTracker.jsx to map finger states and motion vectors to specific ASL phrases (e.g., distinguishing "Help" vs. "Hello" based on palm orientation and movement).
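As a hedged sketch of what that geometric analysis involves: MediaPipe's HandLandmarker returns 21 normalized {x, y, z} landmarks per hand (wrist at index 0, fingertips at 8/12/16/20), and sign detection reduces to distance and orientation math over those points. The function names and thresholds below are ours, simplified for illustration, not the actual HandTracker.jsx code:

```javascript
// Simplified, hypothetical version of the landmark geometry behind gesture
// detection. MediaPipe's HandLandmarker returns 21 normalized {x, y, z}
// landmarks per hand: 0 is the wrist, 8/12/16/20 are the index/middle/
// ring/pinky tips, 6/10/14/18 the corresponding PIP joints.

const dist = (a, b) => Math.hypot(a.x - b.x, a.y - b.y, a.z - b.z);

// Crude but fast heuristic: a finger is "extended" when its tip lies
// farther from the wrist than its PIP joint does.
function isFingerExtended(landmarks, tipIdx, pipIdx) {
  const wrist = landmarks[0];
  return dist(landmarks[tipIdx], wrist) > dist(landmarks[pipIdx], wrist);
}

// Rough open-palm check (e.g., the starting shape of a waving "Hello"):
// all four non-thumb fingers extended.
function isOpenPalm(landmarks) {
  return [[8, 6], [12, 10], [16, 14], [20, 18]]
    .every(([tip, pip]) => isFingerExtended(landmarks, tip, pip));
}
```

In practice a real classifier would also compare palm orientation and motion vectors across frames, which is what lets similar handshapes like "Help" and "Hello" be told apart.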
- The Brain (Generative AI) 🧠
- Google Gemini 2.0 Flash: We use Gemini not just for grammar, but for intent classification. It takes the raw stream of glosses and the "Physical Intensity" score from the vision engine to rewrite the sentence with the correct emotional weight.
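Folding the gloss stream and the intensity score into one request could look roughly like this. This is an illustrative sketch only: buildGeminiPrompt, the 0–1 intensity scale, and the thresholds are our assumptions, not VitalSign's actual prompt:

```javascript
// Illustrative sketch: combine raw sign glosses with the vision engine's
// "Physical Intensity" score (assumed to be 0..1) into a single prompt.
// The function name and thresholds are hypothetical, not from the repo.
function buildGeminiPrompt(glosses, physicalIntensity) {
  const tone =
    physicalIntensity > 0.7 ? "urgent" :
    physicalIntensity > 0.4 ? "emphatic" :
    "calm";
  return [
    "You are an ASL interpreter.",
    `Rewrite the following sign glosses as one fluent English sentence with a ${tone} tone.`,
    `Glosses: ${glosses.join(" ")}`,
  ].join("\n");
}
```

The returned string would then be sent to the Gemini API server-side, keeping the key out of the browser.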
- The Voice (Expressive TTS) 🔊
- ElevenLabs API: Instead of a flat voice, we dynamically stream audio buffers. If Gemini flags the sentiment as "Urgent" or "Happy," we modulate the stability and style parameters of the ElevenLabs engine to produce a voice that sounds genuinely human.
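A minimal version of that sentiment-to-voice mapping might look like this. ElevenLabs' voice_settings does expose stability, similarity_boost, and style as 0–1 parameters, but the specific values below are illustrative guesses, not VitalSign's tuned ones:

```javascript
// Hedged sketch: map the sentiment label returned by Gemini onto
// ElevenLabs voice_settings. stability/similarity_boost/style are real
// ElevenLabs parameters (all 0..1), but these values are illustrative.
function voiceSettingsFor(sentiment) {
  switch (sentiment) {
    case "Urgent":
      // Low stability lets the delivery vary more, which reads as urgency.
      return { stability: 0.2, similarity_boost: 0.8, style: 0.9 };
    case "Happy":
      return { stability: 0.4, similarity_boost: 0.8, style: 0.7 };
    default:
      // Calm/neutral: steady, restrained delivery.
      return { stability: 0.75, similarity_boost: 0.75, style: 0.2 };
  }
}
```

The resulting object would be passed as the voice_settings field of the text-to-speech request body.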
The Frontend (UI/UX) 💻
- React + Next.js: Built for low-latency performance.
- Broadcast Mode: Designed for integration with OBS Studio, allowing users to pipe their translated subtitles directly into Zoom, Slack, or Google Meet calls.
Prerequisites
- Node.js 18+
- A webcam
- API Keys for Google Gemini and ElevenLabs
- Clone the repo
git clone https://github.com/liamma06/vitalsign.git
cd vitalsign
- Install dependencies
npm install
- Set up environment variables: create a .env.local file in the root directory:
GEMINI_API_KEY=your_gemini_key_here
ELEVENLABS_API_KEY=your_elevenlabs_key_here
ELEVENLABS_VOICE_ID=your_preferred_voice_id
- Run the development server
npm run dev
- Open in browser: Navigate to http://localhost:3000 to see the dashboard.
- Broadcast Mode (for Zoom/Meet): Navigate to http://localhost:3000/broadcast and use OBS Studio to capture the window as a Virtual Camera.
Latency Matters: Balancing the heavy lifting of MediaPipe (client-side) with the intelligence of Gemini (server-side) required careful optimization to keep the "conversation" feeling real-time.
Geometric Math is Hard: Distinguishing between similar signs (like "Yes" vs. "No") purely using vector math was a fun challenge in 3D spatial reasoning.
Emotion is Data: We learned that "accessibility" isn't just about utility; it's about dignity. Giving a user a voice that sounds like them (happy, sad, urgent) is just as important as the words themselves.
Full ASL Vocabulary: Training a custom model to recognize thousands of signs beyond our MVP demo set.
Biometric Integration: Using smartwatches to detect heart rate and automatically trigger "Urgent" voice modes during high-stress situations.
Mobile App: Porting the logic to React Native for a truly portable pocket interpreter.
---
Built with 💜 for DeltaHacks.