A React + Express application demonstrating the Azure Speech Services text-to-speech avatar: real-time speech synthesis paired with a synchronized visual avatar streamed over WebRTC.
- Real-time Avatar Synthesis: Visual avatar with synchronized lip movements
- WebRTC Integration: Low-latency streaming of avatar video and audio
- Chat Interface: Interactive conversation with message history
- Azure Speech Services: Powered by Azure Cognitive Services
- TypeScript Support: Full type safety for both frontend and backend
- Node.js (v18+)
- npm or yarn
- Azure Speech Services subscription (Standard S0 tier required for Avatar)
- Modern browser with WebRTC support (Chrome, Edge, Firefox, Safari)
```
test-AZ-speech/
├── backend/              # Express.js server
│   ├── src/
│   │   └── index.ts      # Main server file with API endpoints
│   ├── dist/             # Compiled JavaScript
│   ├── package.json
│   ├── tsconfig.json
│   └── .env.example      # Example environment variables
│
└── frontend/             # React application
    ├── src/
    │   ├── App.tsx       # Main component with avatar logic
    │   ├── App.css       # Styling
    │   └── index.tsx     # Entry point
    ├── public/
    └── package.json
```
```
git clone git@github.com:marcus888-techstack/test-AZ-Speech-Avatar.git
cd test-AZ-Speech-Avatar
```

- Install backend dependencies:

```
cd backend
npm install
```

- Configure environment variables:

```
# Copy the example environment file
cp .env.example .env
```

Edit `.env` with your Azure credentials:

```
AZ_SPEECH_KEY=your_azure_speech_key_here
AZ_SPEECH_REGION=your_azure_region_here
PORT=3000
```

- Install frontend dependencies:

```
cd ../frontend
npm install
```

- Start the backend server:

```
cd backend
npm run dev
```

The server will run on http://localhost:3000.

- Start the frontend (in a new terminal):

```
cd frontend
npm start
```

The application will open in your browser. Since the backend already occupies port 3000, Create React App will prompt you to use the next free port (typically http://localhost:3001).
- Build the backend:

```
cd backend
npm run build
npm start
```

- Build the frontend:

```
cd frontend
npm run build
```

Serve the `build` folder with any static file server.
- Open the application in your browser
- Wait for the avatar to initialize (you'll see "Initializing avatar...")
- Type a message in the chat input
- Press Enter or click Send
- The avatar will appear and speak your message with synchronized lip movements
- Continue the conversation!
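When the avatar speaks a chat message, the text is typically wrapped in SSML before synthesis. Below is a minimal sketch of how the frontend might build that payload — the helper names and the `en-US-JennyNeural` voice are illustrative assumptions, not taken from this repo:

```typescript
// Escape characters that are special in XML so user text can't break the SSML.
function escapeXml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Build an SSML document for the synthesizer.
// NOTE: voice name is an assumption; use any neural voice your region supports.
function buildSsml(text: string, voice = "en-US-JennyNeural"): string {
  return (
    `<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xml:lang="en-US">` +
    `<voice name="${voice}">${escapeXml(text)}</voice>` +
    `</speak>`
  );
}
```

The resulting string would then be passed to the Speech SDK's SSML synthesis call; escaping matters because chat input is arbitrary user text.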
| Endpoint | Method | Description |
|---|---|---|
| `/api/health` | GET | Health check endpoint |
| `/api/speech/token` | GET | Get Azure Speech token and region |
| `/api/speech/ice-token` | GET | Get ICE server configuration for WebRTC |
| `/api/speech/synthesize` | POST | Optional: server-side speech synthesis |
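For context, a token endpoint like `/api/speech/token` usually exchanges the subscription key for a short-lived token via Azure's standard token-issuing (STS) endpoint. A hedged sketch — the function names are illustrative, and the global `fetch` assumes Node 18+:

```typescript
// Azure's standard token-issuing endpoint for a given region.
function tokenUrl(region: string): string {
  return `https://${region}.api.cognitive.microsoft.com/sts/v1.0/issueToken`;
}

// Exchange the subscription key for a short-lived access token.
// The backend would call this and return { token, region } to the frontend,
// so the subscription key itself never leaves the server.
async function issueSpeechToken(key: string, region: string): Promise<string> {
  const res = await fetch(tokenUrl(region), {
    method: "POST",
    headers: { "Ocp-Apim-Subscription-Key": key },
  });
  if (!res.ok) {
    throw new Error(`Token request failed with status ${res.status}`);
  }
  return res.text(); // Azure returns the token as plain text
}
```

Keeping the key server-side and handing the browser only a short-lived token is the main reason this endpoint exists.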
- Create an Azure Speech Services resource in the Azure Portal
- Select the Standard S0 pricing tier (required for Avatar)
- Copy your key and region
- Add them to the backend `.env` file
Available avatar characters:

- `lisa` (default)
- `jason`
- More are listed in the Azure documentation

Available avatar styles:

- `casual-sitting` (default)
- `technical-standing`
- `business-standing`
- More are listed in the Azure documentation
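As a sketch of how the defaults above might be applied, here is a small helper that resolves a character/style pair before building the avatar configuration — the type and function names are illustrative assumptions, not taken from this repo:

```typescript
// Hypothetical options shape for selecting an avatar appearance.
interface AvatarOptions {
  character?: string;
  style?: string;
}

// Fill in the documented defaults when the caller omits a field.
function resolveAvatarOptions(opts: AvatarOptions = {}): Required<AvatarOptions> {
  return {
    character: opts.character ?? "lisa",       // default character
    style: opts.style ?? "casual-sitting",     // default style
  };
}
```

The resolved pair would then be passed to the Speech SDK's avatar configuration when the session is created.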
- Check browser console for errors
- Verify the Azure credentials in `.env`
- Ensure you're using the Standard S0 pricing tier
- Check that your region supports the Avatar feature
- Ensure both backend and frontend are running
- Check CORS configuration
- Verify firewall settings allow WebRTC connections
- Check browser permissions for audio/video
- Ensure WebRTC is supported in your browser
- Try using a different browser
- Never commit `.env` files with real credentials
- Use environment variables for production deployments
- Consider implementing authentication for production use
- Use HTTPS in production for WebRTC security
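One way to enforce the environment-variable practice above is to fail fast at startup when required secrets are missing, instead of letting requests fail later with confusing Azure errors. A minimal sketch (the helper name and the placement in `backend/src/index.ts` are illustrative assumptions):

```typescript
// Return the value of a required environment variable, or throw immediately.
function requireEnv(name: string): string {
  const value = process.env[name];
  if (!value) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// e.g. near the top of backend/src/index.ts:
// const speechKey = requireEnv("AZ_SPEECH_KEY");
// const speechRegion = requireEnv("AZ_SPEECH_REGION");
```

Crashing at boot with a clear message is easier to debug than a 500 on the first `/api/speech/token` call.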
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- Azure Speech Services team for the Avatar API
- Microsoft Cognitive Services Speech SDK
- React and Express.js communities
Built with ❤️ using React, Express, and Azure Speech Services