Skip to content
This repository was archived by the owner on Jul 10, 2025. It is now read-only.

kot-vn/vtac-poc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 

Repository files navigation

Setup

First time

Create conda environment with python 3.11

conda create -n vtatc-poc python=3.11

Active conda environment

conda activate vtatc-poc

Install ffmpeg

conda install -c conda-forge ffmpeg

Install ffmpeg-python, ipython, SpeechRecognition, google-cloud-speech, google-cloud-language, pydub, google-cloud-storage

pip install ffmpeg-python ipython SpeechRecognition google-cloud-speech google-cloud-language pydub google-cloud-storage protobuf

Setup Google Authentication

export GOOGLE_APPLICATION_CREDENTIALS="/path/to/keyfile.json"

Google Cloud Storage

The file must not be compressed (for example, gzip).

Only FLAC audio file are supported.

Only Google Cloud Storage URIs are supported.

Run cmd

Active conda environment

conda activate vtatc-poc

Setup Google Authentication

export GOOGLE_APPLICATION_CREDENTIALS="/path/to/keyfile.json"

Run python file

cd src

python main.py

About

Simple Video to Text proof-of-concept

Topics

Resources

Stars

Watchers

Forks

Contributors

Languages