2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00
2022-04-19 12:07:18 -04:00

Create conda env with:

conda create -y -n sist2-scripts -c conda-forge python=3.7 cudnn=8.1 cudatoolkit=11.2
conda clean --force-pkgs-dirs -y && conda clean --all -y
conda activate sist2-scripts
pip install -r requirements.txt

transcribe.py

Transcribe audio files using transformers STT

Example usage (Don't use multithreading!!):

find /path/to/audio/files/ -name "*.mp3" -exec python transcribe.py {} \;

transcribe_aws.py

Transcribe audio files using AWS Transcribe

Example usage:

find /path/to/audio/files/ -name "*.mp3" | parallel -j8 python transcribe_aws.py --bucket my-s3-bucket-name {}

export_meta.py

Save all .s2meta files to a zip archive for easy sharing

Example usage:

python export_meta.py [--json] /path/to/dataset/
Description
No description provided
Readme 151 KiB
Languages
Python 100%