sist2-scripts/README.md
2022-04-19 12:07:18 -04:00

39 lines
783 B
Markdown

Create conda env with:
```
conda create -y -n sist2-scripts -c conda-forge python=3.7 cudnn=8.1 cudatoolkit=11.2
conda clean --force-pkgs-dirs -y && conda clean --all -y
conda activate sist2-scripts
pip install -r requirements.txt
```
## transcribe.py
Transcribe audio files using transformers STT
Example usage (Don't use multithreading!!):
```
find /path/to/audio/files/ -name "*.mp3" -exec python transcribe.py {} \;
```
## transcribe_aws.py
Transcribe audio files using AWS Transcribe
Example usage:
```
find /path/to/audio/files/ -name "*.mp3" | parallel -j8 python transcribe_aws.py --bucket my-s3-bucket-name {}
```
## export_meta.py
Save all .s2meta files to a zip archive for easy sharing
Example usage:
```
python export_meta.py [--json] /path/to/dataset/
```