mirror of
https://github.com/simon987/yt-metadata.git
synced 2025-04-10 14:06:42 +00:00
22 lines
1.1 KiB
Markdown
22 lines
1.1 KiB
Markdown
# yt-metadata
|
|
Script to import [youtube-dl](https://github.com/rg3/youtube-dl) metadata to PostgreSQL.
|
|
The actual `.jpg` files for the thumbnails are saved into the database as byte arrays (Only the **default**
|
|
thumbnail saved by **youtube-dl**)
|
|
|
|
### Scraping metadata using youtube-dl
|
|
This tool expects the files to be in the format that this bash script will output:
|
|
```bash
|
|
id="$1"
|
|
mkdir "$id"; cd "$id"
|
|
youtube-dl -v --print-traffic --restrict-filename --write-description --write-info-json --write-annotations --write-thumbnail --all-subs --write-sub --skip-download --ignore-config --ignore-errors --geo-bypass --youtube-skip-dash-manifest https://www.youtube.com/watch?v=$id
|
|
```
|
|
|
|
### Setup instructions:
|
|
* Make sure you have PostgreSQL version **>=9.5** installed
|
|
* Create the database and schema with the tool of your choice using `schema.sql`
|
|
* Change the directory in `import.py` so it points to the location of your youtube metadata
|
|
* Run `import.py`
|
|
|
|
### Schema:
|
|

|