18 lines
1.1 KiB
Markdown
18 lines
1.1 KiB
Markdown
# STT-Function
|
|
With the Speech-to-Text (STT) Function you can transcribe a file ("convert" an audio/video file into text).
|
|
Internally Whisper from OpenAI (https://github.com/openai/whisper) is used to transcribe the audiofile.
|
|
|
|
|
|
## Structure
|
|
* The container has two folders attached, the input folder with files that should be transcribed and the output path, where the transcript should be saved to.
|
|
|
|
|
|
## Setup
|
|
Make sure [Podman](https://podman.io/docs/installation) or [Docker](https://docs.docker.com/get-docker/) is installed.
|
|
|
|
Download the Model into the Folder where you will build the container image. [Download Link](https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt) or run `wget https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt`
|
|
|
|
```
|
|
podman build -t stt-function .
|
|
podman run -e LANGUAGE_CODE='de' -e WHISPER_MODEL='tiny' -v '/path/to/audio_video/file/':/app/input_files/ -v /output_path/of/transcript/:/app/transcripts/ --name stt-function_container --rm -t stt-function
|
|
```
|