STT-Function
With the Speech-to-Text (STT) function you can transcribe an audio or video file, that is, convert the speech it contains into text. Internally, Whisper from OpenAI (https://github.com/openai/whisper) performs the transcription.
Structure
- The container has two folders mounted: the input folder, which holds the files to be transcribed, and the output folder, where the transcripts are saved.
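The two folders can be prepared on the host before the container is started. This is a minimal sketch; the directory names `input_files` and `transcripts` under the current working directory are assumptions for illustration, and any host paths work.

```shell
# Hypothetical host directories; adjust them to your setup.
INPUT_DIR="${INPUT_DIR:-$PWD/input_files}"
OUTPUT_DIR="${OUTPUT_DIR:-$PWD/transcripts}"

# Create both folders if they do not exist yet.
mkdir -p "$INPUT_DIR" "$OUTPUT_DIR"

# List the files that would be handed to the container.
echo "Files in $INPUT_DIR:"
ls -1 "$INPUT_DIR"
```

Copy the audio or video files into the input directory, then pass both directories to the container as volumes.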
Setup
Make sure Podman or Docker is installed.
Download the model into the folder where you will build the container image. Use the download link or run wget https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt
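Because the model file is large, it can be worth checking the download before building the image. The sketch below assumes that the long hexadecimal path segment in the URL is the expected SHA-256 of the file (Whisper's own downloader appears to treat it that way) and that `sha256sum` from GNU coreutils is available. The variable names are illustrative only.

```shell
MODEL_URL="https://openaipublic.azureedge.net/main/whisper/models/e5b1a55b89c1367dacf97e3e19bfd829a01529dbfdeefa8caeb59b3f1b81dadb/large-v3.pt"

# File name is the last path segment of the URL.
MODEL_FILE="$(basename "$MODEL_URL")"
# Assumption: the second-to-last path segment is the expected SHA-256.
EXPECTED_SHA256="$(basename "$(dirname "$MODEL_URL")")"

if [ -f "$MODEL_FILE" ]; then
    # sha256sum prints "<hash>  <file>"; keep only the hash.
    ACTUAL_SHA256="$(sha256sum "$MODEL_FILE" | cut -d ' ' -f 1)"
    if [ "$ACTUAL_SHA256" = "$EXPECTED_SHA256" ]; then
        echo "checksum OK: $MODEL_FILE"
    else
        echo "checksum mismatch for $MODEL_FILE, download it again" >&2
    fi
else
    echo "$MODEL_FILE not found, download it first" >&2
fi
```

A mismatch usually points to an interrupted download; fetching the file again is the simplest fix.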
Build the container image:
podman build -t stt-function .
Run the container, mounting the input and output folders:
podman run -e LANGUAGE_CODE='de' -e WHISPER_MODEL='tiny' -v '/path/to/audio_video/file/':/app/input_files/ -v '/output_path/of/transcript/':/app/transcripts/ --name stt-function_container --rm -t stt-function
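Since either Podman or Docker can be used, the run command can be wrapped so that the runtime and the folders are parameters. The following is a sketch, not part of the project: the function name `run_stt` and the `RUNTIME` and `DRY_RUN` variables are hypothetical, and it relies on `docker run` accepting the same `-e`, `-v`, `--name`, `--rm` and `-t` options as `podman run`. By default it only prints the command; set `DRY_RUN=0` to execute it.

```shell
# Container runtime to use; set RUNTIME=docker if Podman is not installed.
RUNTIME="${RUNTIME:-podman}"

# run_stt RUNTIME INPUT_DIR OUTPUT_DIR
# Assembles the run command from the README and prints or executes it.
run_stt() {
    runtime="$1"
    input_dir="$2"
    output_dir="$3"

    # Rebuild the positional parameters as the full command line.
    set -- "$runtime" run --rm -t \
        -e LANGUAGE_CODE="${LANGUAGE_CODE:-de}" \
        -e WHISPER_MODEL="${WHISPER_MODEL:-tiny}" \
        -v "$input_dir":/app/input_files/ \
        -v "$output_dir":/app/transcripts/ \
        --name stt-function_container \
        stt-function

    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "$@"      # dry run: show the command only
    else
        "$@"           # execute the container
    fi
}

# Absolute host paths avoid ambiguity with named volumes.
run_stt "$RUNTIME" "$PWD/input_files" "$PWD/transcripts"
```

Keeping the dry run as the default makes it easy to inspect the volume mappings before a long transcription starts.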