Spal.Transcriber

A simple self-hosted transcription tool. It runs locally, so your audio/video never leaves your machine or server.

Features

Upload local files (.mp4, .mp3, .wav, …)
Automatic transcription using faster-whisper
Job queue with progress tracking
Download transcripts as .txt
Optional email delivery (SMTP, with group support)
Recipient groups (e.g. recipients_main.txt, recipients_team.txt)
Web interface with dark theme
Basic authentication (via .env)
Docker support

Usage

Clone this repository
Copy .env.example to .env and adjust settings
Install Python requirements (or use Docker, see below)

Run locally:

pip install -r requirements.txt
python app.py

Then open http://localhost:15551.

Docker (compose)

For running in Docker, create a .env file with your settings (see .env.example). Then use Docker Compose:

services:
  transcriber:
    image: spaleks/transcriber:latest
    # or if you prefer quay.io:
    # image: quay.io/spaleks/transcriber:latest
    container_name: spal-transcriber
    ports:
      - "15551:15551"
    volumes:
      - ./data:/app/data
      - ./config:/app/config
      - ./.env:/app/.env:ro

Example workflow

Go to http://localhost:15551
Upload a file (e.g. meeting.mp4 or recording.mp3)
Select which recipient group should be notified (if SMTP is configured)
Wait until processing finishes
Download the transcript or send it via email

Recipient groups

Recipients are defined in plain text files inside the RECIPIENTS_DIR (default: ./config).

recipients.txt → main recipients (always notified if email sending is enabled)
recipients_<group>.txt → group-specific recipients (notified only if the job is assigned to that group)

Example:

config/
  recipients.txt           # always notified
  recipients_Team.txt      # group "Team"
  recipients_Clients.txt   # group "Clients"

Inside each file, list one email address per line. Lines starting with # are ignored.

Webhooks (optional)

If configured, a webhook is called when a job finishes. The transcript is sent as multipart/form-data with metadata.

Enable

Set these in .env:

WEBHOOK_URL – target endpoint (if unset, webhooks are disabled)
WEBHOOK_BEARER – optional token for the Authorization Bearer
WEBHOOK_TIMEOUT – request timeout in seconds (default: 15)
WEBHOOK_VERIFY – TLS verification (1 = verify, 0 = skip)

When it fires

Automatically after a job reaches done.
Manually via a button in the GUI.

HTTP request

Method: POST
Content-Type: multipart/form-data
Parts:
- metadata: JSON string with job info (see below)
- file: transcript as text/plain (<slug>.txt)

Example metadata JSON:

{
  "slug": "meeting-2025-09-26",
  "name": "Weekly Meeting",
  "job_id": 123,
  "recipient_group": "Team",
  "created_at": "2025-09-26 10:05:11",
  "updated_at": "2025-09-26 10:22:44",
  "source": "spal.transcriber",
  "status": "done",
  "filename": "meeting-2025-09-26.txt"
}

Environment variables

Here’s what you can configure via .env:

App

APP_DATA_DIR – Directory for storing jobs and outputs (default: ./data)
APP_HOST – Host to bind (default: 127.0.0.1)
PORT – Port to bind (default: 15551)
APP_AUTH_USER – Username for Basic Auth
APP_AUTH_PASS – Password for Basic Auth

Whisper

WHISPER_MODEL – Model size (tiny, base, small, medium, large-v3)
WHISPER_DEVICE – cpu or cuda
WHISPER_COMPUTE – Compute type (int8, float32, int8_float16, float16)
WHISPER_ALLOWED_LANGUAGES – Comma-separated allowlist for language detection (e.g. en,de), or */unset to allow any language
WHISPER_THREADS – CPU threads (integer)
WORKER_CONCURRENCY – Number of concurrent workers.

SMTP / Email

AUTO_SEND_EMAIL – If 1, emails are sent automatically after transcription (default: 0)
SMTP_HOST – SMTP server hostname
SMTP_PORT – SMTP port (default: 587)
SMTP_USER – SMTP username (optional)
SMTP_PASS – SMTP password (optional)
SMTP_SENDER – Sender email address
SMTP_SENDER_NAME – Display name for sender
SMTP_USE_TLS – Use STARTTLS (default: 1)
SMTP_USE_SSL – Use SSL/TLS directly (default: 0)
SMTP_VERIFY – Verify server certificate (1 = yes, 0 = no)

Recipients

RECIPIENTS_DIR – Directory containing recipient list files (default: ./config)
RECIPIENTS_FILE – Path to main recipients file (default: ./config/recipients.txt)

Mail template

MAIL_SUBJECT – Email subject (supports {name}, {slug} placeholders)
MAIL_BODY – Email body (supports {name}, {slug}, supports \n)
MAIL_BODY_FILE – Optional path to a file containing email body

Privacy notice

All files are stored and processed locally.
Nothing is uploaded to third-party servers.
Please always respect the privacy of speakers and only process content you are allowed to handle.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.build		.build
bin		bin
lib		lib
routes		routes
static		static
templates		templates
.env.example		.env.example
.gitignore		.gitignore
.gitlab-ci.yml		.gitlab-ci.yml
LICENSE.md		LICENSE.md
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spal.Transcriber

Features

Usage

Docker (compose)

Example workflow

Recipient groups

Webhooks (optional)

Enable

When it fires

HTTP request

Environment variables

App

Whisper

SMTP / Email

Recipients

Mail template

Privacy notice

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Spal.Transcriber

Features

Usage

Docker (compose)

Example workflow

Recipient groups

Webhooks (optional)

Enable

When it fires

HTTP request

Environment variables

App

Whisper

SMTP / Email

Recipients

Mail template

Privacy notice

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages