Structure

meetings/
├── bin/
│   ├── meeting-start          # starts recording + live transcription
│   ├── meeting-stop           # stops recording, asks for meeting name, renames files
│   ├── meeting-follow         # follow transcript while it is being written
│   └── summarize-meeting      # create post-meeting summaries (planned, not yet realized)
│
├── lib/
│   ├── paths.sh               # creates session dirs + defines file paths
│   └── whisper.sh             # whisper.cpp binary, model, ASR parameters
│
└── recordings/
    └── 2026-02-21T013852/     # session directory (created on meeting-start)
        ├── audio.wav          # raw system audio recording
        ├── transcript.txt     # live transcript (grows during meeting)
        ├── meta.env           # session metadata (PIDs, language, timestamps)
        ├── 2025-03-24T1930_project-sync_transcript.txt   # the actual renamed transcript
        ├── 2025-03-24T1930_project-sync_audio.wav        # (see data protection discussion)
        └── summary.md         # created by summarize-meeting (planned, not yet realized)

Description of Paths and Scripts

Paths

Path: meetings/bin/

Path: meetings/lib/

Path: meetings/recordings/

Scripts (current architecture)

TranscriptOMatic is implemented as a small set of composable shell scripts. Each script has a clearly defined responsibility within the session lifecycle. No script relies on implicit system state or hard-coded audio devices.

Library scripts (meetings/lib/)

meetings/lib/paths.sh

Responsible for session creation and path management.

On invocation, it:

~/meetings/recordings/<ISO_TIMESTAMP>/

This script is the only place where session directories are created.

meetings/lib/whisper.sh

Defines the speech recognition backend configuration.

It contains:

The setup is explicitly optimized for live transcription using whisper-stream, not for batch processing.

No audio devices are referenced here.

Executable scripts (meetings/bin/)

meeting-start

Starts a new live transcription session.

Responsibilities

  1. Session initialization
    • creates a new session directory via paths.sh
    • writes the active session path to ~/meetings/recordings/.current
  2. Audio graph setup (PipeWire)

    • ensures a persistent null sink (discord_sink)
    • routes Discord audio into that sink
    • exposes the sink monitor as a virtual microphone (whisper_mic)
  3. Processing
    • records audio from whisper_mic via ffmpeg
    • performs (almost) live transcription using whisper-stream
    • appends output to transcript.txt
  4. State tracking
    • writes all relevant runtime information (PIDs, module IDs, paths) to meta.env

Usage

meeting-start --en    # force English

Planned, not yet realized:

meeting-start --de    # force German (planned, not yet realized)
meeting-start --auto  # auto-detect language (planned, not yet realized)

meeting-start is self-contained: it does not require any pre-existing audio configuration and can be run after a reboot.

meeting-follow

Passively follows the live transcript of the currently active session.

Behaviour

This allows meeting-follow to be started:

It will never attach to archived sessions.

Usage

meeting-follow # stop with ctrl+c

meeting-stop

Stops the active transcription session and finalises all session files.

Responsibilities

  1. Process teardown kills the FFmpeg and whisper-stream processes via their stored PIDs
  2. State clean-up removes ~/meetings/recordings/.current unloads the whisper_mic remap-source module from PipeWire
  3. File finalisation prompts for a meeting name and slugifies it
    • renames transcript.txt and audio.wav to timestamped, named files (e.g. 2025-03-24T1930_project-sync_transcript.txt)

To prevent accidental deletions, meeting-stop does not delete files automatically. To maintain data protection, recording files have to be deleted by hand. 

Usage

meeting-stop              # stops the most recent session


Created 2026-01-01 18:55:51 UTC by Mela
Updated 2026-02-22 04:56:04 UTC by Mela