Advanced Search
Search Results
30 total results found
Structure
meetings/ ├── bin/ │ ├── meeting-start # starts recording + live transcription │ ├── meeting-stop # stops recording, asks for meeting name, renames files │ ├── meeting-follow # follow transcript while it is being written ...
System Information
This page was part of the project write-up I mainly did for myself and is kept for reference. Device Device Name Cox Device Type Raspberry Pi 500 DE Version Networking Address Type 10.20.0.94 Wi-Fi Ethernet Hardware Name Type Notes ...
Scripts in meetings/lib/
Scripts used by the executable scripts, providing basic configuration information. Script: meetings/lib/paths.sh Creates: Session Directory Session Files audio.wav transcript.txt meta.env Timestamps (ISO) and Provides path information for all scr...
Script meetings/bin/meeting-start
Starts a new live transcription session. ./bin/meeting-start Usage German: meeting-start --de English: meeting-start --en Auto: meeting-start --auto --de and --auto do NOT work at this stage of the project. Only use --en. meeting-start --en Script #!/...
Script meetings/bin/meeting-stop
./bin/meeting-stop Stops the active transcription session and finalises all session files. Usage meeting-stop Script #!/usr/bin/env bash set -euo pipefail BASE="$HOME/meetings/recordings" CURRENT="$BASE/.current" SESSION="${1:-$(ls -d "$HOME/meetings/recordi...
Script meetings/bin/meeting-follow
./bin/meeting-follow Display the transcript while it is generated. Usage meeting-follow Script #!/usr/bin/env bash set -euo pipefail BASE="$HOME/meetings/recordings" CURRENT="$BASE/.current" echo "📄 Waiting for active meeting session…" # Wait until a curre...
Installing and Building the Right whisper.cpp
Cloning the Github repository locally cd ~ git clone https://github.com/ggerganov/whisper.cpp Build additional libraries sudo apt install -y libsdl2-dev Building whisper-stream The -j2 flag limits parallel build jobs to 2. Using -j4 or higher may cause t...
Download the Right Model(s)
Download the tiny.en language model → this is the only model that will provide useful results on a Raspberry Pi 500 bash ~/whisper.cpp/models/download-ggml-model.sh tiny.en Better hardware will support bigger language models. Language Models: What is the...
Introduction
I've started this project after fiddling around with Notions "Meeting Notes" feature and running the idea of using a Speech to Text tool for our session transcripts by the members of one of my TTRPG groups. Consensus was that a tool like this would be nice an...
Privacy Concerns
Recordings of biometrical data, like a person's voice, create privacy concerns. My goal was, to pass audio through the tool chain without the need for a recording that exists as a file on the device. This turned out not to be feasible with whisper-stream's cur...
DISCLAIMER: Frozen WIP
This project has reached a status that most likely won't see any changes in the future. A Raspberry Pi 500 with just 8GB RAM is too limited to be a really good platform for a low-cost home-grown transcription tool. Yet, TranscriptOMatic is not dead and ...
Impressum
Information according to § 5 DDGBernd & Mela EckenfelsMörscher Straße 876185 Karlsruhe Verantwortlich für den Inhalt nach § 55 Abs. 2 RStV:Mela EckenfelsMörscher Straße 876185 Karlsruhe UrheberrechtDie durch die Seitenbetreiber erstellten Inhalte und Werke...
Creating a Dedicated Discord Account
To limit the complexity and to not have to deal with mapping and capturing several audio devices, I chose to create a new Discord account that is solely used for running TranscriptOMatic. This is necessary because one account can't be joining a voice session f...
Installing and Setting-Up Legcord
The official Discord client currently isn't available for devices operating ARM processors on Windows or Linux2. The app Legcord will act as a replacement. Installation Open Pi-Apps and go to Internet → Communication → Legcord to install. Post-Installat...
Installing Pi-Apps
To install the Pi-Apps app store for Raspberry Pi, follow the instructions on their website: wget -qO- https://raw.githubusercontent.com/Botspot/pi-apps/master/install | bash Check the URL before running — piping directly to bash executes code without revie...
Audio Input (the Almost Easy Way)
How the TranscriptOMatic should operate is, to connect to Discord voice using a dedicated Discord account for this purpose. During sessions, the Discord client (Legcord) will join the discord voice session. To reduce complexity, no microphone or speaker is att...
From Voice to Text
Discord (Legcord) ↓ PipeWire graph ↓ discord_sink (virtual null sink) ↓ discord_sink.monitor (loopback source) ↓ whisper_mic (remap-source, mono, 16kHz) ↓ ffmpeg ↓ audio.wav (growing file) ↓ whisper-stream On the device us...
Things to Look Into
https://picovoice.ai/blog/whisper-cpp-speaker-diarization/ OpenAI Whisper delivers highly accurate speech-to-text transcription, but it does not track speaker changes. Applications that rely on Whisper cannot determine who is speaking in a conversation. To...