GitHub - kristoferlund/ostt: Open source voice-to-text for the terminal. Record from a hotkey, transcribe with any provider, pipe to AI or shell commands.

Open source voice-to-text for Linux and macOS

Features • Install • Quick Start • Processing • Docs

OSTT is a terminal-native speech-to-text tool. Record from a hotkey, transcribe with your chosen provider, then send the result to your clipboard, a file, stdout, an AI prompt, or any shell command. It does not assume one vendor, one subscription, or one app-specific workflow: bring your own API key and choose from OpenAI, Deepgram, Groq, DeepInfra, AssemblyAI, Berget, and ElevenLabs.

OSTT is built for people who treat the terminal as a normal place for voice input to land. You can print to stdout, copy to the clipboard, write to files, retry the same recording with another model, transcribe existing audio, and post-process text with AI prompts or shell commands. Voice becomes text that can move through the same tools as everything else.

Tip

Bind Alt+Space to ostt launch -c for a global hotkey popup. Press once to start recording, press again to stop and transcribe. Use Alt+Ctrl+Space with ostt launch -c -p for a popup with an action picker.

ostt-demo.mp4

Features

Linux-first voice input - Global hotkey setup for Omarchy/Hyprland, GNOME, KDE, and other Linux desktops, with macOS support too.
Provider choice - Bring your own API key and switch between OpenAI, Deepgram, Groq, DeepInfra, AssemblyAI, Berget, and ElevenLabs.
Terminal-native workflow - Use stdout, clipboard, files, aliases, shell completions, logs, and pipes.
Scriptable post-processing - Transform transcripts with AI prompts or bash commands using ostt -p and ostt process.
Retry without re-recording - Save recordings locally, then re-transcribe them with a different provider or model.
File transcription and replay - Transcribe existing audio files and replay saved recordings from history.
Keywords and custom vocabulary - Improve recognition for names, technical terms, and project-specific language.
Open source, no subscription - Public code, local configuration, and no vendor lock-in beyond the providers you choose.

Documentation

Full documentation is available at https://ostt.ai.

Start here:

Install

curl -fsSL https://ostt.ai/install | bash

The installer detects your platform, installs supported runtime dependencies, downloads the latest release, verifies its checksum, and installs the ostt CLI.

If you prefer platform package managers, see the docs for Homebrew, AUR, .deb, and .rpm options.

Quick Start

ostt auth           # Choose provider/model and save API key
ostt                # Record, transcribe, print to stdout
ostt -c             # Record, transcribe, copy to clipboard
ostt launch -c      # Popup workflow for global hotkeys

By default, press Enter to stop and transcribe, Space to pause/resume, and Esc, q, or Ctrl+C to cancel.

Processing

Processing actions transform transcriptions after recording or from history.

ostt -p clean -c              # Record, transcribe, clean, copy
ostt launch -c -p clean       # Popup hotkey workflow with processing
ostt process                  # Process most recent history item, show picker
ostt process clean            # Process most recent history item with clean action
ostt process 3 clean -c       # Process history item #3 with clean action
ostt process --list           # List configured actions

Actions are configured in ~/.config/ostt/ostt.toml and can run either bash commands or AI CLI tools. See Processing Actions for examples.

Common Commands

ostt                         # Record audio, print transcription
ostt -c                      # Record audio, copy transcription
ostt -o notes.txt            # Record audio, write transcription to file
ostt launch -c               # Open popup recorder
ostt transcribe file.mp3     # Transcribe existing audio
ostt retry 2 -c              # Re-transcribe recording #2 and copy
ostt replay                  # Play most recent recording
ostt history                 # Browse transcription history
ostt keywords                # Manage transcription keywords
ostt config                  # Open config file
ostt list-devices            # List audio input devices
ostt logs                    # View recent logs
ostt completions zsh         # Generate shell completions
ostt completions bash --install  # Install completions system-wide
ostt --version               # Show version
ostt --help                  # Show help

Common aliases: r for record, t for transcribe, l for launch, p for process, a for auth, h for history, k for keywords, c for config, and rp for replay.

Providers

OSTT is bring-your-own-API-key and currently supports OpenAI, Deepgram, DeepInfra, Groq, AssemblyAI, Berget, and ElevenLabs transcription models.

Run ostt auth to select your provider/model and save credentials securely.

Platform Setup

Suggested default keybindings:

Hotkey	Command	Action
`Alt+Space`	`ostt launch -c`	Popup recorder, clipboard output
`Alt+Ctrl+Space`	`ostt launch -c -p`	Popup with action picker

Platform-specific setup notes are available in the docs:

Development

git clone https://github.com/kristoferlund/ostt.git
cd ostt
cargo build
cargo test --all-targets --all-features
cargo clippy --all-targets --all-features

Release builds use the dist profile:

cargo build --profile dist --locked

Contributing

Contributions are welcome. Please open an issue or submit a pull request.

Contributors

_{axo bot}

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 210 Commits
.github/workflows		.github/workflows
environments		environments
specs		specs
src		src
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
DISTRIBUTION.md		DISTRIBUTION.md
LICENSE		LICENSE
PROMPT.md		PROMPT.md
README.md		README.md
dist-workspace.toml		dist-workspace.toml
loop.sh		loop.sh
ostt.png		ostt.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Features

Documentation

Install

Quick Start

Processing

Common Commands

Providers

Platform Setup

Development

Contributing

Contributors

License

About

Uh oh!

Releases 10

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Features

Documentation

Install

Quick Start

Processing

Common Commands

Providers

Platform Setup

Development

Contributing

Contributors

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 10

Contributors

Uh oh!

Languages