User Guide v1.0

Documentation

The complete guide to mastering TalkTyper - your intelligent voice-to-text companion.

Introduction

TalkTyper is a powerful macOS menu bar application that converts your speech to text with intelligent processing options. Whether you need simple voice-to-text transcription or AI-enhanced formatting, TalkTyper provides flexible modes to suit your workflow.

Key Features

Local & Cloud Processing: Use offline Whisper transcription or cloud providers for maximum flexibility.
7 Intelligent Modes: From basic transcription to AI-powered email and code assistance.
Global Hotkeys: Record from anywhere with customizable keyboard shortcuts.
Privacy-First: Local processing by default, with optional context capture and full transparency.
History & Export: Review, search, filter, and export past transcriptions.

💡

Quick Start

Press ⌃+⌥+Space anywhere on your Mac to start recording. Your transcription will be automatically typed into the active application.

Getting Started

Installation

1
Download TalkTyper
Get TalkTyper from the Mac App Store.
2
Open the App
TalkTyper will appear as an icon in your macOS menu bar.
3
Complete Onboarding
Follow the setup wizard to configure permissions and preferences.
- Grant Accessibility permission (required for global hotkeys)
- Grant Microphone permission (required for recording)
- Configure your preferred hotkey

Download from App Store

First Launch

On first launch, TalkTyper automatically configures sensible defaults:

Appears as an icon in your macOS menu bar
Uses the default "Voice to Text" mode
Uses local Whisper processing (no API keys required)
Sets default hotkey to ⌃+⌥+Space

ℹ️

Permissions Required

TalkTyper requires Accessibility and Microphone permissions to function. You'll be prompted to grant these during first launch.

Main Interface

TalkTyper Main Menu Interface — Main dropdown menu

Menu Bar Icon

The TalkTyper icon in your menu bar provides visual status feedback:

Default State: Ready to record
Recording: Animated indicator with pulsing red dot
Processing: Processing animation while transcribing

Main Menu Elements

Header Section

Displays "TalkTyper" with Pro badge and status indicator.

Mode Selector

Grid of available modes with Pro and AI badges.

Recording Section

Large Record/Stop button with mode display and hotkey badge.

Quick Actions

History and Settings buttons for quick access.

Recording & Transcription

TalkTyper Recording Interface — Recording interface

Starting a Recording

You can start recording in three ways:

Menu Bar: Click the microphone button
Global Hotkey: Press ⌃+⌥+Space
Recording Window: Use dedicated controls

During Recording

The menu bar icon shows an animated recording indicator
Live duration is displayed in real-time
Waveform visualizes your audio levels
Real-time transcription appears (if enabled)

Transcription Pipeline

1
Audio Processing
Audio is captured and prepared for transcription. Noise reduction is applied if enabled.
2
Speech-to-Text
Audio is processed by your configured STT provider (Local Whisper or Cloud).
3
AI Enhancement
If using an AI-powered mode, the text is processed by your LLM provider.
4
Output Delivery
Final text is copied to clipboard and/or auto-pasted into the active application.

Intelligent Modes

TalkTyper includes 7 built-in modes designed for specific use cases. Each mode transforms your speech into optimized output.

Voice to Text

Free

Fast, raw transcription with minimal formatting. No AI processing required. Best for quick notes, dictation, and when you need exact speech capture.

Super

Pro

Context-aware AI assistant that adapts to your current application, selected text, and clipboard contents. Intelligently formats output based on where you're working.

Coding

Pro

Transforms voice into structured prompts optimized for AI coding assistants like Cursor, Copilot, and Claude. Extracts requirements, technical details, and intent.

Message

Pro

Clean, casual messages for Slack, Discord, iMessage, and chat apps. Removes filler words, fixes grammar, and maintains conversational tone.

Email

Pro

Professional emails with proper structure. Automatically adds appropriate greetings, organizes content into paragraphs, and includes sign-offs.

Note

Pro

Structured notes with Markdown formatting. Creates headings, bullet points, and organized sections. Ideal for brainstorming, meeting notes, and documentation.

Meeting

Pro

Produces comprehensive meeting summaries including key discussion points, decisions made, and action items with assignees.

Custom Mode Settings — Custom mode settings

Custom Modes (Pro)

Pro subscribers can create unlimited custom modes with personalized prompts, context settings, and provider choices. Duplicate any built-in mode as a template.

Settings

General Settings

Appearance

Show in Dock/Menu Bar
Theme (System/Light/Dark)
Launch at Login

Behavior

Default Mode selection
Auto-paste behavior
Notification preferences

Audio Settings

Input Device: Select your preferred microphone
Input Level: Adjust gain from 0x to 2x
Microphone Test: Verify your setup is working
Sound Feedback: Enable/disable recording sounds

Keyboard Shortcuts

Action	Default
Start/Stop Recording	`⌃⌥Space`
Cancel Recording	`ESC`
Open Settings	`⌘,`
Open History	`⌘H`
Quit TalkTyper	`⌘Q`

💡

Mode-Specific Shortcuts

Pro users can assign unique hotkeys to individual modes for instant access.

Vocabulary Settings

Custom Words

Add technical terms, names, or acronyms that TalkTyper should recognize correctly.

Text Replacements

Define auto-corrections like "gonna" → "going to" for cleaner output.

Privacy Settings

Context Capture: Enable/disable clipboard and selection context for AI modes
History Retention: Control how long transcriptions are stored
Data Export: Export or delete all your data

Advanced Settings

Mini Window: Configure behavior of the floating recording window
Real-time Transcription: Toggle live transcription preview
Local Model: Configure CPU threads and GPU acceleration
Storage Limits: Set maximum history entries

Provider Configuration

Configure your Speech-to-Text and LLM providers to customize how TalkTyper processes your audio and generates enhanced output.

Speech-to-Text Providers

Local Whisper (Default)

100% offline processing on your device. No API key required.

Model sizes: Tiny, Base, Small, Medium, Large, LargeV3
Optimized for Apple Silicon

Cloud Providers (Pro)

Fast, accurate cloud transcription. Requires your own API keys.

OpenAI Whisper API
Deepgram
AssemblyAI

LLM Providers

Cloud Models

OpenAI (GPT-4o, GPT-4, GPT-3.5)
Anthropic (Claude 3.5 Sonnet, Claude 3 Opus)
Groq (Llama 3, Mixtral)

Local Models

Ollama - Run open source models locally
LM Studio - User-friendly local inference

Automation

Intelligent Mode Switching Settings — Automation settings

Intelligent Mode SwitchingPRO

TalkTyper can automatically switch to the most appropriate mode based on which application you're currently using. This eliminates the need to manually change modes when switching between tasks.

💡

How It Works

When enabled, TalkTyper detects your active application and automatically selects the optimal mode for that context. For example, recording while in Messages will use Message mode, while VS Code triggers Coding mode.

Default App-to-Mode Mappings

TalkTyper comes with pre-configured mappings for popular applications:

Message Mode

Messages
Discord
Slack
WhatsApp
Telegram

Meeting Mode

Microsoft Teams
Zoom
Google Meet
Webex

Coding Mode

Visual Studio Code
Cursor
Xcode
Terminal

Email Mode

Mail
Outlook
Gmail (in browser)
Spark

Customizing Mappings

You can customize which mode is triggered for each application:

1
Open Automation Settings
Navigate to Settings → Automation tab.
2
Enable Intelligent Mode Switching
Toggle the feature on to activate automatic mode detection.
3
Modify App Mappings
Click on any application in the list to change its associated mode, or add new applications.

ℹ️

Override Behavior

If you manually select a mode before recording, that selection takes precedence over the automatic mapping for that session. The next recording will resume automatic switching.

History

Features

Chronological List: View transcriptions with timestamps and previews.
Search & Filter: Find by content, Favorites, Date, or Mode.
Quick Actions: Copy, Favorite, Delete, or Reprocess.
Reprocess: Run old recordings through different modes.
Context Snapshot: See which app was active during recording.

ℹ️

Export Options

Export your transcription history in plain text, Markdown, or JSON format for backup or integration with other tools.

Subscription & Pro

Subscription Management — Manage subscription

Managing Your Subscription

Access your subscription details from Settings → General, where you can view your current plan, manage billing, and upgrade or modify your subscription at any time.

View current subscription status
Upgrade from Free to Pro
Manage payment methods
View billing history

Free Tier

Voice to Text mode only
Local Whisper processing
10 transcriptions per day
1-minute recording limit

Pro Tier

All 7 Intelligent Modes
Unlimited transcriptions
No recording time limits
Cloud & Local provider support
Custom Mode creation
Context Capture features

Feature Comparison

Feature	Free	Pro
Voice to Text Mode	✓	✓
AI-Powered Modes	—	✓
Local Whisper	✓	✓
Cloud STT Providers	—	✓
LLM Providers	—	✓
Custom Modes	—	✓
Context Capture	—	✓
Daily Limit	10	Unlimited
Recording Time	1 min	Unlimited

Privacy & Security

🛡️

Privacy-First Design

TalkTyper is designed with privacy as a core principle. By default, all processing happens locally on your device.

Local Processing

With Local Whisper, your audio is processed entirely on your Mac. No audio data is ever sent to external servers.

Secure Storage

All transcription history is stored locally. API keys are stored securely in the macOS Keychain.

Cloud Processing

If you opt to use cloud providers, data is sent only to that specific provider according to their privacy policy.

Context Control

Context capture (clipboard, selection) is optional and clearly labeled. You control what data AI modes can access.

⭐

Your Data, Your Control

You can export or delete all your data at any time from Settings → Privacy. TalkTyper does not collect telemetry or share your data with third parties.

Troubleshooting

Microphone Not Working

Recording doesn't capture audio or the waveform appears flat.

Check microphone permission in System Settings → Privacy & Security → Microphone
Verify the correct input device is selected in TalkTyper Settings → Audio
Test your microphone with the built-in test function in Audio settings
Try unplugging and reconnecting external microphones

Global Hotkey Not Responding

Pressing the configured hotkey doesn't start recording.

Verify Accessibility permission is granted in System Settings → Privacy & Security → Accessibility
Check for conflicts with other applications using the same hotkey
Try setting a different hotkey combination in Settings → Shortcuts
Restart TalkTyper after granting permissions

Text Not Auto-Inserting

Transcription works but text doesn't appear in the target application.

Ensure Accessibility permissions are enabled for TalkTyper
Grant Automation permission in System Settings → Privacy & Security → Automation
Some applications may block automated typing - try copying to clipboard instead
Check if "Auto-paste" is enabled in Settings → General

Slow Transcription

Processing takes longer than expected.

Local Whisper processing depends on your Mac's hardware - Apple Silicon is recommended
Try a smaller Whisper model (e.g., Tiny or Base) in Provider settings
For faster results, consider using a cloud STT provider
Close resource-intensive applications while transcribing

API Key Errors

Cloud provider returns authentication or rate limit errors.

Verify your API key is correct and has not expired
Check that your API key has the required permissions for the service
Ensure you haven't exceeded your API rate limits or quota
Try regenerating your API key from the provider's dashboard

Frequently Asked Questions

By default, TalkTyper uses Local Whisper processing, which means your audio never leaves your device. If you choose to use cloud providers (OpenAI, Deepgram, etc.), audio is sent only to that specific provider according to their privacy policy.

For local processing with Whisper, no internet is required. Cloud STT and LLM providers require an internet connection. Voice to Text mode with local Whisper works completely offline.

Free tier includes Voice to Text mode with local Whisper processing, limited to 10 transcriptions per day with 1-minute recordings. Pro unlocks all 7 modes, unlimited transcriptions, no time limits, cloud providers, and custom mode creation.

Yes! TalkTyper supports bringing your own API keys for OpenAI, Anthropic, Groq, Deepgram, and AssemblyAI. Keys are stored securely in the macOS Keychain.

Yes, TalkTyper is optimized for Apple Silicon (M1/M2/M3/M4) and takes advantage of the Neural Engine for faster local AI processing. Intel Macs with AVX2 support are also supported.

Pro subscribers can create custom modes in Settings → Modes. You can duplicate an existing mode as a template, then customize the prompt, context settings, and provider choices.

TalkTyper supports 99+ languages through Whisper, including English, Spanish, French, German, Chinese, Japanese, Korean, Arabic, Hindi, and many more. Language detection is automatic.

Yes, you can export your history in various formats including plain text, Markdown, and JSON. Access this through the History panel's export function.

Appendix

Supported Languages

TalkTyper supports transcription in 99+ languages through Whisper, including:

English

Spanish

French

German

Italian

Portuguese

Chinese

Japanese

Korean

Russian

Arabic

Hindi

System Requirements

OS: macOS 14.0 (Sonoma) or later
Processor: Apple Silicon (M1/M2/M3/M4) or Intel with AVX2
RAM: 8GB minimum (16GB recommended for local AI)
Disk Space: 500MB - 3GB (depends on model size)

Keyboard Modifier Reference

Command⌘

Option⌥

Control⌃

Shift⇧

Documentation

Introduction

Key Features

Quick Start

Getting Started

Installation

Download TalkTyper

Open the App

Complete Onboarding

First Launch

Permissions Required

Main Interface

Menu Bar Icon

Main Menu Elements

Recording & Transcription

Starting a Recording

During Recording

Transcription Pipeline

Audio Processing

Speech-to-Text

AI Enhancement

Output Delivery

Intelligent Modes

Voice to Text

Super

Coding

Message

Email

Note

Meeting

Custom Modes (Pro)

Settings

General Settings

Appearance

Behavior

Audio Settings

Keyboard Shortcuts

Mode-Specific Shortcuts

Vocabulary Settings

Custom Words

Text Replacements

Privacy Settings

Advanced Settings

Provider Configuration

Speech-to-Text Providers

Local Whisper (Default)

Cloud Providers (Pro)

LLM Providers

Cloud Models

Local Models

Automation

Intelligent Mode SwitchingPRO

How It Works

Default App-to-Mode Mappings

Message Mode

Meeting Mode

Coding Mode

Email Mode

Customizing Mappings

Open Automation Settings

Enable Intelligent Mode Switching

Modify App Mappings

Override Behavior

History

Features

Export Options

Subscription & Pro

Managing Your Subscription

Free Tier

Pro Tier

Feature Comparison

Privacy & Security

Privacy-First Design

Local Processing

Secure Storage

Cloud Processing

Context Control

Your Data, Your Control

Troubleshooting

Microphone Not Working