ChatGPT logoChatGPT

Dictation Mode

ChatGPT's dictation mode allows users to speak naturally and have their speech converted to text in real-time, enabling hands-free interaction with the AI.

Dictation mode activated with microphone button and transcription display

Dictation mode activated with microphone button and transcription display

What's happening

ChatGPT provides a dictation mode that converts speech to text in real-time. The interface shows visual feedback during listening and displays the transcribed text as the user speaks.

Patterns

Input Mode Toggle

Microphone icon button to activate dictation mode

Open playground
Voice Visualizer

Animated bars showing voice activity and listening state

Open playground

UX Insights

  • Dictation mode completely replaces text input when active
  • Visual feedback is crucial for voice interactions
  • Real-time transcription builds user confidence
  • Clear exit mechanism (X button) to return to text mode

Design Decisions

Dictation mode provides a hands-free way to interact with ChatGPT, making it accessible for users who prefer speaking over typing. The real-time transcription gives immediate feedback that the system is accurately capturing speech.

Captured: December 29, 2025Type: desktop
voicemultimodalaccessibility

More real-world AI UX in your inbox

Weekly gallery picks, interface patterns, and notes on how products ship AI - no spam, unsubscribe anytime.

Subscribe on Substack