Free AI Tools for speech to text
Browse 22 free and freemium AI tools built for speech to text.
All tools tagged “speech to text”
ScreenApp is an AI notetaker and recorder that transcribes, summarizes, and organizes audio and video into searchable insights.
Cartesia builds fast speech AI models and voice agents for real-time text-to-speech, transcription, and interactive conversations.
DeVoice is an AI speech-to-text and transcription tool that converts audio and video files into editable text online.
Inworld AI provides realtime voice AI tools for text-to-speech, speech-to-speech, speech-to-text, and model routing for conversational applications.
GreenConvert is an AI transcription platform for converting audio and video into text with speaker recognition, multilingual support, and export tools.
SubEasy is an AI platform for transcription, subtitles, translation, dubbing, and transcript-based content tools.
AssemblyAI provides speech-to-text, speech understanding, voice agent, and LLM gateway APIs for building voice AI products.
Deepgram provides enterprise voice AI APIs for speech-to-text, text-to-speech, and voice agents in one platform.
Sonix is an AI transcription platform that turns audio and video into accurate, audit-ready text in minutes.
Lingvanex provides on-premise and API-based machine translation and speech recognition tools for secure multilingual processing.
ParakeetAI is a real-time AI interview copilot that transcribes questions and generates answers during interviews and coding calls.
Transkriptor is an AI transcription tool that converts audio and video into text, summaries, and action items in 100+ languages.
Free online AI transcription tool that turns videos and audio into text with no sign-up required.
Maestra is an AI media localization platform for transcription, subtitles, translation, dubbing, and voice generation.
UniScribe is an AI transcription tool that converts audio, video, and YouTube links into text, summaries, mind maps, and exports.
Clipto is a fully local Mac app for AI transcription and natural-language search across video, audio, and media libraries.
Rev is a legal investigative intelligence platform for transcription, evidence review, and searchable case prep.
Notta is an AI note taker that transcribes meetings, lectures, and interviews into searchable notes, summaries, and visual deliverables.
Fish Audio is an AI voice platform for text-to-speech, voice cloning, speech-to-text, and real-time voice agents.
HappyScribe is an AI transcription, meeting notetaking, subtitle, and translation platform for multilingual audio and video workflows.
TurboScribe is an AI transcription service that converts audio and video into text in 98+ languages with speaker recognition and export options.
ElevenLabs is an AI audio platform for text-to-speech, voice cloning, dubbing, speech-to-text, music, and voice agents.