I’m using Gemini for speech-to-text and it often misrecognizes company names and acronyms.
Is there any way to use a custom lexicon or vocabulary with Gemini to improve recognition accuracy?
If not directly supported, what practical workarounds do people use, e.g. preprocessing prompts, fine-tuning, or combining Gemini with another ASR that supports phrase boosting?
It's not perfect, but it's taken this from an issue that made all our transcripts look terrible to one I no longer think about.
I imagine just using the second spellchecking pass with Gemini would be almost as effective.
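For anyone wanting to try that second pass, here is a minimal sketch of how the correction prompt could be assembled from a custom term list. The function name `build_correction_prompt`, the prompt wording, and the sample terms are all made up for illustration; the actual model call is left out, so send the resulting string with whatever client you already use.

```python
def build_correction_prompt(transcript: str, lexicon: list[str]) -> str:
    """Build a prompt asking a model to fix misrecognized glossary terms only.

    Hypothetical helper: the wording is an assumption, not a tested recipe.
    """
    # Drop blanks and duplicates while keeping the original order.
    terms = list(dict.fromkeys(t.strip() for t in lexicon if t.strip()))
    glossary = "\n".join(f"- {t}" for t in terms)
    return (
        "The transcript below came from speech-to-text and may misspell "
        "company names and acronyms.\n"
        "Correct only words that are likely misrecognitions of a glossary "
        "term; leave everything else unchanged.\n\n"
        f"Glossary:\n{glossary}\n\n"
        f"Transcript:\n{transcript}\n"
    )


# Example with invented terms.
prompt = build_correction_prompt(
    "We met with acme corp about the S L A.",
    ["Acme Corp", "SLA", "Acme Corp"],
)
```

Telling the model to touch only glossary terms is meant to keep the pass from rewriting the rest of the transcript, though how well that holds will depend on the model and the audio.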