Speechify Expands Chrome Extension with Voice Typing and Conversational AI Assistant


Speechify has introduced new voice detection capabilities to its Chrome extension, integrating voice typing and a conversational AI assistant. The expansion moves the company, which has traditionally focused on text-to-speech tools for articles, PDFs, and documents, into the growing voice AI market.

The new dictation tool supports English and is designed to correct errors and remove filler words during transcription. Speechify's entry into voice typing aligns with a recent increase in voice detection tools, attributed to advancements in speech recognition models over the past year.

In initial evaluations, the tool worked within applications like Gmail and Google Docs, though some users reported difficulty triggering dictation and getting consistent performance on other platforms, such as WordPress. The company has stated it is gradually optimizing the tool for popular websites.

Regarding accuracy, initial observations indicated a higher word error rate compared to some established dictation tools, including Wispr Flow, Willow, and Monologue. Speechify has noted that its model is designed to improve accuracy and reduce error rates as users engage with it over time.
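
For readers unfamiliar with the metric, word error rate is conventionally the number of word substitutions, deletions, and insertions needed to turn a transcript into the reference text, divided by the number of words in the reference. The sketch below illustrates that standard definition only; the function name and sample sentences are hypothetical, and nothing here reflects how Speechify or the other tools named above were actually evaluated.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Illustrative helper (hypothetical name): lowercases both strings and
    splits on whitespace, with no punctuation handling.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty transcript
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra word in transcript
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Made-up sentences: one substitution ("the" -> "a") and one deletion ("by")
# against a five-word reference.
example = word_error_rate("send the report by friday", "send a report friday")
```

A lower value means fewer transcription mistakes per reference word. The value can exceed 1.0 when a transcript contains many inserted words.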

A conversational voice assistant has also been integrated into the browser's sidebar. This feature allows users to query website content, asking for summaries of key ideas or simplified explanations. Rohan Pavuluri, Speechify's chief business officer, stated via email that the company positions its voice-centric approach as a primary interaction method, contrasting it with platforms like ChatGPT and Gemini, where voice is often a secondary feature. Pavuluri indicated that a segment of the market, including Speechify's user base, prefers voice as the default mode of interaction with AI.

Currently, the assistant does not operate with browsers that feature built-in sidebar assistants, such as OpenAI's Atlas, Perplexity's Comet, and Dia, a limitation the company attributes to focusing on the Chrome browser and its user base. Speechify plans to roll out both voice typing and the voice assistant to its full suite of desktop and mobile applications incrementally.

Looking ahead, the startup aims to develop AI agents capable of completing tasks on behalf of users, citing examples such as making appointments or managing customer support hold times. This direction parallels efforts by other companies in the sector, including Truecaller and Cloaked, which are exploring similar AI-driven task automation functionalities.

