JavaScript Text to Speech

Top Text-to-Speech Models of 2026: Proprietary vs Open Source Compared

Open source TTS models Kokoro, Orpheus, and Piper are tested on symbols, abbreviations, and prosody with CER and MOS results.

Slator

Mistral Completes Voxtral Speech Stack With Launch of Text-to-Speech Model

Mistral launches Voxtral TTS, extending its model family into speech generation and enabling end-to-end voice workflows.

Google’s Latest AI-Powered App Won’t Need Internet to Work

The official website for Google AI Edge Eloquent is hosted on Google’s developer-focused google.dev domain, underscoring that ...

Duncan Banner

How to spot an Amazon text scam and other common Amazon scams

Javascript is required for you to be able to read premium content. Please enable it in your browser settings.

WinBuzzer

Cohere’s Open-Source Transcribe Model Tops ASR Leaderboard

Cohere has released Transcribe, a 2-billion-parameter open-source speech recognition model that tops the Hugging Face Open ...

aibusiness

Mistral AI Launches Text-to-Speech Model

Mistral AI is expanding its Voxtral model family with its first text-to-speech model. The launch comes amid intensifying competition in the fast-growing AI voice market, with Voxtral TTS pitched as an ...

TechCrunch

Mistral releases a new open source model for speech generation

French AI company Mistral released a new open source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets ...

GitHub

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Small and fast: only 123M parameters. High-quality voice cloning: state-of-the-art performance in speaker similarity, intelligibility, and naturalness. Multi-lingual: support Chinese and English.

IEEE

Advancing Text-to-Speech Systems for Low-Resource Languages: Challenges, Innovations, and Future Directions

Abstract: Speech synthesis, the technology that converts text into spoken words, has advanced significantly for high-resource languages like English, Spanish, and Mandarin. However, many languages ...

GitHub

real-time-transcription

#3 Winner of Best Use of Zoom API at Stanford TreeHacks 2025! An AI-powered meeting assistant that captures video, audio and textual context from Zoom calls using multimodal RAG. WhisperVoice is a ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results