Talk to Qwen2Audio with Gradio and WebRTC ⚡️
CPU powered, low RTF, emotional, multilingual TTS
Convert spoken words into text
ヘスティアのAI音声合成モデルを作りました。
MaskGCT TTS Demo
Generate speech using a speaker's voice
Convert text into speech in Japanese
"Designed for all users, including those with disabilities."
Whisper model to transcript japanese audio to katakana.
Generate speech from text with customizable voices
Voice Clone Multilingual TTS
A demo of Indic Parler-TTS
Generate audio and SRT subtitles from text
Talk To Qwen Webrtc is a Speech Synthesis application designed to convert spoken words into text and generate responses using Gradio and WebRTC. It allows users to engage in real-time communication where spoken inputs are transcribed and processed to provide meaningful outputs. Ideal for users looking for an interactive and intuitive way to convert speech to text and beyond.
• Real-Time Speech-to-Text Conversion: Transcribes spoken words into text with low latency for immediate feedback. • WebRTC Integration: Leverages WebRTC technology for secure and reliable peer-to-peer communication. • AI-Powered Responses: Generates responses based on the transcribed text, enabling natural-sounding interactions. • Cross-Platform Compatibility: Works seamlessly across browsers and devices, ensuring accessibility. • User-Friendly Interface: Intuitive design for easy navigation and interaction.
1. What browsers are supported by Talk To Qwen Webrtc?
The application supports modern browsers like Google Chrome, Mozilla Firefox, and Microsoft Edge that are compatible with WebRTC.
2. Why am I not able to use the microphone?
Ensure your browser has permission to access your microphone. Check your browser settings or refresh the page and try again.
3. Is there any latency in the transcription process?
While the app strives for real-time transcription, slight delays may occur based on your internet connection speed and device performance.