MP-SENet is a speech enhancement model.
Convert speech to text from audio files
Generate sexual voice sounds from text
Generate realistic voices from text
High-fidelity Text-To-Speech
Generate high-quality speech from text with specified emotion and voice
ML-powered speech recognition directly in your browser
Voice Clone Multilingual TTS
Generate audio from text or modify voice pitch
Generate text and audio responses to user queries
Generate speech from text or files
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Generate audio from text with adjustable speed
MP-SENet is a deep learning-based speech enhancement model designed to clean up noisy audio signals. It leverages advanced neural network architectures to effectively remove unwanted background noise, restoring high-quality speech for better intelligibility and listening experiences.
What types of noise can MP-SENet remove?
MP-SENet is designed to handle a wide range of background noises, including environmental sounds, machinery noise, and crowd chatter.
Can MP-SENet work with real-time audio streams?
Yes, MP-SENet supports real-time processing, making it suitable for live applications such as voice calls or live recordings.
What audio formats does MP-SENet support?
MP-SENet supports common audio formats like WAV, MP3, and AAC, ensuring compatibility with most audio sources.