MP-SENet is a speech enhancement model.
Generate high-quality speech from text with specified emotion and voice
Moonshine ASR models running on-device, in your web browser.
Transcribe audio or YouTube videos into text
Voice Clone Multilingual TTS
Convert text to speech with different voices
"Designed for all users, including those with disabilities."
Convert text to speech in multiple languages
Generate realistic audio from text
Identify speakers in an audio file
High-fidelity Text-To-Speech
✨[With v1.0.0] Accelerated TTS on Kokoro-82M
Turn text into speech with customizable voice, rate, and pitch
MP-SENet is a deep learning-based speech enhancement model designed to clean up noisy audio signals. It denoises the magnitude and phase spectra of the speech signal in parallel, removing unwanted background noise and restoring clearer, more intelligible speech.
What types of noise can MP-SENet remove?
MP-SENet is designed to handle a wide range of background noises, including environmental sounds, machinery noise, and crowd chatter.
Can MP-SENet work with real-time audio streams?
Yes, MP-SENet supports real-time processing, making it suitable for live applications such as voice calls or live recordings.
What audio formats does MP-SENet support?
MP-SENet supports common audio formats like WAV, MP3, and AAC, ensuring compatibility with most audio sources.
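If you want to prepare a WAV file yourself before passing it to an enhancement model, the minimal sketch below shows one way to do it with only the Python standard library. It is an illustration under stated assumptions, not MP-SENet's own loading code: the function name `load_wav_mono_float` is hypothetical, the input is assumed to be 16-bit PCM WAV, and the sample rate the model expects is not stated here, so the sketch only reports the file's rate. MP3 and AAC files would need a separate decoder, which is not covered.

```python
import array
import sys
import wave


def load_wav_mono_float(path):
    """Read a 16-bit PCM WAV file.

    Returns (sample_rate, samples), where samples is a list of mono
    floats in the range [-1.0, 1.0). Hypothetical helper for illustration.
    """
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV files")
        sample_rate = wav.getframerate()
        channels = wav.getnchannels()
        raw = wav.readframes(wav.getnframes())

    # Interpret the raw bytes as signed 16-bit integers.
    pcm = array.array("h")
    pcm.frombytes(raw)
    if sys.byteorder == "big":
        pcm.byteswap()  # WAV sample data is little-endian

    # Average the interleaved channels down to mono and scale to floats.
    frames = len(pcm) // channels
    mono = [
        sum(pcm[i * channels + c] for c in range(channels)) / channels / 32768.0
        for i in range(frames)
    ]
    return sample_rate, mono
```

The returned sample rate lets you check whether the file needs resampling before enhancement; resampling itself is left out of this sketch.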