西北工业大学ASLP实验室OSUM项目demo展示
Transcribe audio to text
Transcribe audio to text
Transcribe audio to text
Transcribe audio files into text
Transcribe audio into text
Hebrew audio-to-text by ivirit-ai model
Transcribe audio to text using voice input
Generate podcast audio from text or documents
Generate transcript from audio input
Transcribe audio to text with speaker diarization
Transcribe audio to text
voice to text
OSUM is a transcription tool developed by the ASLP laboratory at Northwestern Polytechnical University. It is designed to transcribe audio from podcasts into readable text. This tool provides users with an efficient way to convert spoken content into a written format, making it easier to analyze, share, or reference later. OSUM emphasizes accuracy and usability, catering to both researchers and general users who need reliable transcription services.
• Audio-to-Text Conversion: Accurately transcribes podcast audio files into text. • Customizable Options: Offers flexibility in transcription settings to meet specific needs. • Text Export: Allows users to export transcribed text for further use. • User-Friendly Interface: Provides an intuitive interface for easy navigation and use. • Support for Multiple Formats: Compatible with various audio file formats.
What formats does OSUM support for audio files?
OSUM supports common audio formats such as MP3, WAV, and OGG. For a full list of supported formats, refer to the tool's documentation.
Can I edit the transcribed text directly on the platform?
Yes, OSUM allows users to edit the transcribed text within the interface before exporting it.
How do I access OSUM?
OSUM is available as a web-based tool through the ASLP laboratory's official website. Simply navigate to the demo page and follow the instructions to start using it.