Speech Corpus Creation Tool
Dhravani is a speech corpus creation tool for building and managing speech datasets. Users record voices and transcribe the speech into text to assemble a speech corpus. The interface is designed to be user-friendly, and AI-powered transcription is intended to keep the processing of speech data fast and accurate.
• Voice Recording: Capture high-quality audio directly through the platform.
• Automatic Transcription: AI-driven transcription converts speech to text in real time.
• Noise Reduction: Advanced filters minimize background noise and improve audio clarity.
• Multi-Language Support: Create speech corpora in multiple languages.
• Customizable Formats: Export data in various formats suitable for different applications.
• Collaboration Tools: Work with teams to annotate and validate transcriptions.
• Scalability: Handle large-scale projects with ease.
What languages does Dhravani support?
Dhravani supports a wide range of languages, including English, Spanish, Mandarin, French, and many others. Check the official documentation for the full list of supported languages.
Can I use Dhravani for commercial projects?
Yes, Dhravani is suitable for both academic and commercial projects. Review the licensing terms before using the tool commercially and make sure your use complies with them.
Is my data secure on Dhravani?
Dhravani prioritizes data security. All recordings and transcriptions are encrypted and stored securely on the platform.