Display a treemap of languages and datasets
More advanced and challenging multi-task evaluation
Parse bilibili bvid to aid / cid
Generate plots for GP and PFN posterior approximations
Browse and filter AI model evaluation results
This is a timeline of all the available models released
Browse and compare Indic language LLMs on a leaderboard
statistics analysis for linear regression
Make RAG evaluation dataset. 100% compatible to AutoRAG
Generate synthetic dataset files (JSON Lines)
Migrate datasets from GitHub or Kaggle to Hugging Face Hub
https://huggingface.co/spaces/VIDraft/mouse-webgen
Search for tagged characters in Animagine datasets
Corpus Map is a data visualization tool designed to display a treemap of languages and datasets. It provides a hierarchical and proportional representation of data, making it easier to understand and analyze the distribution of languages and datasets within a corpus. This tool is particularly useful for visualizing large-scale linguistic data in an intuitive and accessible manner.
• Interactive Treemap: Visualize languages and datasets as an interactive treemap, allowing users to explore and drill down into specific categories.
• Proportional Representation: Each rectangle in the treemap represents a language or dataset, with size proportional to its frequency or size in the corpus.
• Color Coding: Customizable color schemes to differentiate between languages or datasets.
• Zoom and Hover: Dynamic interaction features such as zooming and hovering over elements for detailed information.
• Filtering Options: Ability to filter datasets based on specific criteria.
• Export Functionality: Save or export visualizations for further analysis or reporting.
• Multi-Language Support: Designed to handle multiple languages and datasets simultaneously.
What types of data can Corpus Map visualize?
Corpus Map is designed to visualize linguistic data, including language distributions, dataset sizes, and corpus compositions.
How do I customize the color scheme in Corpus Map?
To customize the color scheme, go to the settings menu and select the "Color" option. Choose from predefined palettes or create your own custom colors for different categories.
Can I filter datasets in Corpus Map?
Yes, Corpus Map allows users to filter datasets based on specific criteria such as language, size, or other attributes. Use the "Filter" option in the toolbar to apply your desired settings.