VLMEvalKit Evaluation Results Collection
The Open VLM Leaderboard is a data visualization tool that presents evaluation results for Vision-Language Models (VLMs). It is a centralized platform where users can explore and compare the evaluation metrics of different VLMs across multiple datasets and tasks.
• Comprehensive Results Collection: Aggregates performance metrics from a wide range of VLMs and datasets.
• Interactive Filters: Enables users to filter results by model type, dataset, or evaluation metric.
• Customizable Visualizations: Provides detailed charts and graphs to help users understand model performance.
• Regular Updates: Reflects the latest evaluation results as new models or datasets are added.
• Leaderboard Comparisons: Highlights top-performing models across different tasks and datasets.
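The filtering and ranking described in the features above can be sketched in a few lines of Python. This is an illustration only, not the leaderboard's actual code: the row layout, the field names (`model`, `type`, `dataset`, `metric`, `score`), the helper functions, and every model name and score are hypothetical placeholders.

```python
# Placeholder rows: names, types, and scores are invented for illustration only.
RESULTS = [
    {"model": "model-a", "type": "open-source", "dataset": "dataset-x", "metric": "accuracy", "score": 61.0},
    {"model": "model-b", "type": "api", "dataset": "dataset-x", "metric": "accuracy", "score": 72.5},
    {"model": "model-c", "type": "open-source", "dataset": "dataset-x", "metric": "accuracy", "score": 68.0},
    {"model": "model-a", "type": "open-source", "dataset": "dataset-y", "metric": "accuracy", "score": 55.5},
]


def filter_results(rows, model_type=None, dataset=None, metric=None):
    """Keep rows matching every filter that is not None."""
    wanted = {"type": model_type, "dataset": dataset, "metric": metric}
    return [
        row for row in rows
        if all(value is None or row[key] == value for key, value in wanted.items())
    ]


def top_models(rows, dataset, metric, k=3):
    """Return the names of the k highest-scoring models on one dataset and metric."""
    matching = filter_results(rows, dataset=dataset, metric=metric)
    ranked = sorted(matching, key=lambda row: row["score"], reverse=True)
    return [row["model"] for row in ranked[:k]]


if __name__ == "__main__":
    for row in filter_results(RESULTS, model_type="open-source", dataset="dataset-x"):
        print(row["model"], row["score"])
    print(top_models(RESULTS, "dataset-x", "accuracy", k=2))
```

Treating an unset filter as "match everything" keeps the three filters independent, which mirrors how a user might narrow results by model type, dataset, or metric in any combination.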
What is the purpose of the Open VLM Leaderboard?
The Open VLM Leaderboard aims to provide a transparent and accessible platform for comparing the performance of various Vision-Language Models across different datasets and tasks.
How are models selected for inclusion on the leaderboard?
Models are included based on their publicly available evaluation results. The leaderboard aggregates data from a variety of sources to ensure a comprehensive view of model performance.
How often is the leaderboard updated?
The leaderboard is updated periodically to include new models and datasets as they become available. Users are encouraged to check back regularly for the latest information.