Can You Run It? LLM version
Determine GPU requirements for large language models
Model Memory Utility
Calculate memory needed to train AI models
GAIA Leaderboard
Submit models for evaluation and view leaderboard
Open Medical-LLM Leaderboard
Browse and submit LLM evaluations
LLM Performance Leaderboard
View the LLM Performance Leaderboard
mergekit-gui
Merge machine learning models using a YAML configuration file
Low-bit Quantized Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Open Object Detection Leaderboard
Request model evaluation on the COCO val 2017 dataset
Hallucinations Leaderboard
View and submit LLM evaluations
Vidore Leaderboard
Explore and benchmark visual document retrieval models
HHEM Leaderboard
Browse and submit language model benchmarks
Modelcard Creator
Create and upload a Hugging Face model card
MTEB Arena
Teach, test, and evaluate language models with MTEB Arena
European Leaderboard
Benchmark LLMs in accuracy and translation across languages
Nexus Function Calling Leaderboard
Visualize model performance on function calling tasks
LLM Safety Leaderboard
View and submit machine learning model evaluations
Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
SD To Diffusers
Convert Stable Diffusion checkpoint to Diffusers and open a PR
La Leaderboard
Evaluate open LLMs in the languages of LATAM and Spain
Export to ONNX
Export Hugging Face models to ONNX