SolidityBench Leaderboard
View and submit LLM benchmark evaluations
View and compare language model evaluations
Compare code model performance on benchmarks
Generate leaderboard comparing DNA models
Create demo spaces for models on Hugging Face
Evaluate RAG systems with visual analytics
Display genomic embedding leaderboard
Benchmark AI models by comparison
Convert PaddleOCR models to ONNX format
Compare and rank LLMs using benchmark scores
Submit deepfake detection models for evaluation
Convert and upload model files for Stable Diffusion
Compare audio representation models using benchmark results
Compare model weights and visualize differences
Measure over-refusal in LLMs using OR-Bench
Predict customer churn based on input details
Calculate memory usage for LLM models
Explore and manage STM32 ML models with the STM32AI Model Zoo dashboard
Calculate GPU requirements for running LLMs