🤗 Persian LLM Leaderboard


Evaluate Persian LLMs on various tasks

What is the 🤗 Persian LLM Leaderboard?

The 🤗 Persian LLM Leaderboard is a resource for evaluating and comparing Persian large language models (LLMs) across a range of tasks and metrics. It gives researchers, developers, and users a single place to assess how different models perform and to choose one that fits their needs. The leaderboard is designed to promote transparency and innovation in Persian natural language processing.

Features

• Model Performance Tracking: Detailed performance metrics for various Persian LLMs on tasks like text classification, summarization, and question answering.
• Task-Specific Benchmarking: Evaluation across a wide range of NLP tasks tailored to the Persian language.
• Comparative Analysis: Side-by-side comparison of models to identify strengths and weaknesses.
• Regular Updates: Continuous updates with new models, tasks, and metrics.
• Open Accessibility: Available to everyone, including researchers, developers, and enthusiasts.
• Documentation and Resources: Access to datasets, evaluation scripts, and best practices for benchmarking.

How to use the 🤗 Persian LLM Leaderboard?

  1. Visit the Leaderboard Website: Access the platform through the official link.
  2. Select a Model: Choose from the list of available Persian LLMs to view their performance.
  3. Explore Tasks and Metrics: Filter results by specific tasks or metrics, such as BLEU score for translation or accuracy for classification.
  4. Compare Models: Use the comparison feature to view side-by-side results of different models.
  5. Analyze Results: Review detailed benchmarks and documentation to understand model performance in depth.
  6. Inform Your Decision: Use the insights gained to select the most suitable model for your specific use case.
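
The filtering and comparison steps above can be sketched in a few lines of Python. This is only an illustration of the workflow, not the leaderboard's actual data format or API: the row layout, the helper names `filter_rows` and `compare`, the model names, and all scores are hypothetical placeholders.

```python
# Hypothetical leaderboard rows. Model names, tasks, and scores are
# placeholders for illustration only, not real leaderboard results.
ROWS = [
    {"model": "model-a", "task": "classification", "metric": "accuracy", "score": 0.81},
    {"model": "model-b", "task": "classification", "metric": "accuracy", "score": 0.77},
    {"model": "model-a", "task": "translation", "metric": "bleu", "score": 21.4},
    {"model": "model-b", "task": "translation", "metric": "bleu", "score": 24.9},
]


def filter_rows(rows, task=None, metric=None):
    """Keep only rows matching the given task and/or metric (step 3)."""
    return [
        r for r in rows
        if (task is None or r["task"] == task)
        and (metric is None or r["metric"] == metric)
    ]


def compare(rows, model_x, model_y):
    """Side-by-side scores for every (task, metric) both models share (step 4)."""
    by_model = {}
    for r in rows:
        by_model.setdefault(r["model"], {})[(r["task"], r["metric"])] = r["score"]
    x = by_model.get(model_x, {})
    y = by_model.get(model_y, {})
    return {key: (x[key], y[key]) for key in x if key in y}


if __name__ == "__main__":
    for row in filter_rows(ROWS, metric="bleu"):
        print(row["model"], row["score"])
    for key, (sx, sy) in compare(ROWS, "model-a", "model-b").items():
        print(key, sx, sy)
```

A comparison keyed on (task, metric) pairs keeps scores on different scales, such as accuracy and BLEU, from being mixed in one column.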

Frequently Asked Questions

What models are included in the leaderboard?
The leaderboard includes a variety of Persian language models, ranging from smaller, efficient models to larger, state-of-the-art architectures. Models are added continuously as they are developed and benchmarked.

How are models rated or ranked?
Models are ranked based on their performance on specific tasks and metrics. The ranking is determined by evaluation results on standardized datasets and may vary depending on the task or metric being considered.
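
As a rough sketch of why a ranking can change with the task, the snippet below sorts models by score separately for each task. It assumes higher scores are better. The `SCORES` table, the `rank_models` helper, and every number in it are hypothetical and are not taken from the leaderboard.

```python
# Hypothetical per-task scores (higher is better). Placeholder values only.
SCORES = {
    "model-a": {"classification": 0.81, "summarization": 0.42},
    "model-b": {"classification": 0.77, "summarization": 0.55},
    "model-c": {"classification": 0.69},  # not evaluated on summarization
}


def rank_models(scores, task):
    """Return model names ordered best-first for one task.

    Models with no score for the task are left out of that ranking.
    """
    entries = [(model, s[task]) for model, s in scores.items() if task in s]
    entries.sort(key=lambda e: e[1], reverse=True)
    return [model for model, _ in entries]


if __name__ == "__main__":
    for task in ("classification", "summarization"):
        print(task, rank_models(SCORES, task))
```

With these placeholder numbers, `model-a` leads on classification while `model-b` leads on summarization, so a single overall rank would hide the difference.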

How often is the leaderboard updated?
The leaderboard is updated regularly to include new models, tasks, and metrics. Updates are typically announced on the official platform or through associated communication channels.
