Benchmark

Full Form: AI Benchmark

Category: AI Evaluation

📖 Definition

Benchmarks are standardized tests used to evaluate and compare AI models. They provide objective metrics for measuring capabilities across different tasks.

🔑 Key Points

💡 Why It Matters

Benchmarks help you compare AI models objectively. However, real-world testing is still important as benchmarks don't capture everything.