Quantization

Category: AI Techniques

📖 Definition

Quantization reduces AI model size by using fewer bits to represent numbers. It trades some accuracy for dramatically smaller models that run faster and need less memory.

🔑 Key Points

💡 Why It Matters

Quantization makes powerful AI accessible to more people. It enables running large models on regular computers and even phones.