GPT
Full Form: Generative Pre-trained Transformer
Category: AI Fundamentals
📖 Definition
GPT is a type of language model architecture developed by OpenAI. It uses the Transformer architecture and is pre-trained on large datasets before being fine-tuned for specific tasks.
🔑 Key Points
- Pre-trained on diverse internet text using unsupervised learning
- Generates text by predicting the next word given previous context
- Can be fine-tuned for specific applications like coding or conversation
- Versions include GPT-3, GPT-4, and GPT-4o with increasing capabilities
💡 Why It Matters
GPT models are the foundation of ChatGPT and many other AI applications. Knowing about GPT helps you understand how most AI chatbots work.