GPT

Simple Definition

GPT stands for Generative Pre-trained Transformer. It’s the family of AI models developed by OpenAI that powers ChatGPT. Each word in the name describes how it works:

  • Generative — it generates new text
  • Pre-trained — it was trained on a large dataset before being deployed
  • Transformer — it uses the transformer neural network architecture

The GPT Model Line

ModelReleasedKey Milestone
GPT-12018Proof of concept
GPT-22019So capable OpenAI initially withheld it
GPT-32020Shocked the world with text quality
GPT-3.52022Powered the original ChatGPT
GPT-42023Multimodal, much more capable
GPT-4o2024Faster, cheaper, natively multimodal

GPT vs. ChatGPT

GPT is the underlying model — the AI technology itself.

ChatGPT is the product — the chat interface that lets people interact with GPT models.

When people say “I use ChatGPT,” they mean the product. The technology running underneath it is GPT (or increasingly GPT-4o and its variants).

How GPT Models Work

GPT models are trained to predict the next token in a sequence. After training on billions of documents from the internet, books, and code, this basic task generalizes into the ability to write essays, answer questions, write code, and reason through complex problems.

See AI terms in action

Browse practical AI workflows that use the concepts in this glossary.

Last updated: