Large language model (LLM)

A statistical language model trained on large text corpora, typically using a transformer architecture, used for generation and understanding tasks.

A large language model (LLM) is a neural network (usually based on the transformer architecture) trained to predict the next token in a sequence. At scale, this objective produces systems that can follow instructions, summarize text, translate, and assist with coding: within limits of factual reliability and safety. LLMs are a flagship application of natural language processing (NLP).

LLMs are not “databases of facts”; they generalize from patterns in training data. They are powerful but narrow relative to AGI speculation. Many chatbots are built on LLMs; the underlying model may be shared across products. For autonomous tool use and planning, see AI agents.