Large language model (LLM)
A statistical language model, typically built on a transformer architecture and trained on large text corpora, that is used for text generation and understanding tasks.
A large language model (LLM) is a neural network (usually based on the transformer architecture) trained to predict the next token in a sequence. At scale, this objective produces systems that can follow instructions, summarize text, translate, and assist with coding, within the limits of their factual reliability and safety.
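The next-token objective can be illustrated with a toy sketch. The example below is not a transformer or a neural network; it swaps in a simple bigram count model, assuming whitespace-split words as tokens, only to show what "predict the next token" means. All function names (`train_bigram`, `next_token_probs`, `generate`) and the sample corpus are hypothetical and chosen for illustration.

```python
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each preceding token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token_probs(counts, prev):
    """Normalize the follower counts of `prev` into probabilities."""
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    return {tok: n / total for tok, n in followers.items()}


def generate(counts, start, max_new_tokens=5):
    """Greedy decoding: repeatedly append the most frequent follower."""
    out = [start]
    for _ in range(max_new_tokens):
        followers = counts.get(out[-1])
        if not followers:  # no observed continuation, so stop early
            break
        out.append(followers.most_common(1)[0][0])
    return out


# Toy corpus (an assumption for illustration, not real training data).
corpus = "the cat sat on the mat and the cat slept".split()
model = train_bigram(corpus)
probs = next_token_probs(model, "the")
print(probs)
print(generate(model, "the", max_new_tokens=1))
```

An actual LLM replaces the count table with a neural network that conditions on a long context rather than a single preceding token, but the training signal has the same shape: assign high probability to the token that actually comes next.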
LLMs are not “databases of facts”; they generalize from patterns in training data. For autonomous tool use and planning, see AI agents.