Top News

OpenAI launches GPT‑5.4 Thinking and Pro, its ‘most factual and efficient’ model yet
ETtech | March 6, 2026 5:57 PM CST

Synopsis

OpenAI has introduced GPT-5.4 Thinking and GPT-5.4 Pro, the newest upgrades to its GPT-5 AI models. The company says the model is more factual and token-efficient, with faster responses, improved research abilities, stronger context retention, better benchmark scores, and lower error rates, alongside new steerability and safety evaluation features.

OpenAI has unveiled GPT‑5.4 Thinking and GPT‑5.4 Pro, the latest upgrades to its GPT-5 family of artificial intelligence (AI) models, designed to provide solutions for professional workflows.

Taking to social media platform X, the company described GPT‑5.4 as its most “factual and efficient” model, using fewer tokens while providing faster responses. In ChatGPT, GPT‑5.4 Thinking offers improved deep web research and better context retention over longer interactions, the company said.

“....and oh—you can now interrupt the model and add instructions or adjust its direction mid-response,” it added.


GPT‑5.4: Key features

The API version of GPT‑5.4 will support context windows as large as 1 million tokens, the largest available from OpenAI to date.

OpenAI emphasised the model’s improved token efficiency, noting it can solve the same problems with far fewer tokens than its predecessor. To put that in context, tokens refer to the fundamental, smallest units of data that AI models (especially Large Language Models) use to process, understand, and generate text or images.

“GPT‑5.4 is our most token efficient reasoning model yet, using significantly fewer tokens to solve problems when compared to GPT‑5.2—translating to reduced token usage and faster speeds,” OpenAI said in a blog post.

“We’ve designed GPT‑5.4 to be performant across a wide range of computer-use workloads. It is excellent at writing code to operate computers via libraries such as Playwright, as well as issuing mouse and keyboard commands in response to screenshots ….. Developers can even configure the model’s safety behavior to suit different levels of risk tolerance by specifying custom confirmation policies,” it wrote.

GPT‑5.4 shows record-breaking benchmark performance, including top scores in computer use benchmarks OSWorld-Verified and WebArena Verified, according to OpenAI. It achieved 83% on OpenAI’s GDPval test for knowledge work.

OpenAI has also continued its efforts to reduce hallucinations and factual errors. GPT‑5.4 is 33% less likely to make errors in individual claims compared with GPT‑5.2, and overall responses are 18% less likely to contain mistakes.

Steerability

GPT‑5.4 Thinking in ChatGPT introduces a preamble for longer, more complex queries, similar to Codex (OpenAI’s coding agent, which understands and generates code from natural language). Users can add instructions or change the model’s direction mid-response, making it easier to guide outputs without starting over or requiring multiple additional turns.

This feature is already available on ChatGPT and the Android app, with iOS access coming soon.

The model can also think for longer on difficult tasks while maintaining a strong awareness of earlier conversation steps. This allows it to handle longer workflows and more complex prompts while keeping responses coherent and relevant throughout.

Availability and pricing

GPT‑5.4 is being gradually rolled out from Friday across ChatGPT and Codex.

  • ChatGPT Plus, Team, and Pro users now have access to GPT‑5.4 Thinking, which replaces GPT‑5.2 Thinking.
  • GPT‑5.2 Thinking remains available for three months for paid users under the Legacy Models section, after which it will be retired on June 5, 2026.
  • Enterprise and Education plans can enable early access via admin settings.
  • GPT‑5.4 Pro is available for Pro and Enterprise plans.
  • Context windows in ChatGPT for GPT‑5.4 Thinking remain unchanged from GPT‑5.2 Thinking.

Pricing:

pricing

Safety

A new safety evaluation has been added to examine the model’s chain-of-thought (CoT), the running commentary used to explain reasoning in multi-step tasks. Researchers have long been concerned that models could misrepresent their CoT under certain conditions.

“We find that GPT‑5.4 Thinking’s ability to control its CoT is low, which is a positive property for safety, suggesting that the model lacks the ability to hide its reasoning and that CoT monitoring remains an effective safety tool,” OpenAI said.


READ NEXT
Cancel OK