{"id":11152,"date":"2026-03-06T10:04:05","date_gmt":"2026-03-06T10:04:05","guid":{"rendered":"https:\/\/serisec.com\/index.php\/2026\/03\/06\/openai-launches-gpt-5-4-with-advanced-reasoning-coding-and-computer-use-capabilities\/"},"modified":"2026-03-06T10:04:05","modified_gmt":"2026-03-06T10:04:05","slug":"openai-launches-gpt-5-4-with-advanced-reasoning-coding-and-computer-use-capabilities","status":"publish","type":"post","link":"https:\/\/serisec.com\/index.php\/2026\/03\/06\/openai-launches-gpt-5-4-with-advanced-reasoning-coding-and-computer-use-capabilities\/","title":{"rendered":"OpenAI Launches GPT-5.4 With Advanced Reasoning, Coding, and Computer-Use Capabilities"},"content":{"rendered":"<p>    OpenAI Launches GPT-5.4 With Advanced Reasoning, Coding, and Computer-Use Capabilities<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>OpenAI on March 5, 2026, released GPT-5.4, its most capable and efficient frontier model to date, combining advanced reasoning, coding, and agentic workflows into a single unified system.<\/p>\n<p>The model is rolling out across ChatGPT (as GPT-5.4 Thinking), the API, and Codex, with a higher-performance GPT-5.4 Pro variant available for users requiring maximum compute on complex tasks.<\/p>\n<p>GPT-5.4 consolidates capabilities previously spread across separate models, integrating the industry-leading coding strengths of GPT-5.3-Codex with improved general <a href=\"https:\/\/cybersecuritynews.com\/anthropic-claude-under-large-scale-distillation-attacks\/\" target=\"_blank\" rel=\"noreferrer noopener\">reasoning and native computer-use capabilities<\/a>.<\/p>\n<p>The result is a model engineered for end-to-end professional workflows from spreadsheets and presentations to complex multi-step agentic tasks with less back-and-forth interaction required from users.<\/p>\n<p>In ChatGPT, GPT-5.4 Thinking introduces an upfront reasoning plan that allows users to interrupt and redirect the model mid-response without restarting, enabling more targeted, context-accurate outputs. This real-time steerability is a notable shift from prior reasoning models, where course corrections required starting over entirely.<\/p>\n<h2 class=\"wp-block-heading\" id=\"benchmark-performance\"><strong>GPT-5.4 Launched<\/strong><\/h2>\n<p>GPT-5.4 sets new state-of-the-art scores across several critical industry benchmarks:<\/p>\n<figure class=\"wp-block-table\">\n<table class=\"has-fixed-layout\">\n<thead>\n<tr>\n<th>Benchmark<\/th>\n<th>GPT-5.4<\/th>\n<th>GPT-5.3-Codex<\/th>\n<th>GPT-5.2<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>GDPval (wins or ties)<\/td>\n<td><strong>83.0%<\/strong><\/td>\n<td>70.9%<\/td>\n<td>70.9%<\/td>\n<\/tr>\n<tr>\n<td>SWE-Bench Pro (Public)<\/td>\n<td><strong>57.7%<\/strong><\/td>\n<td>56.8%<\/td>\n<td>55.6%<\/td>\n<\/tr>\n<tr>\n<td>OSWorld-Verified<\/td>\n<td><strong>75.0%<\/strong><\/td>\n<td>74.0%<\/td>\n<td>47.3%<\/td>\n<\/tr>\n<tr>\n<td>Toolathlon<\/td>\n<td><strong>54.6%<\/strong><\/td>\n<td>51.9%<\/td>\n<td>46.3%<\/td>\n<\/tr>\n<tr>\n<td>BrowseComp<\/td>\n<td><strong>82.7%<\/strong><\/td>\n<td>77.3%<\/td>\n<td>65.8%<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/figure>\n<p>On GDPval, which tests agents across 44 occupations spanning the top 9 U.S. GDP industries, GPT-5.4 matches or exceeds industry professionals in 83% of comparisons, up from 70.9% with GPT-5.2.<\/p>\n<p>On the BigLaw Bench evaluation for legal document work, the model scored 91%, according to Harvey\u2019s Head of Applied Research, Niko Grupen.<\/p>\n<p>GPT-5.4 is OpenAI\u2019s first general-purpose model with native computer-use capabilities, enabling agents to interact directly with software through screenshots, <a href=\"https:\/\/cybersecuritynews.com\/mouse-movement-in-microsoft-powerpoint-presentations-to-deliver-malware\/\" target=\"_blank\" rel=\"noreferrer noopener\">mouse commands, and keyboard inputs<\/a>.<\/p>\n<p>On OSWorld-Verified, it achieves a 75.0% success rate, surpassing human performance benchmarked at 72.4% and far exceeding GPT-5.2\u2019s 47.3%.<\/p>\n<p>On WebArena-Verified, GPT-5.4 achieves a 67.3% browser success rate, while scoring 92.8% on Online-Mind2Web using screenshot-based observations alone.<\/p>\n<p>The model also supports 1 million tokens of context in the API, enabling long-horizon task execution across large-scale agent workflows matching context window offerings from Google and Anthropic.<\/p>\n<p><a href=\"https:\/\/openai.com\/index\/introducing-gpt-5-4\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">OpenAI emphasized that GPT-5.4<\/a> is its most factual model yet, with individual claims 33% less likely to be false and full responses 18% less likely to contain errors compared to GPT-5.2.<\/p>\n<p>The model also delivers significant token-efficiency gains, using substantially fewer tokens to solve the same reasoning problems, translating directly into reduced API costs and faster response times for enterprise developers.<\/p>\n<p>In production environments, Mainstay CEO Dod Fraser reported GPT-5.4 achieved a 95% first-attempt success rate across ~30,000 property portals, completing sessions three times faster while using 70% fewer tokens versus prior computer-use models.<\/p>\n<p>GPT-5.4 Thinking is available now for ChatGPT Plus, Team, and Pro subscribers, replacing GPT-5.2 Thinking over the next three months. Developers can access GPT-5.4 and GPT-5.4 Pro through the OpenAI API, with priority processing enabled for faster token velocity in production environments.<\/p>\n<p class=\"has-text-align-center has-background\" style=\"background:linear-gradient(180deg,rgb(238,238,238) 94%,rgb(169,184,195) 100%)\"><strong>Follow us on <a href=\"https:\/\/news.google.com\/publications\/CAAqMggKIixDQklTR3dnTWFoY0tGV041WW1WeWMyVmpkWEpwZEhsdVpYZHpMbU52YlNnQVAB?hl=en-IN&amp;gl=IN&amp;ceid=IN:en\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Google News<\/a>, <a href=\"https:\/\/www.linkedin.com\/company\/cybersecurity-news\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">LinkedIn<\/a>, and <a href=\"https:\/\/x.com\/cyber_press_org\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">X<\/a> for daily cybersecurity updates. <a href=\"https:\/\/cybersecuritynews.com\/contact-us\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Contact us<\/a> to feature your stories.<\/strong><\/p>\n<p>The post <a href=\"https:\/\/cybersecuritynews.com\/gpt-5-4-launched\/\">OpenAI Launches GPT-5.4 With Advanced Reasoning, Coding, and Computer-Use Capabilities<\/a> appeared first on <a href=\"https:\/\/cybersecuritynews.com\/\">Cyber Security News<\/a>.<\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Guru Baran<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/cybersecuritynews.com\/gpt-5-4-launched\/\">Go to cyber-security-news<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI Launches GPT-5.4 With Advanced Reasoning, Coding, and Computer-Use Capabilities OpenAI on March 5, 2026, released GPT-5.4, its most capable and efficient frontier model to date, combining advanced reasoning, coding, and agentic workflows into a single unified system. The model is rolling out across ChatGPT (as GPT-5.4 Thinking), the API, and Codex, with a higher-performance [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[129,63,1440],"tags":[130],"class_list":["post-11152","post","type-post","status-publish","format-standard","hentry","category-cyber-security","category-cyber-security-news","category-tech-news","tag-cyber-security-news"],"_links":{"self":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/11152"}],"collection":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/comments?post=11152"}],"version-history":[{"count":0,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/posts\/11152\/revisions"}],"wp:attachment":[{"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/media?parent=11152"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/categories?post=11152"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/serisec.com\/index.php\/wp-json\/wp\/v2\/tags?post=11152"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}