Mike Wheatley
2025-03-11 19:51:00
siliconangle.com
The future of artificial intelligence will be dominated by AI agents, and OpenAI is now trying to accelerate that reality by letting developers build their own.
Today the AI company announced the availability of a new “Responses API,” which simplifies the process of creating and deploying AI agents that can perform tasks for their users independently.
The Responses API lets developers create AI agents that are powered by OpenAI’s large language models, and is set eventually to replace the existing Assistants API, which will be retired in about one year, the company said.
OpenAI says the new offering will facilitate the creation of AI agents that can use a file search tool to scan a company’s internal datasets and can also search the wider internet. Such capabilities are similar to those of OpenAI’s recently announced Operator agent, which relies on a Computer-Using Agent, or CUA, model to help automate tasks such as data entry.
It’s worth pointing out that OpenAI has previously acknowledged that the CUA model is somewhat unreliable when trying to automate tasks on operating systems, and it has been known to make mistakes. As such, OpenAI warns developers that the Responses API should still be considered an “early iteration,” and says it will become more reliable over time.
When using the Responses API to create an AI agent, developers can choose from two models: GPT-4o search and GPT-4o mini search. According to the company, both can browse the web autonomously to try to find answers to questions, and both cite the sources that inform their responses.
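For developers wondering what such a request might look like, here is a minimal sketch in Python that uses only the standard library. It assumes the Responses API is served at `https://api.openai.com/v1/responses`, that web search is switched on by passing a tool of type `web_search_preview`, and that the base `gpt-4o` model name is accepted. None of those details come from the announcement as reported here, so check OpenAI’s API reference before relying on them. The helper names `build_request` and `ask` are hypothetical.

```python
import json
import os
import urllib.request

# Assumed endpoint for the Responses API; verify against OpenAI's documentation.
API_URL = "https://api.openai.com/v1/responses"


def build_request(question: str, model: str = "gpt-4o") -> dict:
    """Build the JSON payload for a single web-search-enabled request."""
    return {
        "model": model,
        # Assumed tool type that lets the model search the web.
        "tools": [{"type": "web_search_preview"}],
        "input": question,
    }


def ask(question: str) -> dict:
    """Send the request and return the parsed JSON response.

    Requires an OPENAI_API_KEY environment variable and network access.
    """
    body = json.dumps(build_request(question)).encode("utf-8")
    request = urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


if __name__ == "__main__":
    # Print the payload only, so the sketch runs without a key or a network call.
    print(json.dumps(build_request("What did OpenAI announce this week?"), indent=2))
```

Calling `ask(...)` with a valid key would send the request. The sketch returns the raw JSON rather than assuming any particular response layout.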
It’s an important capability: OpenAI said the ability to search the web and scour a company’s private datasets can significantly improve the accuracy of its models, and therefore of the agents based on them. The company demonstrated how much better its search-capable models perform on its own SimpleQA benchmark, which is designed to measure the confabulation rate of AI systems.
According to OpenAI, GPT-4o search achieved a 90% score, while GPT-4o mini search scored 88%. In contrast, the new GPT-4.5 model, which has many more parameters and is therefore much more powerful, scored only 63% on the same benchmark, because it lacks the ability to search for additional information.
Even so, developers would do well to remember that although these models bring improvements, search functionality doesn’t eliminate AI confabulations or hallucinations. The benchmark scores suggest GPT-4o search still makes factual mistakes in around 10% of its responses. Such an error rate may be intolerably high for many agentic AI workloads.
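To put those figures in perspective, the short Python sketch below converts the reported SimpleQA scores into error rates and then estimates how errors could compound across a multistep task. The compounding part rests on two assumptions that do not come from OpenAI: that a benchmark score can stand in for the chance of getting one agent step right, and that steps fail independently of one another. Treat it as rough arithmetic, not a measurement.

```python
# SimpleQA scores as reported in the article.
SCORES = {
    "GPT-4o search": 0.90,
    "GPT-4o mini search": 0.88,
    "GPT-4.5": 0.63,
}


def error_rate(score: float) -> float:
    """Share of answers that are wrong, given a score between 0 and 1."""
    return 1.0 - score


def chain_success(score: float, steps: int) -> float:
    """Chance that every step in a task succeeds.

    Assumes each step succeeds with probability `score` and that steps
    are independent. Both are simplifying assumptions for illustration.
    """
    return score ** steps


if __name__ == "__main__":
    for name, score in SCORES.items():
        print(
            f"{name}: error rate {error_rate(score):.0%}, "
            f"5-step success {chain_success(score, 5):.0%}"
        )
```

Under those assumptions, a model that is right 90% of the time would complete a five-step task without a single error only about 59% of the time (0.9 to the fifth power is 0.59049).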
Still, OpenAI wants to encourage developers to at least get started. In addition to the Responses API, it released an open-source Agents SDK that provides tools for integrating AI models and agents with internal systems, as well as tools for implementing safeguards and monitoring the activities of AI agents. The SDK follows the release of another tool called Swarm, which provides a framework for developers to manage and orchestrate multiple AI agents.
No doubt some developers will be eager to see what kinds of AI agents they can create, but it’s important to remember that these technologies are still nascent and not always as effective as some users might claim. Earlier this week, a Chinese startup took the internet by storm with the debut of an AI agent called Manus that wowed some early adopters, only to be quickly found wanting once it became more widely available.