Databricks brings GPT-5.5 to enterprise agent workflows
Databricks is integrating GPT-5.5 into its enterprise agent workflows, leveraging the model's state-of-the-art performance on complex document tasks. GPT-5.5 has significantly reduced errors and improved accuracy in handling scanned PDFs and multi-step processes.
Databricks is making GPT-5.5 available for customer agent workflows. This integration follows the model's achievement of a new state of the art on OfficeQA Pro, Databricks’ benchmark for complex enterprise document tasks. This advancement is set to enhance how businesses handle intricate document processes.
OfficeQA Pro evaluates how AI models manage parsing, retrieval, and grounded reasoning across various document types, including scanned PDFs, legacy files, and long-context documents. These tasks often pose significant challenges for production agent systems due to the complexity and potential for error.
GPT-5.5 demonstrated a substantial improvement, reducing errors by 46% compared to GPT-5.4 in the agent-harness setting. It also became the first model to surpass 50% accuracy on the OfficeQA Pro benchmark, highlighting its enhanced capability in handling enterprise-level document challenges.
The most significant gains from GPT-5.5 were observed in parsing-heavy workflows. Earlier models struggled with correctly extracting information from older documents and scanned PDFs. However, GPT-5.5 shows a remarkable improvement in these areas, ensuring greater accuracy from the initial stages of document processing.
Improvements were also noted in the orchestration of multi-step tasks. Previous models sometimes engaged in inefficient search detours, which GPT-5.5 has largely overcome. This leads to more reliable retrieval of relevant context and more efficient completion of complex workflows without extensive supervision.
Databricks customers can now access GPT-5.5 through AI Unity Gateway, using it within workflows built with AgentBricks and the Agent Supervisor API. In these systems, GPT-5.5 handles the orchestration of parsing, retrieval, and execution across specialized agents, offering a robust solution for enterprise automation.
Related articles
Build real agentic apps using CUGA: two dozen working examples on a lightweight harness
CUGA, IBM's open-source Agent Harness, simplifies building agentic applications by handling infrastructure, allowing developers to focus on tools and prompts. It offers pre-assembled components for planning, execution, and state management, significantly reducing development time. CUGA has topped agent benchmarks like AppWorld and WebArena.
OpenAI launches new initiative to help find and patch open source bugs
OpenAI has launched "Patch the Planet," a new initiative in partnership with cybersecurity firm Trail of Bits, to enhance the security of open-source projects. This program aims to assist maintainers in identifying and patching bugs, utilizing OpenAI's AI-powered security tools while reducing the burden on project teams.
PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters
Baidu has released PP-OCRv6, an advanced optical character recognition (OCR) model supporting 50 languages. Available on Hugging Face, this version significantly improves accuracy and efficiency across various parameter sizes, from 1.5 million to 34.5 million, marking a substantial leap in multilingual OCR technology.
