Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

NVIDIA introduces Cosmos 3, an open omni-model designed for physical AI reasoning and action. This innovative model integrates various data modalities to empower robots and other physical AI systems with advanced understanding and interaction capabilities.

Author: Morein.ai EditorialPublished: June 1, 2026Updated: 6/1/2026

NVIDIA has unveiled Cosmos 3, a groundbreaking open omni-model. This innovation is specifically engineered for physical AI systems, enabling advanced reasoning and action capabilities. It represents a significant leap forward in how AI can interact with the physical world.

The core of Cosmos 3 lies in its ability to integrate diverse data modalities. This allows the model to process and understand information from various sources, such as visual input, tactile feedback, and auditory data. By combining these, Cosmos 3 creates a more comprehensive perception of its environment.

This integrated approach empowers robots and other physical AI entities. They can now perform complex tasks with greater autonomy and intelligence. The model's design facilitates a deeper understanding, leading to more responsive and effective interactions in real-world scenarios.

Read original source

Tools & Platforms

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

CUGA, IBM's open-source Agent Harness, simplifies building agentic applications by handling infrastructure, allowing developers to focus on tools and prompts. It offers pre-assembled components for planning, execution, and state management, significantly reducing development time. CUGA has topped agent benchmarks like AppWorld and WebArena.

Hugging Face - BlogJun 23, 2026

Tools & Platforms

OpenAI launches new initiative to help find and patch open source bugs

OpenAI has launched "Patch the Planet," a new initiative in partnership with cybersecurity firm Trail of Bits, to enhance the security of open-source projects. This program aims to assist maintainers in identifying and patching bugs, utilizing OpenAI's AI-powered security tools while reducing the burden on project teams.

AI News & Artificial Intelligence | TechCrunchJun 23, 2026

Tools & Platforms

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters

Baidu has released PP-OCRv6, an advanced optical character recognition (OCR) model supporting 50 languages. Available on Hugging Face, this version significantly improves accuracy and efficiency across various parameter sizes, from 1.5 million to 34.5 million, marking a substantial leap in multilingual OCR technology.

Hugging Face - BlogJun 22, 2026

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

Related articles

Build real agentic apps using CUGA: two dozen working examples on a lightweight harness

OpenAI launches new initiative to help find and patch open source bugs

PP-OCRv6 on Hugging Face: 50-Language OCR from 1.5M to 34.5M Parameters