OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent Yet

Name: OpenAI Just Released ChatGPT Agent, Its Most Powerful Agent Yet
Uploaded: 2025-07-22T12:00:00.000000Z
Description: OpenAI’s Isa Fulford, Casey Chu, and Edward Sun explain how merging Deep Research and Operator into a single RL-trained agent unlocks hour-long autonomous computer tasks. Agent runs tasks up to one ho…

Jul 22, 2025 · ai-agents · Source ↗

Watch on YouTube ↗ Summary based on the YouTube transcript and episode description.

OpenAI’s Isa Fulford, Casey Chu, and Edward Sun explain how merging Deep Research and Operator into a single RL-trained agent unlocks hour-long autonomous computer tasks.

Agent runs tasks up to one hour on a virtual computer with text browser, GUI browser, terminal, and API integrations sharing a single file system state.
Trained via reinforcement learning across hundreds of thousands of VMs; the model discovers when to use each tool without being explicitly programmed.
Agent benchmark scores surpass human baseline on data science tasks; spreadsheet and slide generation are highlighted as new breakout capabilities.
A demo task had the agent autonomously build an OpenAI valuation model with financial projections, a spreadsheet, and a slide deck in 28 minutes.
Bio-risk was the top safety concern; weeks of red teaming were run, and a real-time monitor flags suspicious trajectories like an antivirus layer.
Core team was 3–4 researchers on Deep Research side and 6–8 on computer-use agent side — combined for only a few months before shipping.
RL is highly data-efficient: the fine-tuning dataset is orders of magnitude smaller than pre-training data, enabling fast capability iteration.
Roadmap priorities: proactive agents that act without user prompting, personalization and memory, and new interaction paradigms beyond chat.

2025-07-22 · Watch on YouTube

Related coverage

Show HN: AgentSwift – Open-source iOS builder agent

An open-source spec for Codex orchestration: Symphony

Open source Xiaomi MiMo-V2.5 and V2.5-Pro are among the most efficient (and affordable) at agentic 'claw' tasks

GitHub Copilot is moving to usage-based billing