Home
News
Business Stories AI Technology Travel Visa Asia Business Registration Telecommunication Medical Services
About Us
Home News AI Technology OpenAI o4-proto-4 achieves 82% autonomous resolution on full SWE-Bench Verified + Multi-File

OpenAI o4-proto-4 achieves 82% autonomous resolution on full SWE-Bench Verified + Multi-File

525    2026-02-25

[AI-NEWS]

Date: 2026-02-25

Content: o4-proto-4 sets new internal record by resolving 82% of SWE-Bench Verified multi-file tasks completely autonomously (design → code → test → debug → commit) over multi-hour sessions without any human edits or guidance.

Keywords : autonomous software engineering, SWE-Bench Verified, multi-file resolution, o4-proto-4, end-to-end coding agent

Previous article
Mistral Mathstral 2.0 sets new open-source SOTA on MATH and GSM8K-Hard
Next article
Meta Launches New Round of 10% Layoffs, Transfers 7,000 Employees to AI Projects
new
Meta Launches New Round of 10% Layoffs, Transfers 7,000 Employees to AI Projects Elon Musk Loses Lawsuit Against OpenAI! Google I/O 2026 Unveils Gemini 3.5 Series, Omni Video Generation and Spark Agent Baidu Releases ERNIE 5.1: Pre-Training Cost Only 6% of Industry, No.1 in China Search Kuaishou’s Kling AI Video Spinoff Planned at ~$20B Valuation OpenAI Launches $4B Deployment Company & Acquires Tomoro for Enterprise AI Moonshot AI's Kimi Secures $2B Funding, Valuation Exceeds $20B Post‑Investment Musk Explains xAI Shutdown: Merged into SpaceX, Renamed SpaceXAI, Focused on Space Compute OpenAI Launches GPT-5.5 Instant with 52.5% Lower Hallucination Rate Microsoft and OpenAI Revised Cooperation, Ending Exclusive Tie
Email subscription
About
Navigation
News
©bizyet.com