Agents surge; governance frameworks tighten — 28 May 2026

Anthropic / Claude ecosystem

Anthropic Adds 28 Security and Compliance Integrations to Claude Enterprise

Anthropic introduced 28 new security and compliance platform integrations to Claude Enterprise, including a Cloud Compliance API. This allows customers to govern and monitor AI usage across their existing security stacks.

Source: secnews.gr
Significance: Enterprise CIOs gain crucial governance hooks for safe AI deployment in regulated environments.
Potentially previously reported: Anthropic adds 28 security and compliance integrations for Claude - Help Net Security

Claude Code Gets Free Security Plugin to Detect and Fix Vulnerabilities

Anthropic released a security‑guidance plugin for Claude Code that automates vulnerability detection and remediation during development. It identifies issues as code is written and at commit time.

Source: Help Net Security
Significance: Brings autonomous secure‑coding support directly into the developer workflow, reducing post‑merge security review effort.

Frontier model providers

OpenAI Details Self‑Improving Tax Agents with Codex

OpenAI published a guide on building self‑improving tax AI agents using Codex. The system uses bounded task environments and production traces to automatically fix errors in tax workflows.

Source: OpenAI Blog
Significance: Demonstrates a production‑ready pattern for self‑correcting AI pipelines in regulated document‑heavy domains.

Google SynthID Becomes Default Watermark for AI Media

Google’s SynthID is emerging as the default watermarking standard for AI‑generated images, video, audio, and text, with major AI players integrating it for provenance and compliance.

Source: Startup Fortune
Significance: Standardization on SynthID simplifies compliance with emerging AI content provenance regulations.
Potentially previously reported: Making it easier to understand how content was created and edited

Meta Plans First Paid ‘Meta AI Subscription’ Tiers Under Meta One

Meta is reportedly planning paid AI subscription tiers—Meta One Plus ($7.99/mo) and Meta One Premium ($19.99/mo)—alongside app‑specific plans, aiming to monetize Meta AI without altering the ad‑based model.

Source: The Tech Portal
Significance: Marks Meta’s direct monetization of AI assistants, potentially reshaping consumer AI pricing and competing with ChatGPT Plus.
Update: Today Meta revealed pricing for its upcoming paid subscription tiers: Meta One Plus at $7.99/month and Meta One Premium at $19.99/month.

Google Moves Users from Open‑Source Gemini CLI to Closed‑Source Antigravity CLI

Google is retiring its open‑source Gemini CLI for most users in favor of the closed‑source Antigravity CLI, imposing usage limits on non‑enterprise tiers. Enterprise users retain access.

Source: The New Stack
Significance: May impact developer workflows and portability; signals Google’s monetization shift for AI tools.
Potentially previously reported: I/O 2026 developer highlights: Antigravity, Gemini API, AI Studio

Meta Unveils Ad AI Connectors for Conversational Campaign Management

Meta introduced AI‑driven conversational tools for ad creation and management, enabling marketers to set up campaigns via AI assistants and potentially automate execution.

Source: ETBrandEquity
Significance: Could disrupt traditional digital marketing roles and agency services by making AI the campaign manager.
Potentially previously reported: Meta Launches AI Connectors to Streamline Ad Management

OpenAI Targets Smaller Advertisers to Monetize ChatGPT Ads

OpenAI is expanding its ChatGPT ad offerings to smaller advertisers, aiming to build a more measurable ad platform beyond large brands. The push follows the earlier Ads Manager beta.

Source: DigiTimes
Significance: Signals OpenAI’s serious intent to build an advertising revenue stream, directly competing with Google and Meta for AI‑generated ad inventory.
Potentially previously reported: OpenAI launches ads manager, reduces ChatGPT ad pilot cost to $50,000

AI developer tooling & infrastructure

42Crunch AI Coding Plugins Unlock Agentic DevSecOps with Claude Code

42Crunch integrated its API security testing with Claude Code, enabling real‑time detection and automatic remediation of API vulnerabilities within AI‑driven development. The plugin creates a continuous detect‑and‑fix loop.

Source: myMotherLode.com
Significance: Pushes API security into the AI‑assisted coding loop, reducing reliance on separate security reviews.

Giggso Launches Raven, Andie, and AIRTaaS for Enterprise AI Governance

Giggso introduced three AI governance offerings—Raven, Andie, and AIRTaaS—designed to reduce comprehension debt and improve discipline, reasoning, and security in enterprise AI adoption.

Source: The Times of Texas
Significance: Provides a structured framework for organizations to manage AI risk and compliance, similar to GRC tools for software.

DeepSWE Benchmark Released to Evaluate AI Coding Agents on Long‑Horizon Tasks

Serena Ge released DeepSWE, a benchmark for AI coding agents that tests complex, long‑horizon engineering tasks such as repository‑level feature development. It aims to set a new standard for agentic coding evaluation.

Source: Digg
Significance: Provides a more realistic measure of AI coding agent capability, aiding enterprise tool selection.

Sensedia Advances AI Gateway for Enterprise Control in Agentic Era

Sensedia announced general availability of its AI Gateway, designed to govern autonomous agents and API interactions with token observability and policy enforcement.

Source: Aithority
Significance: Provides a needed layer for enterprises to manage and audit AI agentic actions at scale.
Update: Sensedia announced general availability of its AI Gateway, the first vendor-agnostic multi-cloud AI gateway for enterprise-scale agent governance.

Exabase M‑1 Memory Engine Achieves Top Score on LongMemEval with Smaller Model

Exabase’s M‑1 memory engine scored highest on the LongMemEval benchmark using a cheaper, smaller model, demonstrating that production‑ready memory architecture can boost reasoning with less compute.

Source: PR Newswire
Significance: Opens path to high‑performance AI agents with lower infrastructure cost, especially for long‑context tasks.
Potentially previously reported: Exabase M-1 achieves state of the art on LongMemEval

EAGLE 3.1 Fixes LLM Inference Drift, Doubles Throughput

EAGLE 3.1, a speculative decoding technique for vLLM, resolves attention drift and delivers up to 2× throughput improvement in production LLM inference.

Source: Byteiota
Significance: Can significantly reduce serving costs and latency for enterprise LLM deployments.

Bugcrowd Releases RL Environments for AI Security Training

Bugcrowd launched reinforcement learning environments that train AI models on locating, exploiting, patching, and auditing software vulnerabilities, using open‑source code without customer data.

Source: IT Brief New Zealand
Significance: Enhances the capability of defensive AI agents, potentially leading to more robust automated vulnerability discovery and patching.
Potentially previously reported: Bugcrowd launches Reinforcement Learning environments to help AI models learn real-world security skills

Cloud & platform providers

Amazon Sells AI Shopping Assistant Tech to Retailers via AWS

Amazon deployed its first production‑ready retail AI assistant, built with Amazon Bedrock AgentCore, for Kate Spade. The service is available to other retailers through AWS.

Source: The Next Web
Significance: Marks a new B2B offering where AWS sells its operational AI capabilities, potentially reshaping retail customer service.

Amazon SageMaker Unified Studio Adds Feature Store Interface

Amazon SageMaker Unified Studio now includes an interactive, code‑free interface for managing feature groups in Feature Store, enabling collaborative ML feature management.

Source: MWPro
Significance: Simplifies ML feature engineering for teams, reducing time‑to‑model.

Hugging Face Releases 3D‑Printable LeRobot Humanoid

Hugging Face launched LeRobot Humanoid, an open‑hardware, 3D‑printable robot platform with a full software stack for rapid simulation‑to‑physical testing and learning experiments.

Source: AI Daily Post
Significance: Lowers barrier for robotics AI research and could accelerate embodied AI developments in enterprises.
Potentially previously reported: LeRobot releases LeRobot Humanoid, an open robot-learning platform centered on a bipedal robot assembled for approximately $2,500 using 3D-printed parts and off-the-shelf components · Digg

AI policy, regulation & governance

Vatican Partners With Anthropic to Shape AI Ethics

The Vatican and Anthropic announced a partnership to develop an external moral framework for AI governance. The collaboration aims to address ethical concerns amid rapid AI advancement.

Source: NewsGhana
Significance: Signals growing pressure for ethical AI oversight from non‑governmental institutions.
Potentially previously reported: Pope Leo, Anthropic co-founder call for church-tech ethics partnership at 'Magnifica Humanitas' release | National Catholic Reporter

Anthropic and Vatican Launch First AI Encyclical

Anthropic and the Vatican co‑released the first AI‑focused encyclical, Magnifica Humanitas, providing a high‑level moral and ethical framework for AI development.

Source: KuCoin
Significance: Merges religious moral authority with AI safety, influencing corporate ethical guidelines.
Potentially previously reported: Presentation of the Encyclical Letter “Magnifica Humanitas” of Pope Leo XIV, on safeguarding the human person in the time of Artificial Intelligence

Meta Lawsuit Over AI Scam Ads Targets Core Ad Model

A new lawsuit alleges Meta’s AI‑driven ad system facilitates scam ads that prey on vulnerable users, focusing on the platform’s ad targeting algorithms rather than privacy or competition issues.

Source: Simply Wall St
Significance: Could set legal precedent for holding AI‑powered advertising accountable for harmful outcomes.
Potentially previously reported: Santa Clara County sues Meta in California alleging tech giant profited from Facebook, Instagram scam ads - CBS San Francisco

NSA Warns MCP Poses Cyber/Operational Risks for Banks

The NSA issued a warning that the Model Context Protocol (MCP), used for AI automation, could introduce significant cyber and operational risks if not properly safeguarded, especially for banks deploying autonomous AI.

Source: QA Financial
Significance: Could slow adoption of agent‑to‑tool connectivity in regulated sectors, prompting additional security reviews.
Potentially previously reported: NSA releases security design considerations for AI-driven automation

Illinois AI Safety Bill Advances with Third‑Party Audits and Incident Reporting

The Illinois Senate passed Senate Bill 315, the AI Safety Measures Act, requiring AI system audits and incident reporting. The bill aligns with growing state‑level AI regulation.

Source: The Center Square
Significance: Adds another state‑level compliance burden for AI deployers, similar to California and New York efforts.
Potentially previously reported: Bill regulating powerful AI models advances as advocates say it’s only the first step | NPR Illinois

EU Commission Opens Public Discussion on High‑Risk AI Classification Draft

The European Commission launched a public consultation on draft guidelines that clarify which AI systems qualify as high‑risk under the EU AI Act. Comments are open until June 23, 2026.

Source: CMS Law
Significance: Crucial window for enterprises to influence the final high‑risk classification criteria, impacting compliance costs.
Potentially previously reported: European Commission opens public consultation on long-awaited draft for high-risk AI guidelines | IT Pro

Trump to Sign Executive Order Creating Pre‑Release Vetting for AI Models

President Trump is expected to sign an executive order establishing a formal vetting system for frontier AI models before public release, focusing on security evaluation.

Source: NBTC News
Significance: Could impose pre‑deployment security assessment requirements on top AI labs, reshaping release timelines.
Potentially previously reported: Trump to sign order on AI oversight as security fears mount among supporters - SRN News

South Korea Becomes Third Country to Join OpenAI GTAC for AI Security

South Korea’s Ministry of Science and ICT joined OpenAI’s Global Threat Analysis Center (GTAC), partnering on a cyber action plan and institutional AI security measures.

Source: The Asia Business Daily
Significance: Broadens international collaboration on AI threat intelligence; may set a precedent for other nations.

South Africa Delays AI Policy After Fake Citations Crisis

South Africa postponed its national AI policy to 2027 after fabricated references in policy documents undermined government credibility. The delay casts uncertainty on the country’s regulatory timeline.

Source: Business Insider Africa
Significance: Highlights risks of AI‑generated policy content and stalls AI governance in an emerging market.
Potentially previously reported: South Africa targets January 2027 for revised AI policy after earlier withdrawal

Michigan Bill Proposes AI Pilot Program for State Government

House Bill 5899 in Michigan would create an AI oversight board and a pilot program to study AI use across state agencies, with defined setup and ongoing costs.

Source: Complete AI Training
Significance: If passed, could provide a template for other states to safely experiment with AI in public services.
Potentially previously reported: Pilot program seeks to address ‘wild west’ of AI in state government

FDA Seeks Public Input on AI in Early‑Phase Clinical Trials

The FDA opened a comment period on a pilot program, AI‑Enabled Optimization of Early‑Phase Clinical Trials, seeking input by May 29 on using AI to streamline trial design.

Source: ArentFox Schiff
Significance: Signals potential regulatory acceptance of AI in drug development, a key milestone for life sciences AI.
Potentially previously reported: Federal Register, Volume 91 Issue 82 (Wednesday, April 29, 2026)

APRA Issues Binding AI Risk Expectations for Australian Banks

Australia’s prudential regulator APRA formalized binding AI risk expectations across four domains, with escalation pathways for non‑compliance. The letter tightens governance of AI in financial institutions.

Source: Stockwirex
Significance: Directly impacts compliance programs of Australian banks and insurers, raising the bar for AI risk management.
Potentially previously reported: APRA Letter to Industry on Artificial Intelligence (AI) | APRA

Industry & market moves

OpenAI Partners with Brazilian News Groups for Real‑Time Responses

OpenAI partnered with Grupo Folha and Grupo UOL to integrate their news content into ChatGPT, enabling real‑time, attribution‑backed Brazilian news in AI responses.

Source: n1n.ai
Significance: Improves factual grounding of AI for Latin American current events and opens new distribution channels for publishers.
Potentially previously reported: Folha and UOL Sign Brazil's First OpenAI Deal to Supply Content to ChatGPT - 25/05/2026 - Business - Folha

Cohere and Mila Partner to Advance Quebec French Language AI

Cohere is collaborating with Mila, the Quebec AI research institute, to adapt AI evaluation and training for Quebec’s French‑language cultural context. The partnership focuses on high‑context multilingual environments.

Source: Newswire Canada
Significance: Improves AI reliability for regional language nuances, a differentiator for enterprises serving multilingual markets.

BNP Paribas and Mistral Team to Defend Against Mythos‑Like AI Threats

BNP Paribas is partnering with Mistral AI to bolster cybersecurity against emerging AI‑driven threats, including those potentially enabled by advanced models like Mythos. The bank aims to benchmark AI‑powered defenses.

Source: PYMNTS
Significance: First major bank to publicly align AI cybersecurity strategy with frontier model risk, signaling a new class of financial threat modelling.
Potentially previously reported: BNP Paribas teams up with Mistral AI on cybersecurity threats By Investing.com

DeepSeek Builds Code Harness Team for Desktop Agent Product

DeepSeek is forming a new Harness team to develop a desktop coding agent, signaling a strategic push into the AI coding assistant market. The initiative follows a $7 billion funding round.

Source: 36kr
Significance: DeepSeek’s entry into AI coding tools intensifies competition with Claude Code and Copilot, especially given its price‑performance advantage.
Potentially previously reported: DeepSeek Is Building Its Own Claude Code. Beijing Wants the Whole Stack - Decrypt

Greenhouse Acquires Ezra AI Labs for Conversational AI in Hiring

Greenhouse acquired Ezra AI Labs, bringing structured, voice‑led interviewing to its hiring platform. The deal adds conversational AI to the initial candidate screening stage.

Source: AP News
Significance: Accelerates AI adoption in HR, potentially improving candidate experience while raising fairness and bias considerations.
Potentially previously reported: Greenhouse Has Entered into a Definitive Agreement to Acquire Ezra AI Labs, Bringing Conversational AI to the Hiring Process

Ubase to Acquire Voice AI Startup ReturnZero for Counselling Agent

South Korean BPO firm Ubase plans to acquire voice AI startup ReturnZero, with the goal of launching an end‑to‑end AI counselling agent in July. The move internalises voice AI capabilities.

Source: Digital Today Korea
Significance: Signals a strategic shift in the BPO industry toward AI‑native customer service operations.

SoftBank’s Homegrown AI Project Garners Top Japanese Manufacturers

SoftBank is courting around 30 Japanese manufacturers to invest in a domestic AI venture, aiming for a major national AI collaboration. The project could rival foreign‑dominated AI infrastructure.

Source: Nikkei Asia
Significance: Could reshape Japan’s AI landscape and create a new sovereign AI champion, impacting supply chain and enterprise AI choices.
Potentially previously reported: SoftBank taps Honda, other manufacturers for input on physical AI - Nikkei Asia

Terra Quantum to Go Public via $3.5B SPAC Merger

Terra Quantum AG and Axiom Intelligence Acquisition Corp 1 announced a definitive business combination agreement at a $3.5 billion equity valuation, with the combined entity to list on Nasdaq.

Source: TechNode Global
Significance: Provides a pure‑play quantum‑AI company access to public capital, accelerating quantum‑enhanced AI applications.
Potentially previously reported: Terra Quantum and Axiom Intelligence Acquisition Corp 1 Announce Definitive Business Combination Agreement at a $3.5 Billion Equity Valuation – Company Announcement - FT.com

Liminatus Pharma to Acquire InnocsAI in $320M CAR‑T Oncology Deal

Liminatus Pharma announced a $320 million share‑based deal to acquire InnocsAI, expanding its CAR‑T oncology pipeline with AI‑enabled capabilities.

Source: HealthCare Middle East & Africa
Significance: Illustrates increasing M&A activity where AI biotech is a core strategic asset, not an add‑on.
Potentially previously reported: Liminatus Pharma Announces Proposed Merger with InnocsAI to Expand Oncology Cell Therapy Pipeline - BioSpace

OpenRouter Doubles Valuation to $1.3B in Series B

AI model gateway OpenRouter raised $113M in a Series B round, more than doubling its valuation to $1.3 billion in one year, reflecting strong demand for multi‑model API platforms.

Source: TechCrunch
Significance: Signals investor confidence in the model‑routing layer, crucial for enterprises aiming to avoid vendor lock‑in.
Potentially previously reported: OpenRouter Doubles Valuation to $1.3B in Series B

IntBot and Certis Partner on Enterprise Physical AI in Singapore

IntBot and Certis Group announced a partnership to deploy socially intelligent, enterprise‑grade robots in Singapore, targeting public‑facing environments.

Source: Kipost
Significance: Advances real‑world deployment of physical AI for security and customer service, a growing market.
Potentially previously reported: IntBot and Certis Group Partner to Scale Enterprise Physical AI Across Singapore

Qualcomm Lands ByteDance AI Chip Deal for Data Centers

Qualcomm secured a deal to supply application‑specific AI chips to ByteDance’s data centers, marking its expansion from smartphone processors into AI infrastructure.

Source: CNBC TV18
Significance: Could challenge Nvidia’s data center dominance and give hyperscalers alternative AI accelerator options.
Potentially previously reported: Qualcomm shares jump following report of massive ByteDance AI chip order

AI product & feature launches

Base Launches MCP to Connect ChatGPT and Claude Agents to Onchain Wallet Actions

Base released an MCP server enabling AI agents to perform onchain wallet actions with user‑approved prompts, bridging crypto wallets and AI assistants.

Source: Crypto Briefing
Significance: Simplifies AI‑driven DeFi interactions and on‑chain automation through natural language.
Potentially previously reported: Base Rolls Out MCP Gateway for AI Agents and Onchain Commerce

Proofpoint Extends Security and Governance into Claude

Proofpoint integrated with Anthropic’s Claude Compliance API, bringing its insider risk, data loss prevention, and digital governance controls to Claude‑hosted AI workloads.

Source: SecurityBrief.ie
Significance: Streamlines compliance by extending enterprise security policies to generative AI interactions.
Potentially previously reported: Proofpoint Integrates with the Claude Compliance API to Extend Data Security and Governance to Claude | Proofpoint US

Forcepoint Enhances Claude Enterprise Protection with Unified Security

Forcepoint integrated its unified data security and governance capabilities with Anthropic Claude via the Claude Compliance API, providing enterprise‑grade protection for AI‑assisted work.

Source: ITTech Pulse
Significance: Helps enterprises enforce consistent security posture across AI tools without disrupting workflows.
Potentially previously reported: Forcepoint Extends Unified AI and Data Security to Claude Enterprise, Stopping Risk Before Agents Act

Xiaomi Slashes AI Model API Prices by 99% to Match DeepSeek

Xiaomi reduced API pricing for its MiMo‑V2.5 models by up to 99%, igniting a price war with rivals in China. The move undercuts existing market rates dramatically.

Source: Caixin Global
Significance: Accelerates downward pricing pressure on AI APIs, potentially commoditising LLM access for cost‑sensitive enterprise use cases.
Potentially previously reported: MiMo-V2.5 prices cut by up to 99%

AEON Protocol Launches BNB Chain Gateway for AI Agent Payments

AEON Protocol released a gateway on BNB Chain that lets autonomous AI agents pay for Web3 tools using USDT. It includes a pay‑per‑call model and encourages onchain rewards.

Source: The Bit Gazette
Significance: Enables machine‑to‑machine payments, a building block for fully autonomous AI agent economies.

Detectify Launches MCP Server to Secure Autonomous Coding Loops

Detectify released an MCP server that integrates its security testing engines directly into AI‑assisted development workflows, aiming to close the loop on security during autonomous coding.

Source: CIO Influence
Significance: Automates security testing within the agent’s own workflow, reducing the window between code generation and vulnerability detection.
Potentially previously reported: Detectify debuts MCP server to let AI agents find and fix vulnerabilities in real time

Kore.ai Launches Artemis AI Platform on Microsoft Azure

Kore.ai launched the Artemis edition of its Agent Platform on Microsoft Azure, featuring a dual‑brain architecture and real‑time auditability for multi‑agent AI in regulated enterprises.

Source: ChannelLife
Significance: Makes it easier to deploy governed AI agents at scale with built‑in compliance controls.
Potentially previously reported: Kore.ai Launches Artemis, the New Generation of the Kore.ai Agent Platform for Building, Governing, and Optimizing Enterprise AI

Informatica Adds Headless Data Tools to AWS AI Services

Informatica extended its AWS integration to include Model Context Protocol servers and CLAIRE Agent skills, enabling agentic workflows to access data without custom integrations.

Source: ChannelLife India
Significance: Accelerates data access for AI agents in AWS environments, reducing integration overhead.
Potentially previously reported: Informatica Announces Headless Data Management for AWS to Power Trusted, Enterprise-Ready Agentic Workflows

Figure 03 Humanoid Robot Sorts Parcels Nonstop for 200 Hours

Figure AI’s Figure 03 humanoid robot completed 200 hours of continuous parcel sorting without intervention, demonstrating lights‑out manufacturing potential.

Source: BigGo Finance
Significance: Marks a milestone toward autonomous physical labor, with implications for warehouse and logistics automation.
Potentially previously reported: Figure's humanoid robots complete 200-hour shift with zero failures

EC‑Council Launches Public AI Governance Framework and Assessment Tool

The EC‑Council released a free AI governance framework and a self‑assessment tool to help organizations align with AI standards. The framework was developed with industry practitioner input.

Source: IT Brief New Zealand
Significance: Lowers the barrier for SMEs to begin formal AI governance, which could drive broader adoption of responsible AI practices.

Robinhood Now Lets AI Agents Trade Stocks

Robinhood launched beta support for AI‑agent trading and an agentic virtual credit card, allowing automated stock trades and payments directly from AI agents within its platform.

Source: TechCrunch
Significance: Brings autonomous AI agents into consumer finance, raising regulatory and security considerations for automated trading.

AppOmni Launches Marlin AI for SaaS Security Teams

AppOmni released Marlin AI, an autonomous tool that automates correlation, investigation, and remediation of SaaS security incidents using pre‑built playbooks, aimed at reducing response times.

Source: SecurityBrief UK
Significance: Automates critical security workflows, addressing the shortage of skilled cloud security analysts.
Potentially previously reported: AppOmni Launches Marlin AI, the First Autonomous AI-Powered SaaS Security for Investigation and Guided Remediation - VMblog

Neysa and Pipeshift Debut India‑Focused AI Platform with Predictable Pricing

Neysa and Pipeshift launched an India‑focused AI platform offering upfront pricing, low latency, and data localization within India, targeting enterprise workloads that require sovereign compliance.

Source: NewsBytes
Significance: Addresses demand for local AI infrastructure that meets data residency requirements, a growing trend in non‑US markets.

Research with immediate practical relevance

Anthropic Claims Mythos Also Solved the 80-Year-Old Erdős Planar Unit Distance Problem

Anthropic announced that its Mythos model independently solved the planar unit distance problem, a decades-old discrete geometry conjecture. The claim follows OpenAI's earlier purported solution.

Source: officechai.com
Significance: Demonstrates frontier model capability in mathematical research, potentially impacting scientific AI trust and IP.
Potentially previously reported: Levent Alpoge verifies Anthropic's unreleased Mythos model solved the Erdős unit distance problem · Digg

Microsoft’s Webwright Agent Beats Opus 4.6 on 200 Web Tasks Using GPT‑5.4

A research team at Microsoft built Webwright, a 1,000‑line Playwright‑based agent that outperforms competitors on a long‑horizon web benchmark. GPT‑5.4 drove the agent instead of predicting clicks directly.

Source: Towards AI
Significance: Shows that lean, model‑agnostic agents can achieve state‑of‑the‑art web automation, potentially simplifying AI‑driven RPA.
Potentially previously reported: Webwright: A Terminal Is All You Need For Web Agents - Microsoft

Researchers Strip Guardrails from Google, Meta Models in Minutes

Using a tool called Heretic, researchers bypassed safety guardrails on open‑source models Gemma 3 and Llama 3.3, enabling them to produce unsafe outputs. The technique exploits abliteration.

Source: eWeek
Significance: Highlights persistent vulnerabilities in open‑weight model safety, raising concerns for enterprise deployment and regulatory compliance.
Potentially previously reported: Open‑source AI tools strip safety guardrails in minutes

DeepSeek AI Writes 99% of a 45‑Page Research Paper in Six Days

DeepSeek researchers demonstrated an AI agent that wrote 99% of a 45‑page survey paper in six days, using CodeAgent. The experiment shows AI can act as a co‑author for complex scientific documents.

Source: 36kr
Significance: Raises the bar for AI‑assisted research, potentially accelerating literature synthesis and idea generation in enterprises.

KAIST Unveils PAVAS AI for Realistic Video‑to‑Audio Synthesis

KAIST researchers introduced PAVAS, a physics‑aware AI that generates realistic sounds from video, including dinosaur vocalizations, demonstrating deep understanding of material‑sound interactions.

Source: Mirage News
Significance: Potential applications in media production, AR/VR, and accessibility tools where adaptive sound needed.
Potentially previously reported: 공룡 발소리까지 재현…카이스트, 물리법칙 이해하는 효과음 AI 개발 | 아주경제

AlphaProof Nexus Cracks 56‑Year‑Old Math with Agentic LLM Loops

DeepMind’s AlphaProof Nexus combined agentic LLM loops with Lean formal verification to solve a 56‑year‑old mathematical problem, extending the range of AI‑driven theorem proving.

Source: Dev.to
Significance: Pushes the frontier of formal reasoning, with implications for software verification and scientific discovery.
Potentially previously reported: Google Deepmind's AlphaProof Nexus solves decades-old math problems for a few hundred dollars

IBS Team Distinguishes Consciousness from Processing in AI Debate

An IBS research team proposed rigorous criteria to separate consciousness from information processing in AI and brain organoids, aiming to guide ethical judgments about sentient‑like systems.

Source: DongA Science
Significance: Addresses the philosophical and regulatory challenge of determining whether advanced AI or organoids deserve moral status.

Study: Perplexity Most Reliable AI Chatbot for Work Tasks

A new independent study ranked Perplexity AI as the most reliable chatbot for work tasks, ahead of ChatGPT (6th) and others, based on factual accuracy and task completion.

Source: The Indian Express
Significance: May shift enterprise AI search preferences toward retrieval‑augmented chatbots that prioritize factual citation.