Zhipu AI Releases GLM-5.1 Open-Source 754B Model

1. Zhipu AI Releases GLM-5.1 Open-Source 754B Model

Zhipu AI has released GLM-5.1, a 754B-parameter model optimized for long-horizon agentic tasks, on Hugging Face under a permissive open-source license. The model is designed to run autonomously for up to eight hours on a single task. It currently achieves state-of-the-art performance on SWE-Bench Pro, outperforming proprietary models like Claude Opus 4.6 and GPT-5.4. Developers can download and customize the model for commercial use or access it via HuggingChat.

2. Anthropic Previews Claude Mythos for Private Cybersecurity Consortium

Anthropic has announced Claude Mythos Preview, a highly capable model that will remain unreleased to the public due to security concerns. The model is being shared privately with a consortium of 12 major tech companies, including AWS, Google, and Microsoft, under Project Glasswing. Mythos autonomously discovered thousands of high-severity zero-day vulnerabilities across major operating systems and browsers without human steering. This signals a major shift in how frontier models will be gated and deployed for defensive cybersecurity operations.

3. AWS Launches Amazon S3 Files for Native Agentic Workspaces

AWS has introduced Amazon S3 Files, a feature that provides AI agents with a native file system workspace directly on top of S3 object storage. This eliminates the need for developers to build separate file system layers or data synchronization pipelines to bridge the gap between API-driven object stores and file-path-dependent agents. Engineering teams can now point tools like Claude Code directly at S3 data without losing session state or compacting context windows during local downloads.

4. OpenAI Previews Next-Generation Image V2 Model

OpenAI is currently testing three variants of its next-generation Image V2 model on ChatGPT and the LM Arena. Early testing indicates improvements in prompt adherence, compositional understanding, and UI design rendering. This limited-access preview signals an upcoming upgrade to OpenAI's image generation API capabilities.

5. Google Previews Jules V2 Autonomous Coding Agent

Google is developing Jules V2, a coding agent designed to autonomously manage high-level development goals rather than executing specific task-based commands. The agent is currently launching via a waitlist for early testing. This KPI-driven approach aims to help teams manage large codebases, though it introduces new challenges around unpredictable code changes and verification.

6. Google Open-Sources Scion Agent Orchestration Testbed

Google has open-sourced Scion, an experimental testbed for agent orchestration. The framework provides developers with a structured environment to build, test, and evaluate multi-agent workflows. This release offers a new reference architecture for teams designing complex agentic systems on Google Cloud.

7. Open-Source Gemma 4 Multimodal Fine-Tuner for Apple Silicon

A new open-source project provides a local fine-tuning pipeline for Gemma 4 specifically optimized for Apple Silicon. The tool allows developers to stream training data directly from Google Cloud Storage, bypassing local storage constraints for large datasets. It includes specific optimizations for multimodal and audio fine-tuning on Mac hardware, though developers should monitor memory usage on longer sequences to prevent out-of-memory errors.

8. ACE-Step 1.5 XL Open Music Generation Model Released

A new 4B parameter music generation model, ACE-Step 1.5 XL, is now available on Hugging Face under an MIT license. The model supports text-to-music, cover generation, repainting, and audio extraction tasks. It was trained on legally compliant datasets, making it suitable for commercial integration in audio applications.

9. AutoAgent Open-Sources Self-Improving Agent Harness

AutoAgent is a new MIT-licensed framework that allows a meta-agent to autonomously engineer and optimize its own execution harness. The system uses hill-climbing techniques to iteratively improve its tool-use and memory structures overnight. The framework claims top performance on TerminalBench and SpreadsheetBench, offering developers a novel approach to building self-refining agent architectures.

10. GitNexus Indexes Codebases for AI Agent Context

GitNexus is a new tool that indexes entire codebases into knowledge graphs to provide deep context for AI agents like Cursor and Claude Code. By mapping dependencies and call chains, it helps agents understand repository relationships before making edits. The tool includes a CLI for editor integration and a web UI, reducing the risk of agents introducing breaking changes due to limited context windows.

11. Nia Mounts Web Documentation as Virtual Filesystems for Agents

Nia is a new tool that mounts online documentation sites as virtual filesystems, allowing AI agents to navigate them using standard terminal commands like grep and tree. This client-side, in-memory shell works with Claude Code, Copilot, and Gemini to provide real-time access to current APIs. By treating the web as a filesystem, developers can reduce code hallucinations caused by stale training data without building complex tool schemas.