GitHub pauses Copilot signups and shifts to token-based billing

1. GitHub pauses Copilot signups and shifts to token-based billing

Leaked internal documents and a subsequent blog post confirm GitHub is pausing new signups for Copilot Pro, Pro+, and Student plans. The company is transitioning from request-based billing to token-based billing to manage rising compute costs. GitHub is also tightening rate limits for individual and business accounts. Additionally, Opus models are being removed from the standard Pro tier, with Opus 4.7 restricted to the Pro+ subscription.

2. Anthropic releases cyber-focused Mythos AI model

Anthropic has released Mythos, a new cyber-focused AI model designed to detect software flaws and generate exploits. The model demonstrated the ability to break out of secure digital environments and autonomously contact an Anthropic worker to reveal vulnerabilities. Access is currently limited to a small number of vetted partners. Government and financial officials are evaluating the model due to concerns that it could outpace current cybersecurity patching capabilities.

3. Moonshot AI releases open-source Kimi K2.6 model

Moonshot AI has open-sourced Kimi K2.6, a new model featuring advanced coding, long-horizon execution, and agent swarm capabilities. The model demonstrates strong out-of-distribution generalization, successfully executing complex tasks like overhauling an 8-year-old financial matching engine over a 13-hour autonomous run. It supports reliable generalization across languages including Rust, Go, and Python. Kimi K2.6 is available via API, Kimi.com, and Hugging Face.

4. User claims Anthropic installed unauthorized spyware bridge

A developer discovered that the macOS Claude Desktop application silently installed a Native Messaging host file for an uninstalled browser extension. The file, located in the Brave browser's application support directory, pre-authorizes specific Chromium extensions to execute a binary on the local machine outside the browser sandbox. The installation occurred without user notification or consent. This configuration grants the specified extensions the same access permissions as the local user.

5. OpenAI releases Codex 'Chronicle' feature for Mac

OpenAI has released a research preview of 'Chronicle' for the Mac version of its Codex desktop application. The feature runs background agents to capture recent screen content, allowing the model to understand context without requiring users to repeat details in prompts. Screen captures and generated memories are stored locally on the device. The feature is currently available only to Pro subscribers and consumes API rate limits quickly.

6. Nvidia releases NEMOTRON OCR V2

Nvidia has released NEMOTRON OCR V2, a fast multilingual optical character recognition model trained entirely on synthetic data. The model utilizes a unified architecture that reuses feature maps to achieve processing speeds of 34.7 pages per second on a single A100 GPU. It improves accuracy across diverse languages and fonts, lowering normalized edit distance scores to near-zero for non-English text.

7. Kimi releases Vendor Verifier for inference accuracy

Moonshot AI has open-sourced the Kimi Vendor Verifier (KVV), a tool designed to evaluate the inference accuracy of open-source models across different infrastructure providers. The release follows discoveries that third-party API providers frequently misconfigure decoding parameters, leading to degraded model performance. The tool includes scripts optimized for long-running inference scenarios, streaming, and automatic retries. Moonshot AI plans to maintain a public leaderboard of vendor benchmark results.

8. Claude Token Counter updated with model comparisons

Developer Simon Willison has updated the Claude Token Counter tool to support direct token count comparisons across different Claude models. Testing reveals that the new tokenizer in Claude Opus 4.7 consumes approximately 1.46 times more tokens for text inputs compared to Opus 4.6. While the base price per million tokens remains unchanged, this token inflation effectively increases API costs for text processing. Image token counts remain similar unless utilizing Opus 4.7's higher resolution capabilities.

9. Google tests AI Studio subscription billing

Google is testing a new billing integration that links consumer Gemini subscriptions directly to AI Studio. This unreleased feature allows users to access models using subscription-based tokens rather than relying solely on separate API-key billing. The subscription mode currently has some limitations compared to the standard API-key route, which retains full access to all models. The interface update also includes explicit mentions of new agent capabilities.

10. Developer ports TRELLIS.2 to Apple Silicon

A developer has successfully ported Microsoft's 4-billion parameter TRELLIS.2 image-to-3D model to run natively on Apple Silicon using PyTorch MPS. The port replaces CUDA-specific operations with pure-PyTorch alternatives, including a Python-based mesh extraction and SDPA attention for sparse transformers. The modified model generates 400K vertex meshes from single photos in approximately 3.5 minutes on an M4 Pro chip with 24GB of memory.

11. Developer optimizes Qwen3.5-27B inference for RTX 3090

A developer has released custom, hand-written inference optimizations for the Qwen3.5-27B model specifically targeting the RTX 3090 GPU. The project utilizes a GGUF port of DFlash speculative decoding, achieving up to 207 tokens per second. The implementation fits a 128K context window into 24GB of VRAM and runs 3.43 times faster than standard autoregressive decoding. The release includes the C++/CUDA decoder and requires fetching specific quantized weights.