
The explosion of interest in OpenClaw was one of the last items added to the February 1 trends. In February, things went crazy. We saw a social network for agents (no humans allowed, though they undoubtedly sneak on); a multiplayer online game for agents (again, no humans); many clones of OpenClaw, most of which attempt to mitigate its many security problems; and much more. Andrej Karpathy has said that OpenClaw is the next layer on top of AI agents. If the security issues can be resolved (which is a good question), he’s probably right.
AI
- Alibaba has released a fleet of mid-size Qwen 3.5 models. Their theme is providing more intelligence with fewer computing cycles, something we all need to appreciate.
- Important advice for agentic engineering: Always start by running the tests.
- Google has released Lyria 3, a model that generates 30-second musical clips from a verbal description. You can experiment with it through Gemini.
- There’s a new protocol in the agentic stack. Twilio has released the Agent-2-Human (A2H) protocol, which facilitates handoffs between agents and humans as they collaborate.
- Yet more and more model releases: Claude Sonnet 4.6, followed quickly by Gemini 3.1 Pro. If you care, Gemini 3.1 Pro currently tops the abstract reasoning benchmarks.
- Kimi Claw is yet another variation on OpenClaw. Kimi Claw uses Moonshot AI’s most advanced model, Kimi K2.5 Thinking, and offers one-click setup in Moonshot’s cloud.
- NanoClaw is another OpenClaw-like AI-based personal assistant that claims to be more security conscious. It runs agents in sandboxed Linux containers with limited access to outside resources, limiting abuse.
- OpenAI has released a research preview of GPT-5.3-Codex-Spark, an extremely fast coding model that runs on Cerebras hardware. The company claims that it’s possible to collaborate with Codex in “real time” because it gives “near-instant” results.
- RAG may not be the newest idea in the AI world, but text-based RAG is the basis for many enterprise applications of AI. But most enterprise data includes graphs, images, and even text in formats like PDF. Is this the year for multimodal RAG?
- Z.ai has released its latest model, GLM-5. GLM-5 is an open source “Opus-class” model. It’s significantly smaller than Opus and other high-end models, though still huge; the mixture-of-experts model has 744B parameters, with 40B active.
- Waymo has created a World Model to model driving behavior. It’s capable of building realistic simulations of traffic patterns and behavior, based on video collected from Waymo’s vehicles.
- Recursive language models (RLMs) solve the problem of context rot, which happens when output from AI degrades as the size of the context increases. Drew Breunig has an excellent explanation.
- You’ve heard of Moltbook, and perhaps your AI agent participates. Now there’s SpaceMolt, a massive multiplayer online game that’s exclusively for agents.
- Anthropic and OpenAI simultaneously released Claude Opus 4.6 and GPT-5.3-Codex, both of which offer improved models for AI-assisted programming. Is this “open warfare,” as AINews claims? You mean it hasn’t been open warfare before now?
- If you’re excited by OpenClaw, you might try NanoBot. It has 1% of OpenClaw’s code, written so that it’s easy to understand and maintain. No promises about security; with all of these personal AI assistants, be careful!
- OpenAI has released a desktop app for macOS along the lines of Claude Code. It’s something that’s been missing from their lineup. Among other things, it’s intended to help programmers work with multiple agents simultaneously.
- Pete Warden has put together an interactive guide to speech embeddings for engineers, and published it as a Colab notebook.
- Aperture is a new tool from Tailscale for “providing visibility into coding agent usage,” allowing organizations to understand how AI is being used and adopted. It’s currently in private beta.
- OpenAI Prism is a free workspace for scientists to collaborate on research. Its goal is to help scientists build a new generation of AI-based tooling. Prism is built on ChatGPT 5.2 and is open to anyone with a personal ChatGPT account.
Programming
- Pi is a very simple but extensible coding agent that runs in your terminal.
- Researchers at Anthropic have vibe-coded a C compiler using a fleet of Claude agents. The experiment cost roughly $20,000 worth of tokens, and produced 100,000 lines of Rust. They’re careful to say that the compiler is far from production quality, but it works. The experiment is a tour de force demonstration of running agents in parallel.
- I never knew that macOS had a sandboxing tool. It looks useful. (It’s also deprecated, but looks much easier to use than the alternatives.)
- GitHub now allows pull requests to be turned off completely, or to be limited to collaborators. They’re doing this to allow software maintainers to eliminate AI-generated pull requests, which are overwhelming many developers.
- After an open source maintainer rejected a pull request generated by an AI agent, the agent published a blog post attacking the maintainer. The maintainer responded with an excellent analysis, asking whether threats and intimidation are the future of AI.
- As Simon Willison has written, the purpose of programming isn’t to write code but to deliver code that works. He’s created two tools, Showboat and Rodney, that help AI agents demo their software so that the human authors can verify that the software works.
- Anil Dash asks whether codeless programming, using tools like Gas Town, is the future.
Security
- There’s now an app that alerts you when someone in the vicinity has smart glasses.
- Agentsh provides execution layer security by enforcing policies to prevent agents from doing damage. As far as agents are concerned, it’s a replacement for bash.
- There’s a new kind of cyberattack: attacks against time itself. More specifically, this means attacks against clocks and protocols for time synchronization. These can be devastating in factory settings.
- “What AI Security Research Looks Like When It Works” is an excellent overview of the impact of AI on discovering vulnerabilities. AI generates a lot of security slop, but it also finds critical vulnerabilities that would have been opaque to humans, including 12 in OpenSSL.
- Gamifying prompt injection: well, that’s new. HackMyClaw is a game (?) in which participants send email to Flu, an OpenClaw instance. The goal is to force Flu to reply with secrets.env, a file of “confidential” data. There’s a prize for the first to succeed.
- It was only a matter of time: There’s now a cybercriminal who’s actively stealing secrets from OpenClaw users.
- Deno’s secure sandbox might provide a way to run OpenClaw safely.
- IronClaw is a personal AI assistant modeled after OpenClaw that promises better security. It always runs in a sandbox, never exposes credentials, has some defenses against prompt injection, and only makes requests to approved hosts.
- A fake recruiting campaign is hiding malware in programming challenges that candidates must complete in order to apply. Completing the challenge requires installing malicious dependencies that are hosted on legitimate repositories like npm and PyPI.
- Google’s Threat Intelligence Group has released its quarterly analysis of adversarial AI use. Their analysis includes distillation, or collecting the output of a frontier AI to train another AI.
- Google has upgraded its tools for removing personal information and images, including nonconsensual explicit images, from its search results.
- Tirith is a new tool that hooks into the shell to block bad commands. This is often a problem with copy-and-paste commands that use curl to pipe an archive into bash. It’s easy for a bad actor to create a malicious URL that’s indistinguishable from a legitimate URL.
- Claude Opus 4.6 has been used to discover 500 0-day vulnerabilities in open source code. While many open source maintainers have complained about AI slop, and that abuse isn’t likely to stop, AI is also becoming a valuable tool for security work.
- Two coding assistants for VS Code are malware that send copies of all the code to China. Unlike a lot of malware, they do their job as coding assistants well, making it less likely that victims will notice that something is wrong.
- Bizarre Bazaar is the name for a wave of attacks against LLM APIs, including self-hosted LLMs. The attacks attempt to steal resources from LLM infrastructure, for purposes including cryptocurrency mining, data theft, and reselling LLM access.
- The business model for ransomware has changed. Ransomware is no longer about encrypting your data; it’s about using stolen data for extortion. Small and mid-size businesses are frequent targets.
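The copy-and-paste risk in the Tirith item above (a download command piped straight into a shell) is easy to illustrate. The following is a minimal sketch of that kind of check, not Tirith's actual implementation: the function name `looks_like_pipe_to_shell` and the regular expression are hypothetical, and a real tool would need to handle far more cases (aliases, process substitution, lookalike URLs, and so on).

```python
import re

# Hypothetical pattern (an assumption for illustration, not Tirith's rules):
# a download command (curl or wget) whose output is piped, optionally
# through sudo, into a shell interpreter such as sh, bash, zsh, or dash.
PIPE_TO_SHELL = re.compile(
    r"\b(curl|wget)\b[^|;&]*\|\s*(sudo\s+)?(ba|z|da)?sh\b"
)


def looks_like_pipe_to_shell(command: str) -> bool:
    """Return True if the command pipes a download directly into a shell."""
    return PIPE_TO_SHELL.search(command) is not None


if __name__ == "__main__":
    for cmd in (
        "curl -fsSL https://example.com/install.sh | bash",
        "curl -O https://example.com/archive.tar.gz",
    ):
        verdict = "BLOCK" if looks_like_pipe_to_shell(cmd) else "allow"
        print(f"{verdict}: {cmd}")
```

A pattern match like this only catches the obvious form of the problem; it says nothing about whether the URL itself is trustworthy.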
Web
- Cloudflare has a service called Markdown for Agents that converts websites from HTML to Markdown when an agent accesses them. Conversion makes the pages friendlier to AI and significantly reduces the number of tokens needed to process them.
- WebMCP is a proposed API standard that allows web applications to become MCP servers. It’s currently available in early preview in Chrome.
- Users of Firefox 148 (which should be out by the time you read this) will be able to opt out of all AI features.
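To see why the HTML-to-Markdown conversion mentioned in the Cloudflare item above can shrink what an agent has to read, here is a rough sketch using only Python's standard library. It is not Cloudflare's implementation: the `MarkdownSketch` class and `html_to_markdown` function are hypothetical, they handle only a tiny subset of HTML (h1, h2, p, li), and character count is used as a crude stand-in for token count.

```python
from html.parser import HTMLParser


class MarkdownSketch(HTMLParser):
    """Convert a tiny subset of HTML (h1, h2, p, li) to Markdown lines."""

    # Markdown prefix for each block-level tag this sketch understands.
    PREFIX = {"h1": "# ", "h2": "## ", "li": "- ", "p": ""}

    def __init__(self):
        super().__init__()
        self.lines = []      # finished Markdown lines
        self.current = None  # block tag currently being collected
        self.buffer = []     # text fragments inside the current block

    def handle_starttag(self, tag, attrs):
        # Attributes (class, style, href) are dropped entirely.
        if tag in self.PREFIX:
            self.current = tag
            self.buffer = []

    def handle_data(self, data):
        # Text outside a known block (navigation, scripts) is ignored.
        if self.current is not None:
            self.buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self.current:
            text = " ".join("".join(self.buffer).split())
            if text:
                self.lines.append(self.PREFIX[tag] + text)
            self.current = None


def html_to_markdown(html: str) -> str:
    parser = MarkdownSketch()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.lines)


if __name__ == "__main__":
    page = (
        '<div class="nav"><span>Menu</span></div>'
        '<h1 class="title">Title</h1>'
        '<p class="lead">Hello <a href="/w">world</a>.</p>'
        "<ul><li>One</li></ul>"
    )
    markdown = html_to_markdown(page)
    print(markdown)
    print(len(page), "characters of HTML ->", len(markdown), "of Markdown")
```

Most of the savings come from discarding markup, attributes, and page chrome that carry no meaning for a model; a production converter would also need to preserve links, tables, and code blocks.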
Operations
- Wireshark is a powerful (and complex) packet capture tool. Babyshark is a text interface for Wireshark that provides an amazing amount of information with a much simpler interface.
- Microsoft is experimenting with using lasers to etch data in glass as a form of long-term data storage.
Things
- You need a desk robot. Why? Because it’s there. And fun.
- Do you want to play Doom on a Lego brick? You can.


