Agent Research

A living digest of what people are doing with Agentic AI right now, from model drops to practical workflows to strange but useful tangents.

Last Updated: Jun 18, 2026, 10:17 PM UTC

Research Window: 7 days

Signals Tracked: 51

Sources Watched: 4

Snapshot

What the current cycle is saying

This cycle, the local/open-weight stack keeps getting stronger around Qwen-style setups. The deeper pattern is that agentic AI is being compressed, packaged, and made more inspectable, all at the same time.

Highlights

Best signals across the feed

The evolution of agentic surfaces: building with Claude Managed Agents

Anthropic

4 pts · 0 comments · Jun 14, 2026, 01:46 PM UTC

Agent UX is moving beyond chat boxes, and interface design is becoming part of the research frontier.

Major labs · Interfaces

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks

Hacker News

142 pts · 44 comments · Jun 14, 2026, 02:38 PM UTC

Qwen3.7-Max is the clearest signal yet that Alibaba is competing seriously on agentic coding benchmarks, and the "AI factory" positioning shows they're building a vertically integrated agent platform, not just a model.

Evaluation · Open weights · Major labs

RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8

Hacker News

291 pts · 108 comments · Jun 13, 2026, 09:55 AM UTC

Qwen 3.6 keeps showing up as the local-first coding and agent workhorse. The momentum here is less about hype and more about the open ecosystem hardening around one strong base model family.

Open weights · Major labs

bartowski/command-a-plus-05-2026-GGUF

Hugging Face

10 likes · 5,588 downloads · Jun 13, 2026, 11:18 PM UTC

Cohere's Command A+ is a major open-weight release: a 25B active / 218B total MoE model under Apache 2.0, designed for agentic, multimodal, and multilingual tasks and deployable on as few as two H100 GPUs. This directly expands the open-source agentic model ecosystem.

Open weights

google-gemini/gemini-cli

GitHub

105,398 stars · updated Thu, 18 Jun 2026 22:10:40 UTC · Apr 17, 2025, 05:04 PM UTC

The local/open-weight ecosystem keeps making serious agent workflows cheaper to run, inspect, and iterate.

Tool use · Open weights · Major labs · Coding agents

DietrichGebert/ponytail

GitHub

36,406 stars · updated Thu, 18 Jun 2026 20:50:17 UTC · Jun 18, 2026, 08:50 PM UTC

The strongest practical signal right now is still better tooling around how models call, sequence, and recover from actions.

Tool use · Coding agents

Source Watch