Jun 13Vibe with Hermes Agent — Bengaluru · RSVP
HomeToolsMCPResearchHow It WorksStoriesPhilosophyCommunityArchitectureStar on GitHub
Research

Agent Research

A living digest of what people are doing with Agentic AI right now, from model drops to practical workflows to strange but useful tangents.

Last UpdatedJun 18, 2026, 10:17 PM UTC
Research Window7 days
Signals Tracked51
Sources Watched4
Snapshot

What the current cycle is saying

This cycle says the local/open-weight stack keeps getting stronger around Qwen-style setups. The deeper pattern is that agentic AI is being compressed, packaged, and made more inspectable at the same time.

Highlights

Best signals across the feed

The evolution of agentic surfaces: building with Claude Managed Agents

Anthropic
4 pts · 0 commentsJun 14, 2026, 01:46 PM UTC

Agent UX is moving beyond chat boxes — interface design is becoming part of the research frontier.

Major labsInterfaces

Rio de Janeiro's city government model Rio3.5 beats Qwen3.7 in recent benchmarks

Hacker News
142 pts · 44 commentsJun 14, 2026, 02:38 PM UTC

Qwen3.7-Max is the clearest signal yet that Alibaba is competing seriously on agentic coding benchmarks, and the "AI factory" positioning shows they're building a vertically integrated agent platform, not just a model.

EvaluationOpen weightsMajor labs

RTX 5080 and RTX 3090 Setup: 80 Tok/s on Qwen 3.6 27B Q8

Hacker News
291 pts · 108 commentsJun 13, 2026, 09:55 AM UTC

Qwen 3.6 keeps showing up as the local-first coding and agent workhorse. The momentum here is less about hype and more about the open ecosystem hardening around one strong base model family.

Open weightsMajor labs

bartowski/command-a-plus-05-2026-GGUF

Hugging Face
10 likes · 5,588 downloadsJun 13, 2026, 11:18 PM UTC

Cohere's Command A+ is a major open-weight release — a 25B active / 218B total MoE model under Apache 2.0, designed for agentic, multimodal, and multilingual tasks, deployable on as little as two H100 GPUs. This directly expands the open-source agentic model ecosystem.

Open weights

google-gemini/gemini-cli

GitHub
105398 stars · updated Thu, 18 Jun 2026 22:10:40 UTCApr 17, 2025, 05:04 PM UTC

The local/open-weight ecosystem keeps making serious agent workflows cheaper to run, inspect, and iterate.

Tool useOpen weightsMajor labsCoding agents

DietrichGebert/ponytail

GitHub
36406 stars · updated Thu, 18 Jun 2026 20:50:17 UTCJun 18, 2026, 08:50 PM UTC

The strongest practical signal right now is still better tooling around how models call, sequence, and recover from actions.

Tool useCoding agents
Source Watch

Where the signal came from

0

items in this cycle

No strong items landed here in this cycle.

1

items in this cycle

No strong items landed here in this cycle.

2

items in this cycle

No strong items landed here in this cycle.

3

items in this cycle

No strong items landed here in this cycle.

4

items in this cycle

No strong items landed here in this cycle.

Tangent Radar

What to keep an eye on next

Tool use

28 signals

The strongest practical signal right now is still better tooling around how models call, sequence, and recover from actions.

Coding agents

27 signals

The comment volume makes this a good proxy for what technically engaged builders think is worth paying attention to.

Interfaces

23 signals

Agent UX is moving beyond chat boxes — interface design is becoming part of the research frontier.

Open weights

20 signals

The local/open-weight ecosystem keeps making serious agent workflows cheaper to run, inspect, and iterate.

Major labs

16 signals

Major lab launches still reshape the practical design space for agent builders almost overnight.

Multi-agent workflows

11 signals

The comment volume makes this a good proxy for what technically engaged builders think is worth paying attention to.

Experiment Queue

Good next moves

Benchmark one local-first agent stack

The local/open-weight ecosystem looks materially better this week, especially around Qwen and Forge guardrails.

Try: Pick one real workflow and compare a local Qwen+Forge stack against a frontier hosted model on tool accuracy, latency, and cost.

Instrument one agent with better traces

Even when state machines are not explicit, better traces are still the easiest way to understand why an agent drifted or stalled.

Try: Log tool calls, retries, and summaries for one real task, then inspect which step actually predicts success or failure.

Prototype an interface beyond chat

The AI-pointer and Interaction Models threads both suggest the interface layer is becoming a genuine research lever.

Try: Build a tiny agent UI that is not just a chat box — for example a timeline, pointer assistant, or live shared workspace — and log what it changes.

Method

How the feed is built

  • Targeted Hacker News searches for high-signal agentic AI posts from the last 7 days
  • Reddit weekly top-post scan in LocalLLaMA and ClaudeAI filtered for practical agent-building signals
  • Hugging Face model search filtered toward fresh open-weight agent, deep-research, and Qwen-related releases
  • Major-lab RSS checks plus official-domain matches surfaced through Hacker News
  • GitHub repo and release scans for fresh agent tooling, dashboards, browsers, workspaces, and coding-agent updates