News
Curated articles, my blog posts, and site announcements.
Curated articles, my blog posts, and site announcements.
Dria released Watchmen, a local open-source tool that writes shared skill files for Claude Code, Codex, and pi from a user's own sessions.
Dria published research using Goodfire AI's interpretability tools to test whether individual developer code style is detectable inside a model trained on that developer's code.
Dria unveiled Kai, a long-horizon AI maintainer that reviews PRs, benchmarks performance on real GPUs, and catches production regressions across GitHub, Modal, Vercel, Linear, and Jira.
Dria compressed 30 days of long-horizon agent operation into a 12-minute benchmark and shared three findings on memory, tool drift, and feedback loops.
Dria detailed how capping agent memory at 100 entries per category and prioritizing curation over accumulation outperformed naive memory growth in long-running coding agents.
Dria pointed its KAI research agent at Qwen3.5-4B and let it author its own fine-tuning curriculum, demonstrating self-directed model training using Thinking Machines tooling.
Dria launched Mem Agent, a 4B model with local interoperable memory across Claude, ChatGPT, and LM Studio that turns stateless LLMs into stateful agents with persistent human-readable memory.