Frontier Notes / Daily Signal Report


Issue —  · 2026-06-23  · 5 signals


Today


Sakana Fugu, an AI orchestration API routing tasks to multiple frontier models, failed to deliver performance gains over individual models like Claude Opus 4.8 in practical tests while being 5x more expensive and 4.5x slower.

Editor's Notes


This week’s videos reveal a growing focus on AI orchestration (Hermes Agent, Sakana Fugu) as a way to break free from single-model dependency, but hands-on tests show significant cost and speed trade-offs. Meanwhile, an architectural deep dive into GPT and Anthropic’s internal Claude Skills patterns offer grounding in how these models work and how to use them effectively.

Key Takeaways

  1. Sakana Fugu’s orchestration route was slower and costlier than using Claude Opus 4.8 directly, with only tie outcomes on most tasks.
  2. Hermes Agent allows dynamic model swapping (e.g., Minimax M3) to reduce costs while maintaining quality, especially via multimodal and voice tasks.
  3. GPT architecture relies on token embedding, multi-headed attention, and feed-forward networks stacked in blocks to predict the next token.
  4. Anthropic employees categorize Claude Skills into utility, verification, data enrichment, and orchestration, with verification being key for quality.
  5. Adding 'gotchas' as living documentation and tuning triggers improves automatic skill invocation in Claude.
  6. Despite orchestration hype, individual frontier models remain more efficient for many knowledge work tasks.
[01] llm 3 signals

Hermes Agent is the Greatest AI Tool Ever

Hermes Agent allows swapping different AI models for different tasks to optimize cost and performance, breaking the Claude/ChatGPT duopoly. Minimax M3 is highlighted as a particularly cost-effective model that rivals top-tier models at a fraction of the price, using sparse attention for efficiency. The creator demonstrates how to integrate Minimax M3 with Hermes Agent via Telegram for multimodal, web-scraping, and voice-interaction tasks.

[llm] [agents] [local-models] [cost-optimization] [multimodal] [hermes-agent]


GPT explained visually..

GPT (Generative Pre-trained Transformer) is the core architecture behind modern LLMs from labs like OpenAI, Anthropic, and DeepSeek, each tuning it for faster token generation, longer context, better tool calling, and more intelligence. The architecture builds from token embedding (giving tokens internal representation) and positional embedding, through multi-headed attention (Q, K, V vectors for relational communication), to feed-forward networks, layer normalization, and residual connections—all stacked in blocks to predict the next token.

[llm] [gpt] [transformers] [attention-mechanism] [architecture] [training]


I Battle Tested Sakana Fugu's Fable Killer

Sakana Fugu Ultra is not a standalone model but an orchestration API that routes tasks to multiple frontier models like Opus, GPT, and Gemini to achieve benchmark results matching Fable and Mythos. In practical tests across 38 tasks, Fugu tied with Claude Opus 4.8 on 36 tasks but was 4.5x slower and 5x more expensive, leading the creator to conclude it isn't worth the cost for his knowledge work, though the orchestration approach is seen as the future of AI efficiency.

[llm] [multi-agent] [orchestration] [api] [benchmarks] [cost-analysis]

[02] sakana-fugu 1 signal

Sakana Fugu Hands-On Test – Does THIS Really Beat Fable 5?

Sakana Fugu is an AI routing system that orchestrates multiple frontier models like Opus 4.8, Gemini 3.1 Pro, and GPT55, claiming superior benchmarks. In hands-on tests, it produced functional browser OS, subway scenes, and games, but often fell short of directly using individual models like GPT55, especially for 3D tasks, and incurred higher costs without clear performance gains.

[sakana-fugu] [model-orchestration] [coding-tests] [ai-agents] [frontier-models] [benchmarks]

[03] claude 1 signal

How Anthropic Employees ACTUALLY Use Claude Skills

Anthropic employees use Claude Skills in five key ways: categorizing skills into four types (utility, verification, data enrichment, orchestration), leveraging power components like scripts and templates, focusing on verification skills for quality, adding 'gotchas' as living documentation, and tuning triggers for automatic invocation.

[claude] [anthropic] [skills] [verification] [workflow] [ai-tools] [prompts]

Frontier Notes · Generated Jun 23, 2026 · 5 of 5 signals
You received this as a Frontier Notes recipient.