Diagnosis
This page is a structured infographic projection of the X post: visible engagement signals, a glossary, an FAQ, and operational guidance, all extracted from RDF knowledge graph data.
The graph frames Claude Code quota pain as a harness design problem driven by cache invalidation, context sprawl, and token-heavy input choices.
Reply counts are visible, but X guest view gated actual reply text behind sign-in, so this projection models engagement signals and the access constraint rather than unseen comments.
The post is not just about Claude Code pricing. It is really about operational discipline: keep prefix state stable, keep context short, isolate work, and use lower-token ingestion paths.
The post separates Anthropic's fixed bugs from the remaining user-side causes of waste.
Prompt cache behavior becomes the central economic mechanism around which the rest of the harness design is organized.
The end state is a workflow that preserves the interface while sharply reducing avoidable token burn.
The source post was decomposed into section entities that move from cost diagnosis to session hygiene, model routing, ingestion, and observability.
Prompt caching is framed as the largest cost lever. The post says mid-session tool or model changes invalidate the cached prefix and force expensive rereads.
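The economics behind that claim can be sketched with toy numbers. The 10x read discount and 1.25x write surcharge below mirror Anthropic's published prompt-cache pricing multipliers, but the base price and token counts are illustrative assumptions, not the post's figures.

```python
# Illustrative sketch of why prefix invalidation is expensive.
# Prices and token counts are assumptions for demonstration only.

BASE = 3.00 / 1_000_000    # assumed $/input token (Sonnet-class pricing)
CACHE_READ = 0.1 * BASE    # cached-prefix reads billed at ~10% of base
CACHE_WRITE = 1.25 * BASE  # writing a fresh cache entry costs a premium

def turn_cost(prefix_tokens: int, new_tokens: int, cache_hit: bool) -> float:
    """Input-side cost of one turn over a long session prefix."""
    if cache_hit:
        return prefix_tokens * CACHE_READ + new_tokens * BASE
    # A mid-session tool or model change invalidates the prefix:
    # the whole context is re-read at full price and re-cached.
    return (prefix_tokens + new_tokens) * CACHE_WRITE

prefix, new = 150_000, 2_000
hit = turn_cost(prefix, new, cache_hit=True)
miss = turn_cost(prefix, new, cache_hit=False)
print(f"cache hit:  ${hit:.4f}")
print(f"cache miss: ${miss:.4f} ({miss / hit:.0f}x)")
```

With a 150k-token prefix, one invalidated turn costs roughly an order of magnitude more than a cached one, which is why the post treats mid-session changes as the dominant cost lever.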
Large default context windows are described as expensive and too permissive. The recommendation is to disable the one-million-token mode and compact before the auto-compact trigger fires.
The post turns session hygiene into a playbook: compact early, clear between unrelated work, rewind bad turns, and delegate isolated subtasks.
Subagents are presented as the underused optimization because they keep the parent context lean while routing mechanical or scoped work to cheaper models.
The post separates effort level, session model choice, and provider routing into distinct dials that should be set deliberately rather than left at expensive defaults.
Screenshots, PDF image reads, and raw large-repo reads are described as bad defaults when lower-token alternatives already exist.
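The gap between ingestion paths can be estimated with two common heuristics: Anthropic documents image cost as roughly (width x height) / 750 tokens, and plain English text runs around four characters per token. A sketch comparing a full-page screenshot with text extraction of the same page (the page dimensions and character count are assumptions):

```python
# Rough token estimates for two ingestion paths. These are heuristics,
# not exact tokenizer output: image cost uses the ~(w*h)/750 rule of
# thumb from Anthropic's vision docs, text cost the ~4 chars/token rule.

def image_tokens(width_px: int, height_px: int) -> int:
    return (width_px * height_px) // 750

def text_tokens(chars: int) -> int:
    return chars // 4

# One full-page screenshot vs. the same page extracted as plain text
# (~3,000 characters is an assumed figure for a dense page).
print(image_tokens(1536, 2048))  # -> 4194 tokens for the screenshot
print(text_tokens(3_000))        # -> 750 tokens for extracted text
```

Even with generous assumptions for the text side, the screenshot path costs several times more per page, which is the post's case for accessibility-tree browsing and PDF text extraction.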
The final operational claim is that users need historical and real-time telemetry, including cache hit rates, to correct harness design.
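A minimal version of that cache telemetry can be computed from per-turn usage records shaped like the Anthropic API's usage block (input_tokens, cache_creation_input_tokens, cache_read_input_tokens); the session below is fabricated for illustration:

```python
# Minimal cache-telemetry sketch: a session's cache hit rate is the
# fraction of input tokens served from the prompt cache rather than
# freshly processed (new input plus cache writes).

def cache_hit_rate(turns: list[dict]) -> float:
    read = sum(t.get("cache_read_input_tokens", 0) for t in turns)
    fresh = sum(t.get("input_tokens", 0) +
                t.get("cache_creation_input_tokens", 0) for t in turns)
    total = read + fresh
    return read / total if total else 0.0

# Fabricated three-turn session: turn 1 writes the big prefix to cache,
# later turns mostly read it back.
session = [
    {"input_tokens": 1_200, "cache_creation_input_tokens": 90_000},
    {"input_tokens": 900, "cache_read_input_tokens": 90_000,
     "cache_creation_input_tokens": 900},
    {"input_tokens": 700, "cache_read_input_tokens": 90_900},
]
print(f"cache hit rate: {cache_hit_rate(session):.0%}")
```

A dashboard built on this number makes prefix invalidation visible: a mid-session tool or model change shows up immediately as a collapsed hit rate.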
The post shows 9 replies, but X's guest view gates reply text behind sign-in. The graph therefore preserves the visible engagement counters and explicitly models the reply-access constraint, not the hidden thread content.
The full long-form post text, internal section structure, visible metrics, timestamp, author identity, and cited external references were all available and have been mapped into RDF.
The RDF does not stop at summarization. It turns the post into an explicit five-step operational procedure that can be reused in demos and future runs.
1. Start the session with a stable tool set and model choice, then avoid mid-session changes that invalidate the prompt cache.
2. Disable oversized context defaults when unnecessary, compact before the auto-trigger, and reset or rewind when work diverges.
3. Use subagents or agent-backed skills for scoped research, mechanical edits, and parallelizable subtasks so the parent context stays clean.
4. Prefer accessibility-tree browsing, text extraction for PDFs, and structural code graphs instead of screenshot-heavy or full-repo reads.
5. Use historical, real-time, and cache-specific dashboards so cost and quota behavior can be corrected before limits are exhausted.
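The "compact before the auto-trigger" rule in the procedure above can be sketched as a simple threshold check; the context limit and both thresholds below are assumptions for illustration, not Claude Code's actual configuration values:

```python
# Sketch of early compaction: act well before automatic compression
# would fire. All three constants are assumed values.

CONTEXT_LIMIT = 200_000    # assumed session context window
AUTO_COMPACT_AT = 0.95     # assumed fraction where auto-compaction fires
MANUAL_COMPACT_AT = 0.70   # compact deliberately well before that point

def next_action(context_tokens: int) -> str:
    usage = context_tokens / CONTEXT_LIMIT
    if usage >= AUTO_COMPACT_AT:
        return "auto-compact imminent: compaction happens on its terms, not yours"
    if usage >= MANUAL_COMPACT_AT:
        return "compact now, at a clean stopping point"
    return "keep working"

for tokens in (80_000, 150_000, 195_000):
    print(tokens, "->", next_action(tokens))
```

Compacting at a self-chosen point preserves control over what gets summarized away, whereas the automatic trigger fires mid-task on whatever happens to be in context.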
Each question and answer is projected as its own resolver-backed entity rather than plain presentation text.
The post says mid-session tool or model changes invalidate the cached prefix and force a full reread of session context.
These entities make the post's operational vocabulary reusable across future document-derived knowledge graphs.
Prompt caching: Caching of stable prompt prefixes so repeated turns reuse prior input context more cheaply.
Prefix stability: Keeping the cached session prefix unchanged so later turns continue to hit the same cache entry.
Fixed tool set: Holding the enabled tool set fixed for a session to avoid invalidating cache state.
Model pinning: Keeping the same model for a session so cached prefixes remain reusable.
Context sprawl: Token waste caused by letting long sessions accumulate too much stale or irrelevant history.
One-million-token mode: A very large context configuration the post treats as expensive for most practical Claude Code work.
Auto-compact threshold: The usage percentage at which Claude Code automatically compresses session context.
Early compaction: Manual context compression before the automatic trigger, to control token growth earlier.
Session hygiene playbook: A repeatable set of commands and habits for keeping a coding session efficient.
Compact command: A command used to condense session history before the context window grows too large.
Clear command: A command used to reset context between unrelated work.
Rewind command: A command used to back up from a bad turn instead of building more prompts on top of it.
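The cache-related glossary entries above can be made concrete with a toy cache key: treat the cacheable prefix as a hash of model, tool set, and system prompt. This is a simplification of real prompt-cache semantics, which match on literal prefix content, but it shows why any mid-session change to those inputs forfeits the cache.

```python
import hashlib
import json

# Toy cache-key model: the cacheable prefix is identified by everything
# that sits ahead of the conversation (model, tools, system prompt).
# Real prompt caches match on the literal prefix; the effect is the same.

def prefix_key(model: str, tools: list[str], system: str) -> str:
    blob = json.dumps({"model": model, "tools": sorted(tools),
                       "system": system}, sort_keys=True)
    return hashlib.sha256(blob.encode()).hexdigest()[:12]

base = prefix_key("claude-sonnet", ["bash", "edit"], "You are a coding agent.")
same = prefix_key("claude-sonnet", ["bash", "edit"], "You are a coding agent.")
new_tool = prefix_key("claude-sonnet", ["bash", "edit", "web"],
                      "You are a coding agent.")

print(base == same)      # stable prefix -> same key -> cache hit
print(base == new_tool)  # enabling a tool mid-session -> cache miss
```

Swapping the model mid-session changes the key the same way, which is the glossary's case for pinning both the model and the tool set for the life of a session.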