Software architecture skill: 10 reviews Claude can run

clean-code

Same author's companion: enforces the line-level standards under each new module.

filesystem

Lets Claude read the whole src/ tree to find every caller before renaming.

Bounded-context map for a service

Hand the skill a feature spec and get a Domain-Driven Design carve-up: which entities live in which context, where the seams are, and the ubiquitous-language term for each. The output is a folder layout you can scaffold from.

ForEngineers starting a new service who want the boundaries right on day one.

The prompt

Here is the spec for a checkout service (`docs/checkout-spec.md`). Apply DDD: identify the bounded contexts, assign each domain entity to one context, and name the contexts with the team's ubiquitous language — not technical layers. For each context list its core entities and the one public operation other contexts call. Return it as a directory tree under `src/` with one folder per context. Keep domain logic out of any infrastructure folder.

What slides.md looks like

src/
  ordering/            # context: "placing an order"
    Order.ts  Cart.ts  PlaceOrder.ts        ← use case
    ports/ PaymentGateway.ts (interface)
  pricing/             # context: "what it costs"
    PriceList.ts  OrderCalculator.ts
  fulfilment/          # context: "getting it shipped"
    Shipment.ts  DispatchOrder.ts
  shared-kernel/
    Money.ts  CustomerId.ts                 ← value objects only
# infra adapters live in src/infrastructure/, never inside a context

One-line tweak

Ask for a one-line context-relationship list (upstream/downstream) and you get the start of a context map without leaving the prompt.

Pairs with

mermaid-gen

Turn the context list into a rendered context-map diagram.

system-design

Zoom out from contexts to the deployment and data-flow view.

Monolith vs. microservices, for your constraints

Give the skill the team size, deploy cadence, and traffic shape; get a reasoned recommendation instead of a Medium-article default. It argues from your numbers and names the modular-monolith middle path when it fits.

ForTech leads being pushed toward microservices by hype rather than need.

The prompt

We are 4 engineers, deploying twice a week, ~200 req/s peak, one Postgres, no platform team. A stakeholder wants microservices. Argue the trade-off against THESE constraints: operational overhead, transaction boundaries, team cognitive load, and time-to-first-revenue. Recommend one of: monolith, modular monolith, or microservices — and say what would have to change for the answer to flip. Be specific; do not give a both-sides non-answer.

What slides.md looks like

Recommendation: modular monolith.

Why, against your constraints
  • 4 engineers, no platform team → microservices' ops tax (CI per
    service, distributed tracing, network failure modes) buys you
    nothing yet. It's a part-time SRE you don't have.
  • One Postgres + real money → keep transactions in-process; a
    saga across services to avoid a single COMMIT is pure downside.
  • Module boundaries (per ctx, use case 2) give you 80% of the
    decoupling with 0% of the network.

Flip the call when: a context needs independent scaling (e.g.
search at 10x the rest), OR headcount passes ~15 with >3 teams.

One-line tweak

Drop in real latency SLOs and the skill weighs network hops explicitly, sharpening the same recommendation for a performance-bound system.

Pairs with

system-design

Carry the decision into a full deployment + scaling design.

sequential-thinking

Forces the trade-off to be reasoned step by step, not asserted.

Separation-of-concerns audit on a controller

Feed the skill a fat controller and get the layering violations called out by name — SQL in the handler, business rules in the route, a domain rule leaking into the view. Each finding ships with the move that fixes it.

ForTeams whose controllers quietly grew database queries and pricing logic.

The prompt

Review `src/routes/orders.ts`. Apply the separation-of-concerns rules: no database queries in controllers, no business logic mixed into the HTTP layer, framework code out of the domain. List each violation with file:line, classify it (persistence / business-rule / framework-leak), and give the one-line fix — which layer the code should move to. Order findings worst-first. Keep the controller as a thin orchestrator.

What slides.md looks like

orders.ts — 3 violations

✗ business-rule  L41  discount math inlined in the POST handler
    → move to pricing/OrderCalculator.applyDiscount()
✗ persistence    L58  raw `db.query('SELECT … orders')` in route
    → move behind orders/ports/OrderRepository (interface)
✗ framework-leak L72  domain Order imports `express` Request type
    → pass a plain DTO into the use case; keep express at the edge

After: the handler is 9 lines — parse, call PlaceOrder, map result.

One-line tweak

Ask it to also emit the extracted `OrderRepository` interface and you get the seam stubbed out, ready to wire an adapter behind.

Pairs with

code-reviewer

Runs the broader correctness pass alongside the layering audit.

github

Open the fix as a PR with the violation list as the description.

Library-first check before you write the util

Before greenlighting custom retry/validation/state code, ask the skill to find the established library that already does it. It returns the named package and the carve-out where rolling your own is actually justified.

ForReviewers who keep approving home-grown code that npm already solved.

The prompt

We are about to write a custom retry wrapper, a custom form validator, and our own auth session store. For each, apply the library-first rule: name the established library that solves it (the skill cites `cockatiel` for retries as the canonical example), and state the one condition under which writing it ourselves is justified. If custom is the right call, say why in one line — security control, unique business logic, or dependency overkill.

What slides.md looks like

Library-first verdicts

retry wrapper   → use cockatiel (policies: retry, circuit-breaker,
                  timeout). Custom only if you need a backoff curve
                  the lib can't express — you don't.
form validation → use zod / valibot. Schema is data; hand-rolled
                  validators rot into a second source of truth.
auth sessions   → use the framework's session + Auth0/Supabase.
                  Custom ONLY if a compliance control forces full
                  control of token storage. Default: don't.

Rule of thumb: every line of custom code is a liability you must
test, document, and maintain. Borrow before you build.

One-line tweak

Point it at your `package.json` first and it will prefer libraries you already depend on, avoiding a new transitive tree for a one-off.

Pairs with

clean-code

Keeps whatever custom code survives the cut concise and comment-light.

security-review

Second opinion on the one place you chose custom for a security reason.

Untangle the deep-nesting and long-function smells

Run the skill's code-quality pass over a gnarly module: functions over 50 lines, nesting past three levels, files over 200 lines. It returns a refactor list using early returns and extracted, domain-named helpers.

ForAnyone staring at a 400-line file with five levels of indentation.

The prompt

Apply the code-quality rules to `src/billing/processInvoice.ts`: flag every function over ~50 lines, any nesting deeper than 3 levels, and the file itself if it's over 200 lines. For each, propose the fix — early-return guard clauses to kill the arrow-of-doom, and extractions into smaller functions with domain names (no `helper2`). If the file is too big, propose the split into multiple files. Show the before/after shape of the worst offender.

What slides.md looks like

processInvoice.ts — 312 lines, splits into 3

worst offender: validateAndCharge() — 71 lines, 5-deep nesting
  fix: invert the pyramid with guard clauses

  // before                      // after
  if (inv) {                     if (!inv) return err('no invoice')
    if (inv.lines) {             if (!inv.lines.length) return err('empty')
      if (paid) { … }            if (!paid) return err('unpaid')
    }                            return charge(inv)   // happy path, flat
  }

split: → InvoiceValidator.ts · InvoiceCharger.ts · processInvoice.ts

One-line tweak

Cap the change at 'guard clauses only, no file split' when you want a low-risk diff that a reviewer can read in one sitting.

Pairs with

clean-code

Pairs the structural refactor with line-level style enforcement.

semgrep

Add a static rule so the smell fails CI next time instead of re-accreting.

ADR drafted from a design discussion

Paste a messy Slack thread or meeting notes where the team argued about a choice; get back a clean Architecture Decision Record — context, options weighed, decision, consequences — using the skill's trade-off discipline.

ForWhoever got volunteered to 'write up what we decided' after the meeting.

The prompt

Turn these notes into an ADR. We debated event-driven vs. direct calls between the ordering and fulfilment contexts. Notes: `docs/meeting-2026-06.md`. Use the standard ADR shape — Context, Options considered (with the trade-off for each), Decision, Consequences (good and bad). Ground the reasoning in our actual constraints from the notes; don't invent a rationale we didn't discuss. Keep it under one page.

What slides.md looks like

# ADR-014: Ordering → Fulfilment integration

## Context
Order placement must trigger dispatch. Contexts deploy together
today (modular monolith, ADR-009).

## Options
1. Direct call PlaceOrder → DispatchOrder
     + simplest, in-process transaction
     − couples the two contexts at compile time
2. Domain event OrderPlaced, fulfilment subscribes
     + contexts stay decoupled; ready to split later
     − eventual consistency; needs an outbox for reliability

## Decision
Domain event via an in-process bus + transactional outbox.

## Consequences
+ Splitting fulfilment to its own service later is a config change.
− We accept eventual consistency on dispatch; surface it in the UI.

One-line tweak

Ask for a one-line 'Status: proposed' header and a link to the prior ADR it supersedes, and the record drops straight into a `docs/adr/` log.

Pairs with

documentation

Files the ADR into the project's docs set with consistent formatting.

github

Commit the ADR and open it for team sign-off in one step.

Catch the NIH and anti-pattern smells in review

Give the skill a diff and ask it to flag the Not-Invented-Here moves and the named anti-patterns — custom auth where Auth0 fits, a `common/shared.js` dumping ground, business logic in a React component — before they reach main.

ForCode reviewers who want the architectural smells flagged automatically.

The prompt

Review this PR diff for the anti-patterns the architecture skill calls out: NIH syndrome (custom auth/state/validation where a standard library exists), generic-naming dumping grounds (`utils`, `helpers`, `common`, `shared`), and separation-of-concerns breaks (DB queries in controllers, business logic in UI components). For each hit, name the anti-pattern, the file:line, and the standard alternative. Skip style nits — only structural smells.

What slides.md looks like

PR #218 — 3 architectural smells

⚠ NIH            auth/session.ts  — hand-rolled JWT refresh
                 → use the framework session or Auth0; this is the
                   exact case the skill warns against owning.
⚠ generic-naming common/shared.ts — 9 unrelated exports added
                 → split by domain; "shared" hides coupling.
⚠ concern-leak   CartView.tsx L88 — tax calc inside the component
                 → move to pricing/; UI renders, it doesn't compute.

No correctness bugs found — these are design-debt flags.

One-line tweak

Restrict it to 'NIH only' on a dependency-light project where the team's bias is the opposite — too many libraries, not too few.

Pairs with

code-reviewer

Layer the architecture smells on top of a full correctness review.

github

Post the smell list as inline PR review comments.

Error-handling and typed-catch pass

Ask the skill to harden a module's failure paths: typed catch blocks instead of swallowed errors, no empty `catch {}`, failures that map to a domain error rather than leaking a stack trace to the caller.

ForBackend engineers whose 'it works' code has no real failure story.

The prompt

Apply the error-handling rules to `src/payments/charge.ts`. Find every bare `catch` that swallows or re-throws raw, every place a third-party error leaks past the domain boundary, and any failure with no typed result. Propose typed catch blocks and a domain error type (e.g. `PaymentDeclined`, `GatewayUnavailable`) so callers branch on meaning, not on string matching. Keep the happy path readable; don't drown it in try/catch.

What slides.md looks like

charge.ts — error-handling fixes

✗ L23  catch (e) { console.log(e) }   ← swallowed, charge looks ok
   → rethrow as GatewayUnavailable; let PlaceOrder decide retry
✗ L40  Stripe.CardError leaks to the HTTP layer
   → map at the adapter: CardError → PaymentDeclined(reason)
✗ no Result type — callers can't tell decline from outage

type ChargeError = PaymentDeclined | GatewayUnavailable
async function charge(o: Order): Promise<Result<Receipt, ChargeError>>

One-line tweak

Ask it to also add the matching `never`-exhaustiveness switch at the call site so a new error variant fails the build, not production.

Pairs with

backend-architecture

Sets the service-wide error contract this module conforms to.

writing-tests

Generate the failure-path tests for each new domain error.

Pre-merge architecture gate in CLAUDE.md

Wire the skill's rules into a repeatable review the agent runs on every feature branch: layering intact, no generic-named files added, no new custom util that a library covers, functions and files within the size budget.

ForLeads who want the architecture standards enforced without nagging in every PR.

The prompt

Draft a short architecture-gate checklist I can drop in CLAUDE.md so the agent self-reviews before proposing a merge. Base it on the software-architecture skill: (1) no DB queries in controllers, (2) no new `utils/helpers/common/shared` files, (3) no custom code where an established library exists, (4) functions under ~50 lines, files under ~200, nesting under 3, (5) domain-named modules only. Phrase each as a yes/no the agent answers with file:line evidence.

What slides.md looks like

## Architecture gate (run before proposing a merge)

For this branch, answer each with evidence or "n/a":
  [ ] Any DB query added inside a controller?            (file:line)
  [ ] Any new utils/helpers/common/shared file?          (file:line)
  [ ] Any custom code a known library covers?            (which lib?)
  [ ] Any function > ~50 lines or file > ~200?           (file:line)
  [ ] Any module with a generic, non-domain name?        (file:line)
  [ ] Business logic living in a UI component?           (file:line)

If any box is checked, fix or justify before merge.

One-line tweak

Add a final line — 'output PASS or a numbered fix list' — so the gate returns a machine-readable verdict your CI step can grep for.

Pairs with

clean-code

The line-level half of the same standard; pair both in CLAUDE.md.

github

Run the gate as a check on every pull request automatically.

Community signal

Three voices on why architectural judgment is the part of coding that does not come for free. The first is a working engineer; the next two are the canonical references the skill leans on — the dependency rule and the bounded-context pattern.

“Architecture decisions. AI builds what you ask for. It doesn't tell you 'hey, this feature should be behind a paywall' ... You need engineering judgment for that.”

u/JustinBundrick (17-year software engineer) · Reddit

A 17-year veteran who shipped 5 iOS apps with Claude Code — his sharpest takeaway is that architectural judgment is the gap the model doesn't fill on its own. A skill that encodes that judgment is exactly the fix.

“The overriding rule that makes this architecture work is The Dependency Rule. This rule says that source code dependencies can only point inwards.”

Robert C. Martin (Clean Coder blog) · Blog

The dependency rule is the spine of the skill's 'keep business logic independent of frameworks' and 'separate domain from infrastructure' guidance.

“Bounded Context is a central pattern in Domain-Driven Design. It is the focus of DDD's strategic design section which is all about dealing with large models and teams.”

Martin Fowler (martinfowler.com) · Blog

Use case 2 asks the skill to carve a service into bounded contexts — Fowler's definition is the canonical reference for what that boundary is and why it matters.

The contrarian take

The fair objection to any architecture skill comes from the same veteran in the community signal above:

“Without the engineering foundation, AI output would've been garbage I couldn't debug.”

u/JustinBundrick · Reddit

From a thread on shipping production apps with Claude Code — the limit of AI without an engineering foundation.