a builder's codex
codex · playbooks · Synthetic audience test playbook

Synthetic audience test playbook

Scoring rubric · Wynter + Gartner, 7 dimensions

The Synthetic Audience Test

Threshold layer
Clarity · gate dimensionif clarity < 3, stop scoring; the buyer never reaches the other dimensions
Relevancedoes this speak to a real situation this buyer is actually in?
Directional only. Use before filing a gate packet on any positioning or LP variant. Never a gate verdict on its own.
Value layer
Valuedoes the buyer believe this outcome is worth pursuing?
Differentiationdoes this feel distinct from alternatives the buyer already knows?
Credibilitydoes the claim feel plausible given the source and the proof provided?
Intent layer
Urgencydoes this feel like a now problem, not a someday problem?
Friction (inverse)lower is better; every unnecessary word, qualification, or caveat is friction
Trusted-5 panel: sales and CS roles closest to buyer voice. Not internal marketers.

Pre-flight test of a positioning, LP, ad copy, or outbound opener variant against grounded buyer personas before live calibration. Returns a structured read (clarity, objections, comprehension gaps, predicted intent delta). Directional only, not a gate verdict. Use BEFORE filing a gate packet on the variant.

The synthetic audience is BUYERS, not marketers, like Nooks for SDR cold-call practice. Trusted-5 reviewers in the loop are sales/CS roles closest to the buyer voice, not internal marketers.

Source synthesis: Wynter B2B Message Layers (Clarity / Relevance / Value / Differentiation + Friction inverse), Gartner G00823537 Market Guide for B2B Message Testing (Clarity / Credibility / Urgency / Uniqueness / Value / Relevance), JTBD interviews surface the customer's actual language and the switch trigger (Moesta, verbatim language as primary signal).

When to use

When NOT to use

Inputs

  1. The variant under test (one block of copy, or N variants for comparison).
  2. Optional: target persona slice. Default: all personas.
  3. Optional: comparison anchor (current live copy, competitor claim), improves the read.

7-dimension scoring rubric

Wynter ∪ Gartner. Score 1–5 per dimension per persona. Wynter order rule: if clarity < 3, do NOT score the later dimensions. The buyer never reaches them.

variant_id: <id>
tested_at: YYYY-MM-DD
buyer_state: cold | researching | shortlisting | comparing | objecting | renewing
panel:
  - persona: <id>
    clarity_1to5            # I get it
    relevance_1to5          # It's for me
    value_1to5              # I want the promises
    differentiation_1to5    # I get how this is different
    credibility_1to5        # I believe the claims (data, not salesy)
    urgency_1to5            # I should act now
    friction_1to5           # Resistance / doubts (INVERSE — lower is better)
    one_read_takeaway: <verbatim — what the buyer thinks this says>
    objections: [verbatim]
    comprehension_gaps: [bullets]
    irrelevant_message_avoidance: true|false
    intent_lift_vs_anchor: -2..+2
    confidence: 0.0–1.0
    [VERIFY]_flags: [<dims where score derived from <50% literal overlap>]
predicted_winner: <variant_id>
forecast_recorded: false

Output mode: directional only

Per iter-3 debrief: synthetic panels rank-invert on narrow high-intent product LPs at low N. Treat the read as direction, not verdict. Live calibration follows.

Composes with

Refusal patterns

Common failure modes

Calibration

Forecast recorded BEFORE live result. Brier score the panel's predicted_winner against the live winner. Track per-persona calibration across many bets. Personas with poor calibration get retired or refined.

Open the interactive view →