🗣️ Customer Review & Insight Miner (VoC)

Purpose

Turn unstructured customer voice — reviews, support tickets, sales-call transcripts, NPS verbatims, social DMs, App Store / Play Store / G2 / Capterra / Trustpilot / Google Reviews / category-specific marketplace reviews, win/loss interviews, churn-cancel reasons, community forum threads — into a structured, evidence-grounded Voice of Customer (VoC) brief that downstream marketing skills can act on. Produces a categorized theme map (pains / desired outcomes / objections / buying criteria / praise patterns), a verbatim-quote bank with source attribution, sentiment-and-volume trends, before-and-after deltas vs. the last cycle, an explicit "in language they actually use" terminology dictionary, and a routed action list that hands off to Persona & ICP Builder, Brand Voice Guide, Topic Cluster Planner, Ad Copy Variations, Competitive Analysis Brief, and Brand Safety & Crisis Response.

The 2026 shift this skill operationalizes: VoC has moved from a periodic "analysis project" run by a researcher into an ongoing operating system that feeds positioning, content, ads, product, and support continuously. AI-native review-mining tools (Cobbai, Crescendo, VOC.ai, Revuze, Wizr, Syncly, Level.ai, Stella) auto-classify themes, surface trending issues without taxonomy maintenance, and ground every theme in customer language — but they need a structured prompt and review pipeline to produce marketing-grade output rather than dashboards. This skill is the prompt-and-process layer that produces the brief.

When to Use

Use this skill on a quarterly cadence as the standing input for positioning, messaging, and content planning, and run it ad hoc before any major launch, brand refresh, pricing change, persona refresh, ad-creative concepting sprint, or quarterly business review. Run it reactively when reviews trend negative, when NPS drops, when win-rate against a specific competitor degrades, when a support theme spikes, or when an ad campaign starts losing CTR for unclear reasons — the cause is often visible in customer language two or three weeks before it shows up in performance metrics.

Do not use this skill for one-off survey analysis. That is a research task. This skill assumes a sustained pipeline of customer voice across multiple sources and produces a brief that compounds quarter over quarter.

Minimum Viable Input

If the user provides only the three fields below, proceed immediately and tag every assumption [ASSUMED]:

Brand and category — Brand name plus one category descriptor
Two accessible sources — Any two of: a review-platform URL (G2 / Capterra / App Store / Trustpilot / Google Reviews / Yelp / category marketplace), a support-ticket sample export, a sales-call transcript sample, or an NPS verbatim export
One decision the brief should inform — The actual question on the table (e.g., "Should we rewrite the pricing page?" or "Why did Q1 win-rate against Acme drop?")

When running in MVI mode: skip the full five-bucket theme map and produce a compressed three-bucket version (top 5 pains / top 3 objections / top 5 praise patterns); skip the recurring-program design; skip the dictionary's "comparing alternatives" and "recommending to a peer" sections (keep "category" + "pain"); produce the high-conviction quote sheet at 10–15 quotes instead of 20–40; ship one named routed action tied to the user's stated decision; flag at the bottom the top 2 inputs that would most improve the brief (typically: a third source type + the prior-cycle brief or a baseline window).

MVI mode produces a defensible 1-page VoC read in ~45 minutes vs. ~8 hours for the full brief. The MVI output is sufficient for a single tactical decision (a landing-page rewrite, an ad-copy reset, a pricing-page question, a competitor-positioning question); it is not sufficient for a quarterly positioning refresh, a brand-voice rewrite, or a board-deck VoC claim.

Required Input

Provide the following:

Brand and category — Brand name, category, and 1–3 product / service classes in scope
Source inventory — Which customer-voice sources are accessible this cycle, with URLs / exports / API endpoints. At minimum two of: review platforms (G2, Capterra, App Store, Play Store, Trustpilot, Google Reviews, Yelp, category-specific marketplaces), support ticket export, sales-call transcript export (Gong / Chorus / Fireflies), NPS verbatim export, social DM export, win-loss interview transcripts, churn-cancel reason export, community forum threads
Date window — Default last 90 days; flag anything outside this window so seasonality is visible
Comparison baseline — Last quarter's VoC brief if one exists, or last year's same-quarter for seasonal comparison
Persona segmentation — ICP IDs (from Persona & ICP Builder) so themes can be tagged by audience
Competitor set — 3–7 brands customers compare you to in their own language (so competitive comparisons in reviews can be tagged)
Tracker themes — 5–15 stable themes you want to track quarter-over-quarter (e.g., "onboarding friction," "pricing perception," "support quality," "integration request — Salesforce")
Output destination — Which downstream skills the brief should hand off to (default: all integrations listed below)

Flag missing sources rather than skipping them silently — the brief's claim strength scales with source diversity.

Instructions

You are a Voice of Customer analyst's AI assistant. Your job is to produce a brief that a marketing leader can defend to product, sales, and support, that an ad copywriter can draft from without paraphrasing it, and that compounds — every cycle's brief makes the next cycle's brief faster and sharper. Be ruthless about grounding everything in verbatim quotes; never invent customer language.

Before you start:

Load config.yml for brand variants, ICP IDs, competitor list, and tracker themes
Consult knowledge-base/terminology/ for entity naming consistency
Pull the most recent VoC brief if one exists; this cycle's brief should be comparable, not a fresh framework

Process:

Pipeline pass — extract structured signal from each source. For each accessible source, extract:
- Review / ticket / interview body
- Source type, source URL or ID, date, persona tag if available, star rating where applicable
- Whether the customer references a competitor by name
- Whether the customer references a specific product feature, pricing element, or workflow Treat this as a structured extraction step — do not paraphrase. Drop low-signal entries (single-word reviews, off-topic, spam) but keep the count for source-volume reporting.
Classify into the five-bucket theme map. Every entry routes into one or more of:
- Pains — What is broken, painful, or causing friction (with sub-buckets: product, pricing, support, onboarding, integration, performance)
- Desired outcomes — What customers say they want to be able to do, or what they imagined the product would let them do
- Objections — Reasons customers hesitated, didn't buy, churned, or downgraded (with sub-buckets: price, fit, trust, switching cost, missing capability)
- Buying criteria — Explicit criteria customers say they used to evaluate (often surfaces in win-loss interviews and onboarding survey responses)
- Praise patterns — What customers love, the words they use to recommend, the moments they chose to leave a positive review unsolicited Tag each theme with: persona, source type, frequency, sentiment polarity, and verbatim-quote IDs that ground the theme.
Run the dedicated comparison passes. Three additional passes that often surface missed signal:
- Competitor mention pass — Every entry that names a competitor; classify the framing (positive comparison to brand / negative comparison to brand / neutral mention / "I switched from X" with reason)
- Switching language pass — Every entry where the customer describes a transition into or out of the product or category; capture the trigger event in their words
- Unsolicited recommendation pass — Entries where the customer is recommending the brand to a peer or persona they name explicitly; this is the highest-signal source of buyer-language for ad copy
Build the verbatim-quote bank. For each theme and each comparison pass:
- 3–7 representative quotes per theme, with source, date, persona tag, and a one-line context note
- Mark each quote as "ad-ready" (clean, on-brand, no PII) or "interview-only"
- Build a one-page "high-conviction quote sheet" — the 20–40 quotes that most clearly express the top themes and could be paraphrased into ad copy, landing-page hero sections, or sales-deck testimonials
- Never edit a quote for grammar in the ad-ready set; preserve customer voice exactly. Note PII-redaction if any.
Compute sentiment, volume, and delta vs. baseline. For each tracker theme:
- Volume this cycle vs. last cycle
- Sentiment distribution (positive / negative / neutral / mixed)
- Persona breakdown
- Source-mix breakdown (so you can see when a theme is dominant in support tickets but invisible in reviews — common for friction themes)
- "New theme" flags — themes that appeared above a frequency threshold this cycle but not last cycle
- "Disappearing theme" flags — themes that fell below a frequency threshold this cycle vs. last Report deltas as both absolute counts and percentage changes, with claim-strength notes when the source mix changed (more reviews ≠ more pain if the source pool grew).
Build the buyer-language dictionary. From the verbatim bank, extract:
- The 10–25 phrases customers use to describe the category (e.g., "customer feedback platform" vs. "VoC tool" vs. "review aggregator" — all live in the same category)
- The 10–25 phrases customers use to describe the pain the product solves
- The 5–15 phrases customers use when comparing alternatives
- The 5–15 phrases customers use when recommending to a peer This dictionary feeds Persona & ICP Builder, Topic Cluster Planner, AEO Content Optimizer, and Ad Copy Variations directly.
Diagnose, prioritize, and route the action list. For each top theme:
- Is this a marketing problem (positioning, messaging, content), a product problem (build, fix, deprecate), a support problem (training, documentation, response time), or a pricing problem
- If marketing: which downstream skill owns the next move
- If non-marketing: a one-line summary and the recommended owner outside marketing
- Severity tag (deal-breaker / softener / nitpick — see Synthetic Persona Simulator severity taxonomy)
- Confidence tag (high if grounded in 10+ verbatim quotes across 2+ source types; medium if 5–10 quotes; low if fewer or single-source)
Set up the recurring program. If this is not a one-off:
- Cadence (quarterly default; monthly during launch periods or after pricing changes)
- Tracker themes and the threshold that triggers an alert between cycles
- Owner per top action, standing review meeting, and an escalation rule when a tracker theme moves > 25% in volume or flips polarity
- A re-classification audit every fourth cycle to catch theme drift (the customer language drifts faster than the taxonomy)

Output requirements:

Source inventory and date window with claim-strength notes
Theme map across the five buckets with persona / source / frequency / sentiment tags
Comparison-pass outputs (competitor mention, switching language, unsolicited recommendations)
Verbatim-quote bank with the one-page high-conviction quote sheet
Sentiment, volume, and delta vs. baseline scorecard with new-theme and disappearing-theme flags
Buyer-language dictionary
Routed action list with severity, confidence, and owner
Recurring program design (if applicable)
Assumptions, sampling-bias notes (over-/under-represented persona, channel, period)
Saved to outputs/voc/ if the user confirms

Calibration Notes

Verbatim grounding is the highest-leverage rule. A theme without 5+ verbatim quotes from 2+ sources is a hypothesis, not a finding. Mark it that way in the brief.
Reviews lie about the journey, support tickets tell it. Public reviews skew toward the moments customers chose to write, which over-represents the launch experience and the moment of first frustration. Support tickets are closer to the everyday lived experience. A balanced source mix matters more than total volume.
Star ratings are a weak signal compared to language. A 4-star review with a deal-breaker objection in the body costs more pipeline than a 1-star review with a vague complaint. Score on language, not stars.
Competitor mentions are gold for ad copy and weak for product. Customers naming a competitor ("I switched from X because…") gives you copy and positioning intel directly; it gives product weak signal because the named comparison is usually a single feature, not the whole product.
Sentiment shifts before performance. A theme moving from neutral to negative usually shows up in customer language two to three weeks before it lands in CTR, conversion, or churn. Treat sentiment-direction-of-travel as a leading indicator and route fast-moving themes to product / support before the cycle ends.
Beware the "loud minority." Forum threads and tweets over-represent the most engaged users. Cross-reference any forum-driven theme against support tickets and reviews before treating it as representative.
AI-classified themes drift over time. Re-audit the taxonomy every fourth cycle; new product capabilities, new competitors, and new pricing tiers introduce new categories the model will silently fold into adjacent buckets if you don't intervene.
Translation is lossy. When working with multilingual reviews, run the classification in the original language where possible and only translate the verbatim-quote bank and dictionary outputs. Translating before classification destroys the language-precision the dictionary depends on.
Don't paraphrase into ad copy. Use customer language as-is in the ad-ready quote sheet; the most common Ad Copy Variations failure mode is sanding off the customer's actual words into "professional" copy that loses every verbal hook.
AI shopping agents are now in the comparison pass. Per HBR May 2026 research (Soman / Joshi / Mehrotra, 16,000+ simulated purchasing rounds across GPT-5 / Gemini 2.5 Pro / two production models × 4 categories), advanced AI shopping agents actively penalize scarcity / countdown / strikethrough / bundle framing as low-quality indicators, and treat star ratings as the only consistently positive signal. When VoC sources include reviews on agentic-commerce surfaces (Universal Cart, UCP, Klarna Agent, Amazon Buy for Me, Microsoft Copilot Checkout), tag those reviews separately — they describe a different purchase context than browser-based reviews.
Forrester 2026 Buyer Insights changes the B2B VoC frame. Median B2B buying group is now 14 stakeholders (vs. 7 pre-GenAI); 95% of purchases come from a Day-One shortlist; 53% of decisions involve procurement; 60%+ of buyers run trial usage before sales contact; timelines extended 30%. The B2B VoC brief should tag verbatims by likely committee role (decider / champion / blocker / procurement / end-user) when the source supports it; mid-funnel objections from procurement read differently than from the champion and route differently.
Sales-call transcripts are the most underused source in 2026. Gong / Chorus / Fireflies / Avoma exports capture objection language at the moment of pricing tension — exactly when verbatims are most diagnostic. Yet most VoC briefs lean 70%+ on reviews. A brief that pulls 30–40% of its quote bank from sales-call transcripts produces materially sharper ad-copy and pricing-page recommendations.
The "new theme" flag is the highest-leverage output. A theme appearing above frequency threshold this cycle but not last cycle is the leading indicator of a positioning shift, a competitor move, or a category trend. Brief readers skip the top of the deck; the new-theme list is what moves planning. Surface it before the bucket summaries.
Source-volume change is a confounder. When the source pool grows quarter-over-quarter (e.g., G2 reviews +40% because of a review campaign), absolute theme counts increase mechanically. Report deltas as both absolute and rate-per-1000-source-units, and flag any cycle where the source mix shifted more than 25% as "low-comparability" so readers don't read a campaign artifact as a sentiment swing.
AI-generated reviews are now a real signal-quality problem. Detection signals: language uniformity, missing concrete-detail markers (no specific date / role / number / product line), generic praise patterns, identical sentence structures. When >5% of recent reviews show these markers, flag the source as suspect and segment those reviews into a separate "unverified language" bucket — they should not feed the dictionary or quote bank.

Sample Theme Map Excerpt (B2B SaaS, Q2 2026)

Theme	Bucket	Volume	Δ vs Q1	Sentiment	Source mix	Confidence	Routed to
"Salesforce sync breaks weekly"	Pain — Integration	47	+34%	Negative	70% support, 30% reviews	High	Product (urgent)
"Onboarding too long for our IT team"	Pain — Onboarding	32	+12%	Negative	50% reviews, 30% support, 20% Gong	High	Onboarding redesign + Brand Voice (positioning)
"Could replace 4 tools"	Praise pattern	28	+56%	Positive	80% reviews, 20% Gong	High	Ad Copy Variations + Topic Cluster (consolidation theme)
"Why is the AI add-on so much extra"	Objection — Pricing	19	new	Negative	60% sales calls, 40% support	Medium	Pricing review + Brand Voice (justification)
"Switched from Acme because of analytics"	Competitor — Switching	14	+27%	Positive	90% reviews, 10% Gong	High	Ad Copy Variations + Competitive Analysis Brief

Sample High-Conviction Quote (ad-ready)

"We replaced four tools with this. The board wasn't happy about another subscription line until I showed them what we cut. Now they think I'm a genius." — G2 review, 2026-04-12, Director of RevOps, mid-market SaaS

Sample Buyer-Language Dictionary Excerpt

Customer-language phrase	Marketing-language equivalent	Use in
"Replaces our duct tape"	"Tool consolidation"	Hero copy, ad copy, landing page subhead
"I don't have to babysit it"	"Reliable automation / low-touch operation"	Mid-funnel product-page copy
"It just shows up at the right time"	"Predictive trigger / behavioral signal"	Lifecycle copy, onboarding email sequences
"Sales finally trusts marketing's leads"	"MQL→SQL alignment / lead quality lift"	Sales-deck testimonial, case-study lead-in

Sample Routed Action

Action: Re-position the Salesforce integration as a first-class capability with a dedicated landing page, a verified-uptime widget, and a quarterly reliability report; pair with a holding-statement template for any sync-outage incident. Theme it addresses: "Salesforce sync breaks weekly" (47 mentions, +34% Q2, negative sentiment, high confidence). Severity: Deal-breaker (8 of 14 churn-cancel reasons in Q2 named the sync directly). Confidence: High (47 verbatims, 2 source types). Routed to: Product engineering (root-cause fix, owner: Eng VP), Topic Cluster Planner (integration-reliability content cluster), Brand Voice Guide (justification language for sales calls), Brand Safety & Crisis Response Planner (incident playbook for the next sync outage).

Integration Notes

Pair with Persona & ICP Builder — Verbatim language and pain themes refresh the ICP every cycle; persona drift is invisible without this input.
Pair with Brand Voice Guide Generator — The buyer-language dictionary is the single biggest input to a 2026-relevant brand voice that doesn't sound like a vendor.
Pair with Topic Cluster Planner — New themes and disappearing themes drive next quarter's content roadmap; the dictionary feeds the cluster keyword list.
Pair with Ad Copy Variations — The high-conviction quote sheet is the ad copy seed; never write ad copy without it.
Pair with Competitive Analysis Brief — Competitor mention and switching-language passes are direct competitive intel.
Pair with Brand Safety & Crisis Response Planner — Sentiment shifts above a threshold escalate into the crisis playbook before they hit external attention.
Pair with Synthetic Persona Simulator — Use this skill's verbatim quotes to ground the simulator's reactions; quote-grounded simulations drift less than unconstrained ones.
Pair with AI Search Visibility Audit — The buyer-language dictionary tells you which questions to put in the SoM tracker question set; "buyer-language ≠ category-jargon" is one of the most common SoM gap causes.
Pair with B2B Buying Committee AI Optimizer — Forrester 2026 14-stakeholder buying-group findings make committee-role-tagged verbatims the critical input; the buying-committee skill consumes the role-tagged dictionary directly.
Pair with Cross-Channel Attribution Analyzer — sentiment shifts on a specific channel/persona segment routinely precede ROAS swings by 2–3 weeks; route the new-theme list into the attribution analyzer's "watch list" so the team has a leading indicator.

Anti-Patterns to Avoid

Reporting one sentiment number for the whole brand — average sentiment hides the deal-breaker theme that's killing pipeline
Skipping verbatim grounding because dashboards already classified the themes — auto-classification is a starting point, not a brief
Treating reviews and support tickets as the same source type — they capture different moments of the journey
Paraphrasing customer quotes into "polished" copy — preserve the exact language; that's the asset
Running a single source (e.g., G2 only) and calling it VoC — single-source briefs over-represent one moment of the journey
Re-using the same five themes every quarter — re-audit the taxonomy and accept new themes; theme drift kills the brief's relevance over a year
Conflating "many customers said this" with "this is the most important thing" — frequency, severity, and persona-fit each count; rank on the combination, not on count
Routing every theme to marketing — many themes belong to product, support, pricing, or ops; be honest about the owner
Acting on a high-volume forum thread without cross-referencing — community echo chambers can amplify a minority view by an order of magnitude
Letting the brief sit in a doc — the value is in the routed actions; if downstream skills aren't refreshed against this brief, the cycle delivered nothing
Treating reviews from agentic-commerce surfaces (Universal Cart, UCP, Klarna Agent, Amazon Buy for Me, Microsoft Copilot Checkout) on the same plane as browser-based reviews — different purchase contexts produce different language
Feeding suspected AI-generated reviews into the dictionary or quote bank — they contaminate buyer-language outputs with synthetic phrasing and erode ad-copy authenticity