The Claude collapse generalizes. The Gemma collapse does not.
37-firm Austin cohort, 18 hash-locked questions, 3+ usable runs on 2026-05-27. First non-Hawaii measured category, built to test whether the Hawaii CPA training-data engine collapse generalizes. It splits.
Why this measurement existed
The Hawaii CPA teardown (published 2026-05-26) surfaced what looked like a category-level pattern: training-data AI engines (Claude and Gemma) cite Hawaii CPA firms at near-zero rates (1% and 2% respectively), in contrast to 4 other measured categories where training-data engines were the highest own-share tier. The pattern was documented at low confidence (n=1) because we had only measured Hawaii CPAs in that category.
The cross-geo question is direct: does the collapse generalize, or is it Hawaii-specific? Austin TX was chosen as the test bed because it is a comparable market (mid-size professional-services hub) with a direct structural analog (Texas Society of CPAs vs Hawaii Society of CPAs). Same question shape, geo-swapped from Hawaii/Honolulu to Austin. Same 7 AI engines, same 3 reps, same hash-locked discipline.
The result split the pattern into two findings, neither of which we could have surfaced from one geo alone.
Methodology summary
Same 7-AI-tool methodology applied across all NeverRanked teardowns:
- 5 web-searching AI tools: Perplexity (sonar API), ChatGPT search (gpt-4o-mini-search-preview), Gemini grounded (gemini-2.5-flash with Google Search), Microsoft Copilot via Bing organic results, Google AI Overviews.
- 2 training-data AI tools: Claude (claude-haiku-4-5, via Anthropic API), Gemma (open-weight, via Together AI).
18 questions an Austin CPA buyer would actually ask AI, locked at hash 17fec069.... The question set is structurally identical to the Hawaii CPA set (8 head + 10 long-tail) with geographic terms swapped (Hawaii/Honolulu/Bishop Street replaced with Austin/downtown Austin/Texas equivalents). 3 repetitions per question per AI tool. 3+ usable runs on 2026-05-27. Pattern-readiness rule of 3 usable runs cleared per MOAT.md rule 5.
The 37-firm cohort was built in four passes. 5 anchor firms registered before run 1 (one with a corrected domain after run #1 surfaced the actual primary domain, same pattern as Hawaii where kmhcpas.com surfaced as kmhllp.com). 16 additional firms surfaced through the run #1 within-citation scan. 9 more surfaced after run #2 cohort-coverage. 8 final additions after run #3. National Big-X firms (KPMG, PwC, Deloitte, EY, BDO, RSM, CLA, Cherry Bekaert, Weaver, Withum, CohnReznick) and out-of-state firms were excluded from the cohort to keep the comparison Austin-specific. Texas Society of CPAs (state association), city/state surfaces, gig platforms, and review aggregators were deliberately not registered as competitors; they appear in the data as third-party content sources.
Full methodology + open-source measurement code at /methodology/.
Source-type distribution (cohort-wide)
Across all 37 firms and all 7 AI tools, 7,738 total citations, AI pulled answers from these source types:
| Source type | % of mentions | Count |
|---|---|---|
| Independent web (third-party content) | 54% | 4,174 |
| Competitor (firm-owned websites) | 40% | 3,127 |
| Review directories | 3% | 195 |
| Wikipedia | 2% | 132 |
| Social (LinkedIn, Facebook, Instagram) | 1% | 54 |
| 1% | 42 |
The 40% firm-owned share is comparable to the Hawaii CPA 39% number. Both Austin and Hawaii CPA categories sit at the bottom of the professional-services cluster on the web-searching surface; they're roughly equivalent on that axis. The divergence between the two geographies is concentrated entirely in the training-data engines (next section).
Per-AI-tool breakdown, the cross-geo split
| AI tool | Austin own-share | Hawaii own-share | Cross-geo finding |
|---|---|---|---|
| Perplexity | 61% | 46% | Both geos high; Austin higher |
| Gemini grounded | 61% | 59% | Both geos high; similar |
| Google AI Overviews | 50% | 45% | Both geos mid-high; similar |
| ChatGPT search | 45% | 60% | Both geos mid-high; Hawaii higher |
| Microsoft Copilot (Bing) | 0% | 2% | Universal cohort-wide gap holds |
| Gemma (training data) | 23% | 2% | SPLIT: Austin 11x higher |
| Claude (training data) | 0% | 1% | Collapse generalizes |
Two things to read from this table at once.
The web-searching engines are roughly comparable across the two geographies. Perplexity, Gemini, Google AIO, and OpenAI all reach between 45% and 61% own-share in both Hawaii and Austin. There are small differences (Austin Perplexity is 61% vs Hawaii 46%; Hawaii OpenAI is 60% vs Austin 45%) but the cluster is the same shape. Web-searching engines find CPA firms in both geographies through similar mechanisms.
The training-data engines diverge dramatically. Claude cites Austin CPAs 0% of the time and Hawaii CPAs 1% of the time. The collapse holds. Gemma cites Austin CPAs 23% of the time and Hawaii CPAs 2% of the time. The collapse breaks. Same query shape, same week, same methodology. The difference is geographic, not categorical.
The Claude finding is the more important of the two, because it generalizes. The category-level claim (Claude under-cites CPA firms regardless of geo) is now supported by two distinct geographies. The original Hawaii-only observation was at low confidence (n=1); the cross-geo data promotes it to medium confidence.
The Gemma finding is the more surprising. The same model, the same questions, the same week. The only difference is the geography embedded in the queries. Gemma's training corpus appears to know Texas CPA firms substantially better than Hawaii CPA firms. This is a geo-density finding about the training corpus, not a category finding about CPAs. Hawaii's smaller business-news footprint relative to Texas is a plausible explanation; we observe the difference without claiming the cause.
What the split means for an Austin CPA firm
For an Austin CPA firm specifically, the recommendation is materially different from the Hawaii CPA recommendation. Claude is the same kind of category-wide blind spot for Austin firms as it is for Hawaii firms; do not invest in trying to move Claude's needle on a short timeline. Gemma is reachable for Austin firms in a way it is not for Hawaii firms; Gemma's 23% own-share means sustained editorial presence on the web ingestible by Gemma's training cycle does move the needle for Austin CPA firms specifically. The web-searching engines (Perplexity 61%, Gemini 61%, Google AIO 50%, OpenAI 45%) are the primary closable surface; Microsoft Copilot is the universal first-mover opening as in every measured category.
Top recurring firms (anonymized)
The 5 firms AI cited most often across the 18 questions and 7 tools.
| Firm (anonymized) | Total mentions | % of cohort competitor share | Runs cited in |
|---|---|---|---|
| Firm A | 479 | 15% | 4/4 |
| Firm B | 269 | 9% | 4/4 |
| Firm C | 261 | 8% | 4/4 |
| Firm D | 205 | 7% | 4/4 |
| Firm E | 192 | 6% | 4/4 |
The top 5 firms account for 45% of all firm-owned mentions (1,406 of 3,127). Slightly more concentrated than Hawaii CPA (35% top-5 share). Firm A leads the cohort by roughly 2x the second-place firm. The remaining 32 firms have meaningful mention counts but lower-frequency presence, with most landing in the single-digit to low-hundred range across the 4 measurement windows.
Where AI pulls from when it cites non-firm content
The 4,174 third-party-content mentions are not all the same shape. Top recurring sources across runs:
| Source | Mentions | Why AI cites it |
|---|---|---|
| Cherry Bekaert (cbh.com, national top-25) | 166 | National firm with Austin office (excluded from local cohort) |
| Google (google.com) | 161 | AI engine grounding artifact (links back to Google) |
| CliftonLarsonAllen (claconnect.com, national) | 138 | National firm with Austin office (excluded from local cohort) |
| CX Research Institute (cxresearchinstitute.org) | 120 | Research organization producing CPA-firm rankings content |
| City of Austin (austintexas.org) | 104 | State/city tourism and reference content |
The top recurring third-party sources are dominated by national firms with Austin offices (Cherry Bekaert, CLA) and generic Austin-area reference content. There is no single dominant editorial trust layer the way HSCPA dominates the Hawaii CPA third-party tier (HSCPA appeared 130 times in Hawaii CPA data). The Texas Society of CPAs (tscpa.org) is registered as excluded from the cohort but did not surface as a top-cited source for these queries; the Austin third-party landscape is more fragmented than the Hawaii third-party landscape for the same category.
What this teardown does and does not prove
What it does support:
- The 37-firm cohort is a representative slice of Austin TX CPA firms AI tools actually cite for buyer-shaped queries.
- The 40%/54% firm-own / third-party split is stable across the expanded cohort and 4 measurement windows.
- The Claude training-data collapse on CPA firms generalizes across two distinct geographies (Hawaii 1%, Austin 0%). Two-geo confidence is medium; a third geo measurement would strengthen or break the pattern.
- Gemma's CPA-firm coverage is geo-specific rather than category-specific (Hawaii 2%, Austin 23%). The original "training-data engine collapse" pattern from the Hawaii CPA teardown overstated the symmetry between Claude and Gemma; the cross-geo measurement corrected the overstatement.
- Microsoft Copilot's 0% own-share holds in Austin as it does in every other measured category. Universal pattern (n=6 categories now).
- The web-searching engines are the primary closable AEO surface for Austin CPA firms (40-61% own-share band depending on engine).
What it does not yet support:
- That the Claude collapse holds outside the two measured geographies. A third (and fourth, etc.) measured CPA geo would strengthen the cross-geo claim. As of this teardown, the pattern is observable in Hawaii and Texas; it has not been measured elsewhere.
- That Gemma's 23% Austin own-share is the ceiling. Cohort expansion could move it slightly up or down on additional runs.
- That the geo-density hypothesis (Gemma knows larger-market business content better) is the cause of the Gemma split. We observe the split; we infer the cause. A measurement of a CPA category in a third geography of different population density would test the hypothesis directly.
- That changing a firm's content, directory presence, or third-party listings would cause AI to cite differently. We measured what AI cites. Causation requires pre-registered experiments. Different scope.
Why this is anonymized
None of the 37 firms in this cohort are paying NeverRanked customers. The non-customer anonymization rule applies: counts, distributions, source-type breakdowns, and per-AI-tool numbers are public; individual firm names are not. The pattern is what is informative on a public surface. The named cohort lives only inside paid engagement deliverables, where the named firm is the customer authorizing the use.
A firm that becomes a NeverRanked customer gets a 1:1 deliverable that names every firm in the cohort, names the queries the customer is missing on, and ranks the closable conditions. That deliverable is private to the customer.
Measurement window: 4 manifest-recorded runs on 2026-05-27 (3 deliberate runs + 1 partial-recovery run that completed cleanly). Pattern-readiness rule of 3 runs cleared per MOAT.md rule 5. Refresh cadence is monthly or on customer request.
Substantiation: question set locked by hash 17fec069..., open-source measurement code, named AI tools on named dates. The fact-checker (also public source) rejected zero claims in this teardown.
Anonymization: the 37-firm cohort is kept anonymized at the firm level per the non-customer rule. Counts, distributions, and named third-party sources (Cherry Bekaert, CLA, CX Research Institute) are public because they are categorically named already and the substantiation value depends on naming the specific structural surfaces AI uses.