NeverRanked · Teardown 07 · Austin TX CPA firms

The Claude collapse generalizes. The Gemma collapse does not.

37-firm Austin cohort, 18 hash-locked questions, 3+ usable runs on 2026-05-27. First non-Hawaii measured category, built to test whether the Hawaii CPA training-data engine collapse generalizes. It splits.

The headline finding in one sentence: across 37 Austin TX CPA firms and 7 AI tools, the same query shape that measured Hawaii CPA firms last week produces a split-result on the training-data engines. Claude cites Austin CPA firms 0% of the time (3 of 885 cites) and Hawaii CPA firms 1% of the time. The collapse generalizes across geographies. Gemma cites Austin CPA firms 23% of the time but Hawaii CPA firms only 2%. Gemma's coverage of CPA firms is geo-specific, not category-specific. Microsoft Copilot's 0% own-share holds in both geographies as the universal cohort-wide gap.

Why this measurement existed

The Hawaii CPA teardown (published 2026-05-26) surfaced what looked like a category-level pattern: training-data AI engines (Claude and Gemma) cite Hawaii CPA firms at near-zero rates (1% and 2% respectively), in contrast to the other categories we had measured at that point, where training-data engines were the highest own-share tier. The pattern was documented at low confidence because we had only measured Hawaii CPAs in that category.

The cross-geo question is direct: does the collapse generalize, or is it Hawaii-specific? Austin TX was chosen as the test bed because it is a comparable market (mid-size professional-services hub) with a direct structural analog (Texas Society of CPAs vs Hawaii Society of CPAs). Same question shape, geo-swapped from Hawaii/Honolulu to Austin. Same 7 AI engines, same 3 reps, same hash-locked discipline.

The result split the pattern into two findings, neither of which we could have surfaced from one geo alone.

Methodology summary

Same 7-AI-tool methodology applied across all NeverRanked teardowns:

5 web-searching AI tools: Perplexity, ChatGPT search, Gemini (grounded with live web search), Microsoft Copilot via Bing organic results, Google AI Overviews.
2 training-data AI tools: Claude (via the Anthropic API), Gemma (open-weight, run on a model-hosting provider so the model itself is independently inspectable).

18 questions an Austin CPA buyer would actually ask AI, locked at hash 17fec069.... The question set is structurally identical to the Hawaii CPA set (8 head + 10 long-tail) with geographic terms swapped (Hawaii/Honolulu/Bishop Street replaced with Austin/downtown Austin/Texas equivalents). 3 repetitions per question per AI tool. 3+ usable runs on 2026-05-27. Pattern-readiness rule of 3 usable runs cleared per the internal pattern-readiness rule.

The 37-firm cohort was built in four passes. 5 anchor firms registered before run 1 (one with a corrected domain after run #1 surfaced the actual primary domain, same pattern as Hawaii, where one firm's registered domain turned out to be a secondary domain and was corrected after run #1). 16 additional firms surfaced through the run #1 within-citation scan. 9 more surfaced after run #2 cohort-coverage. 8 final additions after run #3. National Big-X firms (KPMG, PwC, Deloitte, EY, BDO, RSM, CLA, Cherry Bekaert, Weaver, Withum, CohnReznick) and out-of-state firms were excluded from the cohort to keep the comparison Austin-specific. Texas Society of CPAs (state association), city/state surfaces, gig platforms, and review aggregators were deliberately not registered as competitors. They appear in the data as third-party content sources.

Full methodology documented at /methodology/. The question set is locked by hash and every run is dated on the claims ledger.

Source-type distribution (cohort-wide)

Across all 37 firms and all 7 AI tools, 7,738 total citations, AI pulled answers from these source types:

Source type	% of mentions	Count
Independent web (third-party content)	54%	4,174
Competitor (firm-owned websites)	40%	3,127
Review directories	3%	195
Wikipedia	2%	132
Social (LinkedIn, Facebook, Instagram)	1%	54
Reddit	1%	42

The 40% firm-owned share is comparable to the Hawaii CPA 39% number. Both Austin and Hawaii CPA categories sit at the bottom of the professional-services cluster on the web-searching surface, roughly equivalent on that axis. The divergence between the two geographies is concentrated entirely in the training-data engines (next section).

Per-AI-tool breakdown, the cross-geo split

AI tool	Austin own-share	Hawaii own-share	Cross-geo finding
Perplexity	61%	46%	Both geos high, Austin higher
Gemini grounded	61%	59%	Both geos high, similar
Google AI Overviews	50%	45%	Both geos mid-high, similar
ChatGPT search	45%	60%	Both geos mid-high, Hawaii higher
Microsoft Copilot (Bing)	0%	2%	Universal cohort-wide gap holds
Gemma (training data)	23%	2%	SPLIT: Austin 11x higher
Claude (training data)	0%	1%	Collapse generalizes

Two things to read from this table at once.

The web-searching engines are roughly comparable across the two geographies. Perplexity, Gemini, Google AIO, and OpenAI all reach between 45% and 61% own-share in both Hawaii and Austin. There are small differences (Austin Perplexity is 61% vs Hawaii 46%. Hawaii OpenAI is 60% vs Austin 45%) but the cluster is the same shape. Web-searching engines find CPA firms in both geographies through similar mechanisms.

The training-data engines diverge dramatically. Claude cites Austin CPAs 0% of the time and Hawaii CPAs 1% of the time. The collapse holds. Gemma cites Austin CPAs 23% of the time and Hawaii CPAs 2% of the time. The collapse breaks. Same query shape, same week, same methodology. The difference is geographic, not categorical.

The Claude finding is the more important of the two, because it generalizes. The category-level claim (Claude under-cites CPA firms regardless of geo) is now supported by two distinct geographies. The original Hawaii-only observation was at low confidence (n=1). The cross-geo data promotes it to medium confidence.

The Gemma finding is the more surprising. The same model, the same questions, the same week. The only difference is the geography embedded in the queries. Gemma's training corpus appears to know Texas CPA firms substantially better than Hawaii CPA firms. This is a geo-density finding about the training corpus, not a category finding about CPAs. Hawaii's smaller business-news footprint relative to Texas is a plausible explanation. We observe the difference without claiming the cause.

What the split means for an Austin CPA firm

For an Austin CPA firm specifically, the recommendation is materially different from the Hawaii CPA recommendation. Claude is the same kind of category-wide blind spot for Austin firms as it is for Hawaii firms. Do not invest in trying to move Claude's needle on a short timeline. Gemma already cites Austin firms in a way it does not cite Hawaii firms. Gemma's 23% own-share means sustained editorial presence on the web ingestible by Gemma's training cycle is a plausible lever for Austin CPA firms specifically. We measure what AI cites and do not promise a lift. The web-searching engines (Perplexity 61%, Gemini 61%, Google AIO 50%, OpenAI 45%) are the surface where firm-owned content already shows up most. Microsoft Copilot is the cohort-wide 0% gap that holds in every measured category.

Top recurring firms (anonymized)

The 5 firms AI cited most often across the 18 questions and 7 tools.

Firm (anonymized)	Total mentions	% of cohort competitor share	Runs cited in
Firm A	479	15%	4/4
Firm B	269	9%	4/4
Firm C	261	8%	4/4
Firm D	205	7%	4/4
Firm E	192	6%	4/4

The top 5 firms account for 45% of all firm-owned mentions (1,406 of 3,127). Slightly more concentrated than Hawaii CPA (35% top-5 share). Firm A leads the cohort by roughly 2x the second-place firm. The remaining 32 firms have meaningful mention counts but lower-frequency presence, with most landing in the single-digit to low-hundred range across the 4 measurement windows.

Where AI pulls from when it cites non-firm content

The 4,174 third-party-content mentions are not all the same shape. Top recurring sources across runs:

Source	Mentions	Why AI cites it
Cherry Bekaert (cbh.com, national top-25)	166	National firm with Austin office (excluded from local cohort)
Google (google.com)	161	AI engine grounding artifact (links back to Google)
CliftonLarsonAllen (claconnect.com, national)	138	National firm with Austin office (excluded from local cohort)
CX Research Institute (cxresearchinstitute.org)	120	Research organization producing CPA-firm rankings content
City of Austin (austintexas.org)	104	State/city tourism and reference content

The top recurring third-party sources are dominated by national firms with Austin offices (Cherry Bekaert, CLA) and generic Austin-area reference content. There is no single dominant editorial trust layer the way HSCPA dominates the Hawaii CPA third-party tier (HSCPA appeared 130 times in Hawaii CPA data). The Texas Society of CPAs (tscpa.org) is registered as excluded from the cohort but did not surface as a top-cited source for these queries. The Austin third-party landscape is more fragmented than the Hawaii third-party landscape for the same category.

What this teardown does and does not prove

What it does support:

The 37-firm cohort is a representative slice of Austin TX CPA firms AI tools actually cite for buyer-shaped queries.
The 40%/54% firm-own / third-party split is stable across the expanded cohort and 4 measurement windows.
The Claude training-data collapse on CPA firms generalizes across two distinct geographies (Hawaii 1%, Austin 0%). Two-geo confidence is medium. A third geo measurement would strengthen or break the pattern.
Gemma's CPA-firm coverage is geo-specific rather than category-specific (Hawaii 2%, Austin 23%). The original "training-data engine collapse" pattern from the Hawaii CPA teardown overstated the symmetry between Claude and Gemma. The cross-geo measurement corrected the overstatement.
Microsoft Copilot's 0% own-share holds in Austin as it does across every category we have measured. The pattern is universal across the measurement windows.
The web-searching engines are the primary closable AEO surface for Austin CPA firms (40-61% own-share band depending on engine).

What it does not yet support:

That the Claude collapse holds outside the two measured geographies. A third (and fourth, etc.) measured CPA geo would strengthen the cross-geo claim. As of this teardown, the pattern is observable in Hawaii and Texas. It has not been measured elsewhere.
That Gemma's 23% Austin own-share is the ceiling. Cohort expansion could move it slightly up or down on additional runs.
That the geo-density hypothesis (Gemma knows larger-market business content better) is the cause of the Gemma split. We observe the split. We infer the cause. A measurement of a CPA category in a third geography of different population density would test the hypothesis directly.
That changing a firm's content, directory presence, or third-party listings would cause AI to cite differently. We measured what AI cites. Causation requires pre-registered experiments. Different scope.

Why this is anonymized

None of the 37 firms in this cohort are paying NeverRanked customers. The non-customer anonymization rule applies: counts, distributions, source-type breakdowns, and per-AI-tool numbers are public. Individual firm names are not. The pattern is what is informative on a public surface. The named cohort lives only inside paid engagement deliverables, where the named firm is the customer authorizing the use.

A firm that becomes a NeverRanked customer gets a 1:1 deliverable that names every firm in the cohort, names the queries the customer is missing on, and ranks the closable conditions. That deliverable is private to the customer.

Get the free diagnostic Hawaii CPA teardown (for comparison) Cross-category teardown How we measure

Measurement window: 4 manifest-recorded runs on 2026-05-27 (3 deliberate runs + 1 partial-recovery run that completed cleanly). Pattern-readiness rule of 3 runs cleared per the internal pattern-readiness rule. Refresh cadence is monthly or on customer request.

Substantiation: question set locked by hash 17fec069..., documented method at /methodology/, named AI tools on named dates, every run dated on the claims ledger. The Gemma model itself is open-weight and independently inspectable.

Anonymization: the 37-firm cohort is kept anonymized at the firm level per the non-customer rule. Counts, distributions, and named third-party sources (Cherry Bekaert, CLA, CX Research Institute) are public because they are categorically named already and the substantiation value depends on naming the specific structural surfaces AI uses.