NeverRanked · Teardown 06 · Hawaii CPA firms

The web-searching engines find Hawaii CPAs. The training-data engines don't know any of them.

41-firm cohort, 18 hash-locked questions, 3 usable runs on 2026-05-26. Pattern-readiness cleared. Individual firms anonymized. Counts and distributions named.

The headline finding: across 41 Hawaii CPA firms and 7 AI tools, firm-owned websites get 39% of all mentions, the same share as Hawaii law firms. The engine-level pattern is what stands out. OpenAI cites Hawaii CPA websites 60% of the time and Gemini 59%, while Claude and Gemma (the training-data engines) cite firm websites 2% of the time or less. That collapse is not unique to CPAs: Claude also cites firm sites under 5% of the time for Honolulu med spas, Honolulu HVAC, and Austin CPAs. What is distinctive here is how cleanly four of the web-searching engines carry the category while both training-data engines fall away. Microsoft Copilot, the fifth web-searching tool, sits at 2% and is the exception. The closable competitive ground sits inside those four engines (OpenAI, Gemini, Perplexity, Google AI Overviews). The training-data engines are a structural blind spot for the entire category, not a closable condition.

Why this category matters as a measurement subject

Hawaii CPA firms are a category where buyer trust signals matter more than commodity attributes. A buyer choosing a CPA is choosing a named partner's judgment on tax, audit, or advisory work that touches the rest of the business. The AI-citation surface reflects that. Firms compete for inclusion in editorial sources, professional associations, and category-specific directories in ways that purely transactional categories do not need to. With 41 firms in the cohort, the data shows where that competition actually lands.

Hawaii CPA is one of the categories NeverRanked has measured to pattern-readiness. The cross-category teardown reads it against the others, and the most distinctive structural pattern that surfaces is the training-data blind spot named above.

Methodology summary

Same 7-AI-tool methodology applied across all NeverRanked teardowns:

5 web-searching AI tools: Perplexity, ChatGPT search, Gemini (grounded with live web search), Microsoft Copilot via Bing organic results, Google AI Overviews.
2 training-data AI tools: Claude (via the Anthropic API), Gemma (open-weight, run on a model-hosting provider so the model itself is independently inspectable).

18 questions a Hawaii CPA buyer would actually ask AI, locked at hash dc6ae677... so every run compares like with like. 3 repetitions per question per AI tool. 3 usable runs on 2026-05-26 (runs #1, #2, and #3, fired in succession), which clears the internal pattern-readiness rule of 3 usable runs.
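A minimal sketch of what a hash lock on a question set can look like. The text does not say which hash function or canonical form is used, so SHA-256 over a JSON-encoded list is an assumption here, and the two questions are hypothetical placeholders, not the locked set.

```python
import hashlib
import json


def question_set_hash(questions: list[str]) -> str:
    """Return a SHA-256 hex digest of a question list in canonical form."""
    # Assumption: collapse whitespace so cosmetic edits do not change the
    # hash, while wording and question order stay significant.
    canonical = [" ".join(q.split()) for q in questions]
    payload = json.dumps(canonical, ensure_ascii=False, separators=(",", ":"))
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def assert_locked(questions: list[str], expected: str) -> None:
    """Refuse to run if the question set drifted from the locked hash."""
    actual = question_set_hash(questions)
    if actual != expected:
        raise ValueError(f"question set changed: {actual[:8]} != {expected[:8]}")


# Hypothetical placeholder questions, not the real locked set.
questions = [
    "Who are the best CPA firms in Honolulu for small business taxes?",
    "Which Hawaii CPA firms handle nonprofit audits?",
]
locked = question_set_hash(questions)

# Called before every run: passes only if the wording and order are unchanged.
assert_locked(questions, locked)
```

Checking the hash before each run is what makes run #1, #2, and #3 comparable: any reworded, added, or reordered question produces a different digest and stops the run.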

The 41-firm cohort was built in four passes. 5 anchor firms registered before run 1. 15 additional firms surfaced through the run #1 within-citation scan. 8 more surfaced after run #2 cohort-coverage. 11 final additions after run #3. National firms (KPMG, PwC, Deloitte, BDO, CLA) and out-of-state firms were excluded from the cohort to keep the comparison Hawaii-specific. State professional and academic bodies (Hawaii Society of CPAs, the UH Shidler business school), generic firm directories (Goodfirms, Clutch), neighborhood directories, and government domains were deliberately not registered as competitors. They appear in the data as third-party content sources.

Full methodology, including the hash-locked question sets and the dated runs on the claims ledger, is documented at /methodology/.

Source-type distribution (cohort-wide)

Across all 41 firms and all 7 AI tools, AI produced 5,809 total citations, drawn from these source types:

Source type	% of mentions	Count
Independent web (third-party content)	54%	3,133
Competitor (firm-owned websites)	39%	2,244
Review directories (Clutch, Yelp, Expertise, BBB)	5%	270
Wikipedia	2%	96
Social (LinkedIn, Facebook, Instagram)	1%	31
Reddit	0%	23

This puts Hawaii CPA firms at the low end of the measured categories, tied with Hawaii law firms at 39% firm-owned. For comparison: Hawaii consumer banking is 53%, Hawaii wealth management 47%, Honolulu dental 44%, Hawaii law firms 39%. The professional-services categories (wealth, dental, law, CPA) all sit between 39% and 47%, with CPA tied for the lowest. The review-directory share, at 5%, is the highest of any measured category, driven primarily by Clutch (101 cites), Yelp (66 cites, desktop and mobile combined), Expertise (35), and BBB (30).
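The percentages in the source-type table can be reproduced from the counts. The sketch below assumes each share is the row count divided by the stated 5,809 total and rounded to a whole percent; the variable names are illustrative. It also shows that the six rows sum to 5,797, which is 12 short of the stated total. The text does not say how those 12 citations are classified.

```python
# Counts copied from the source-type table above.
source_counts = {
    "Independent web": 3133,
    "Competitor (firm-owned)": 2244,
    "Review directories": 270,
    "Wikipedia": 96,
    "Social": 31,
    "Reddit": 23,
}
TOTAL_CITATIONS = 5809  # total stated in the text

# Assumption: shares are computed against the stated total, then rounded.
shares = {
    name: round(100 * count / TOTAL_CITATIONS)
    for name, count in source_counts.items()
}

classified = sum(source_counts.values())
unaccounted = TOTAL_CITATIONS - classified

for name, pct in shares.items():
    print(f"{name:<26}{pct:>3}%")
print(f"rows sum to {classified}; {unaccounted} citations not in any row")
```

Because each row is rounded independently, the printed shares add up to 101% rather than 100%.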

Per-AI-tool breakdown, the training-data collapse

AI tool	Firm-owned share	Third-party share	Total mentions
ChatGPT search (OpenAI)	60%	40%	898
Gemini grounded	59%	41%	1,533
Perplexity	46%	54%	1,081
Google AI Overviews	45%	55%	621
Microsoft Copilot (Bing)	2%	98%	782
Gemma (training data)	2%	98%	293
Claude (training data)	1%	99%	601

This per-engine table is the most distinctive finding in the data. The four web-searching engines (OpenAI, Gemini, Perplexity, Google AIO) all reach between 45% and 60% firm-owned share. The two training-data engines (Claude and Gemma) reach 1% and 2%. Microsoft Copilot sits at 2%, consistent with the cohort-wide Copilot pattern observed in dental, wealth, and law categories.

For comparison: in the Hawaii law-firm teardown, Gemma was the highest own-share engine at 78% and Claude was second at 51%. In Honolulu dental, Gemma was the top own-share engine. In Hawaii consumer banking, Claude reached 71%. The training-data engines carry several categories this way (banking, law, wealth, dental), but they are not universal: for Honolulu med spas, Honolulu HVAC, and Austin CPAs the same engines collapse, as they do for Hawaii CPAs here.

The structural reading is that Hawaii CPA firms have substantially less brand presence in the training-data corpora than equivalent firms in adjacent professional-services categories. A likely reason, which this data does not test, is that CPA work is more transactional, less editorially covered, and less likely to surface in the kind of broad-web content training-data engines memorize. A firm cannot close this gap by changing what is on its website. Closing it would require category-wide editorial coverage of Hawaii CPAs in the kind of sources training data ingests (national publications, professional-association editorial, academic-adjacent content), which is outside the scope of what any individual firm controls.

The closable ground is the four web-searching engines

In other categories, the closable ground often sits in the cohort-wide Copilot gap, where Copilot cites few or no firm sites and a firm ranking in Bing organic tends to be the one it surfaces. The Copilot gap also appears for CPAs, but the structural read flips: the four web-searching engines (OpenAI, Gemini, Perplexity, Google AIO) are where the competition is actually decided for Hawaii CPAs. They produce the vast majority of firm-owned mentions in the data, while the training-data engines contribute almost none. Closing the training-data gap is a category-wide editorial problem that no single firm controls. Competing on the web-searching tier is something a firm's own content, schema, and editorial visibility can move. The condition is to appear clearly enough in the sources those four engines pull from that they cite your firm instead of a competitor or a third-party aggregator.
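How much of the firm-owned total the four web-searching engines carry can be estimated from the per-AI-tool table. The published shares are rounded to whole percents, so the sketch below produces estimates, not the exact underlying counts. The grouping names are illustrative.

```python
# (firm-owned share in %, total mentions), copied from the per-AI-tool table.
engines = {
    "ChatGPT search": (60, 898),
    "Gemini grounded": (59, 1533),
    "Perplexity": (46, 1081),
    "Google AI Overviews": (45, 621),
    "Microsoft Copilot": (2, 782),
    "Gemma": (2, 293),
    "Claude": (1, 601),
}
WEB_FOUR = ("ChatGPT search", "Gemini grounded", "Perplexity", "Google AI Overviews")
TRAINING_DATA = ("Gemma", "Claude")


def estimated_firm_owned(name: str) -> float:
    """Estimate firm-owned mentions from a rounded share and a total."""
    pct, total = engines[name]
    return pct * total / 100


total_estimate = sum(estimated_firm_owned(name) for name in engines)
web_four_share = sum(estimated_firm_owned(n) for n in WEB_FOUR) / total_estimate
training_share = sum(estimated_firm_owned(n) for n in TRAINING_DATA) / total_estimate

print(f"estimated firm-owned mentions: {total_estimate:.0f}")
print(f"four web-searching engines: {web_four_share:.1%}")
print(f"training-data engines: {training_share:.1%}")
```

The estimated total lands within a few mentions of the 2,244 firm-owned count in the source-type table, and on these estimates the four web-searching engines account for roughly 99% of firm-owned mentions while Claude and Gemma together account for under 1%.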

Top recurring firms (anonymized)

The 5 firms AI cited most often across the 18 questions and 7 tools.

Firm (anonymized)	Total mentions	% of cohort competitor share	Runs cited in
Firm A	265	12%	3/3
Firm B	182	8%	3/3
Firm C	121	5%	3/3
Firm D	115	5%	3/3
Firm E	106	5%	3/3

The top 5 firms account for 35% of all firm-owned mentions (789 of 2,244). For comparison: Hawaii law firms top 5 = 64%, Hawaii consumer banking top 5 = 71%, Honolulu dental top 5 = 49%, Hawaii wealth top 5 = 36%. Of the categories in this comparison, Hawaii CPA is the least concentrated. The competitive distribution is flatter: more firms with meaningful mention counts, no single dominant firm pulling away.

All 5 top firms appeared in all 3 measurement runs (consistency signal, not run-to-run noise). The remaining 36 firms in the cohort have meaningful mention counts but at a noticeably lower frequency. The structural reading: for a firm outside the top 5, the gap to the leader is narrower than in the other categories in this comparison. There is no dominant tier to break into. The head of the distribution is shallow.
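The concentration figures can be checked directly from the top-firms table. The sketch assumes the denominator is the 2,244 firm-owned mentions from the source-type table and, for the last line only, that all of those mentions belong to the 41 cohort firms. The text does not state the second assumption.

```python
# Mention counts copied from the anonymized top-firms table.
top5 = {"Firm A": 265, "Firm B": 182, "Firm C": 121, "Firm D": 115, "Firm E": 106}
FIRM_OWNED_MENTIONS = 2244  # from the source-type table
COHORT_SIZE = 41

top5_total = sum(top5.values())
top5_share = round(100 * top5_total / FIRM_OWNED_MENTIONS)
per_firm_share = {
    firm: round(100 * count / FIRM_OWNED_MENTIONS) for firm, count in top5.items()
}

# Assumption: every firm-owned mention belongs to one of the 41 cohort firms.
rest_total = FIRM_OWNED_MENTIONS - top5_total
rest_average = rest_total / (COHORT_SIZE - len(top5))

print(f"top 5: {top5_total} mentions, {top5_share}% of firm-owned")
print(per_firm_share)
print(f"remaining {COHORT_SIZE - len(top5)} firms average {rest_average:.1f} mentions")
```

Under that assumption the remaining 36 firms average about 40 mentions each, against 106 for Firm E, which is one way to see how shallow the head of the distribution is.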

Where AI pulls from when it cites non-firm content

The 3,133 third-party-content mentions are not all the same shape. Top recurring sources across runs:

Source	Mentions	Why AI cites it
Hawaii Society of CPAs (hscpa.org)	130	State professional association directory
Generic Hawaii directories (gohawaii.com, hawaii.com)	137	State and neighborhood reference sites
Clutch.co (review directory)	101	B2B service-provider reviews and rankings
Yelp (desktop + mobile)	66	Local business reviews
Expertise.com (directory)	35	Editorial firm rankings by category

The Hawaii Society of CPAs (hscpa.org), at 130 mentions across 3 runs, is the most structurally significant third-party source. For a buyer asking AI about Hawaii CPA firms, the state professional association is cited more often than the website of every firm in the cohort except the top two (Firm A at 265 mentions and Firm B at 182). Editorial directories (Clutch, Expertise) and review platforms (Yelp, BBB) account for the next tier. Generic Hawaii reference sites (gohawaii.com, hawaii.com, kaimukihawaii.com) appear frequently because AI often grounds Hawaii-specific queries in state-level sources before naming individual firms. A firm not represented in HSCPA's public directory is effectively invisible across roughly 2% of all citations in the data (130 of 5,809).

What this teardown does and does not prove

What it does support:

The 41-firm cohort is a representative slice of Hawaii CPA firms AI tools actually cite for buyer-shaped questions.
The 39%/54% firm-owned / third-party split is stable across the expanded cohort and all 3 runs.
The training-data engine collapse (Claude 1%, Gemma 2%) is the most distinctive structural pattern, shared with a few other local-service categories (med spas, HVAC, Austin CPA) where the training-data engines also fall away.
The four web-searching engines reach between 45% and 60% firm-owned share, the highest engine-cluster reach we have observed for a professional-services category.
The top 5 concentration (35% of mentions) is the lowest among the categories in this comparison. The competitive distribution is flatter than in adjacent professional services.
HSCPA (the state professional association) and Clutch (B2B review directory) are measurable AEO (answer engine optimization) surfaces for this category that most firms likely treat as a separate practice-development effort, not an AEO one.

What it does not yet support:

That the training-data collapse is permanent. AI training data refreshes on schedules outside our control. A surge in editorial coverage of Hawaii CPA firms could shift Claude and Gemma's awareness. The monthly memo cadence is how we would observe that.
That changing a firm's content, directory presence, or third-party listings would cause AI to cite differently. We measured what AI cites. Causation requires pre-registered experiments. Different scope.
That the closable ground inside the four web-searching engines is actually closable for any specific firm. What conditions move web-searching citation rates is its own measurement question and varies by engine.

Why this is anonymized

None of the 41 firms in this cohort are paying NeverRanked customers. The non-customer anonymization rule applies: counts, distributions, source-type breakdowns, and per-AI-tool numbers are public. Individual firm names are not. The pattern is what is informative on a public surface. The named cohort lives only inside paid engagement deliverables, where the named firm is the customer authorizing the use.

A firm that becomes a NeverRanked customer gets a 1:1 deliverable that names every firm in the cohort, names the queries the customer is missing on, and ranks the closable conditions. That deliverable is private to the customer.


Measurement window: 3 usable runs on 2026-05-26, which clears the internal pattern-readiness rule of 3 usable runs. Refresh cadence is monthly or on customer request.

Substantiation: question set locked by hash dc6ae677..., the documented method at /methodology/, named AI tools on named dates. The fact-checker rejected zero claims in this teardown.

Anonymization: the 41-firm cohort is kept anonymized at the firm level per the non-customer rule. Counts, distributions, and named third-party directory sources (HSCPA, Clutch, Expertise, BBB, Yelp) are public because they are categorically named already and the substantiation value depends on naming the specific structural surfaces AI uses.