Citera Research

An Analysis of 350,000 B2B SaaS Articles: What Predicts Google Ranking and AI Citation

Authors

Published May 21, 2026

Affiliation Citera, San Francisco, CA

Keywords: B2B SaaS, SEO, AI citation, generative engine optimization, content marketing, AI search, ChatGPT, Perplexity, AI Overviews, Google ranking

Abstract

We analyzed approximately 350,000 B2B SaaS articles competing for 10,382 keywords across 52 B2B SaaS categories (CRM, project management, marketing automation, HR software, and others) in Google and four AI search engines (ChatGPT, Claude, Perplexity, AI Overviews). The typical B2B SaaS article is light on credibility signals: only 21% include expert quotes and 29% include 3 or more statistics. But among AI-cited articles, those numbers jump to 52% and 64%. AI-cited articles average 4.2 statistics and 1.6 expert quotes, while non-cited articles average 1.2 and 0.2. 61% of AI citations come from earned media (including LinkedIn), compared to 49% in Google’s organic results, but brand-owned content still holds 29% of AI citations in B2B SaaS, more than three times the rate measured on the general web. Only 14% of AI-cited URLs also appear in Google’s top 20, though 30% of Google’s top-20 articles are cited by AI. Cross-engine citation overlap ranges from 8% to 17%. AI Overviews trigger on 83-87% of comparison and question-format B2B SaaS queries. These are the first content-quality baselines measured specifically for B2B SaaS, and they reveal a significant optimization gap: most B2B SaaS content lacks the credibility signals that both Google and AI engines reward.


1. Introduction

B2B SaaS companies live and die by organic search. Animalz’s benchmark report found that organic search accounts for 83% of traffic to established SaaS blogs, with every other channel (direct, referral, social, email) contributing single digits. For a category this dependent on search, the emergence of AI-powered discovery represents the most significant distribution shift since mobile.

The problem is that the existing research base was not built for this audience. Backlinko analyzed 11.8 million Google results spanning every category from recipes to real estate. Princeton’s GEO study used a benchmark of 10,000 queries that was 80% informational and drawn from academic datasets. SE Ranking studied 2.3 million pages across 20 unnamed niches. Ahrefs analyzed 55.8 million AI Overviews globally. These are valuable studies, but none of them answer the question a B2B SaaS content team actually needs answered: what works for our keywords, our audience, and our competitive dynamics?

This study addresses that gap. We collected approximately 138,000 raw B2B SaaS keywords from four sources: G2 category pages across 52 B2B SaaS categories, leading SaaS company blogs (HubSpot, Salesforce, Monday.com, and others), a commercial keyword data provider for related-term expansion, and query templates (“X vs Y,” “best X software,” “how to Y”) applied across all categories. After removing duplicates and quality filtering, 10,382 keywords remained. For each keyword, we collected Google’s top 20 organic search results and queried four AI engines. The combined corpus totaled approximately 350,000 unique articles after removing inaccessible URLs (paywalls, broken links, anti-bot blocks). We extracted 17 content features from every article using a combination of AI-powered extraction and automated parsing, validated against 200 manually reviewed articles, plus 6 search and AI citation data points per article.

Here is what we found, in one sentence per section:

  • The typical B2B SaaS article has almost no credibility signals, but articles with more statistics, quotes, and source citations consistently outperform on Google and in AI search.
  • Word count has zero ranking impact within Google’s top 10.
  • AI engines cite earned media at a higher rate than brand-owned content, but brand content holds 29% of AI citations in B2B SaaS, far above the ~8% measured on the general web.
  • Schema markup, keyword-in-URL, and keyword stuffing have no positive impact on Google ranking or AI citation.
  • AI Overviews trigger on the majority of informational B2B SaaS queries, with comparison keywords reaching 87%.

2. Prior Research

The findings in this paper build on 19 studies published between 2020 and 2026. Rather than reviewing each sequentially, we organize them by the questions they address.

What predicts Google ranking? Backlinko’s analysis of 11.8 million Google results (2020) established several durable findings: the average first-page result contains 1,447 words, but word count has zero correlation with ranking position within the first page. Similarly, 65-85% of first-page results contain a keyword in the title, but keyword-in-title shows zero within-page-one ranking impact. Schema markup (present on 72.6% of first-page results) shows no ranking relationship. The strongest ranking predictor remains domain authority, with the #1 result averaging 3.8x more backlinks than positions 2-10. Ahrefs (2025, 1.3M keywords) found that 72.9% of pages in Google’s top 10 are over 3 years old, up from 59% in 2017, and only 6.11% of new pages reach the top 10 within a year.

What predicts AI citation? Princeton’s GEO study (2024, 10K queries) is the first causal evidence in the literature: adding quotations improved AI visibility by 28-43%, adding statistics by 23-33%, and adding source citations by 13-28%, while keyword stuffing decreased visibility by approximately 9%. SE Ranking (2025, 216K-2.3M pages) found directionally consistent but larger effects (93% more citations for data-rich pages, 71% for pages with expert quotes), though these figures are uncontrolled for domain authority. Stanford’s verifiability study (2023, 5.8K query-response pairs) found that only 51.5% of AI-generated statements were fully supported by citations, with citation recall worst on open-ended queries (44.3%) and highest on list-type factoid queries (73.0%).

How do AI engines differ from Google? Ahrefs (2025, 15K queries) found that only 11.9% of AI-cited URLs also appear in Google’s top 10, with 80% not ranking anywhere in Google for the original query. Profound (2025, 100K prompts) found cross-engine overlap equally low: ChatGPT-Perplexity at 11%, AI Overviews-Copilot at 6%, Perplexity-AI Overviews at 16.4%. The University of Toronto (2025, 1K queries) found that AI engines cite earned media at 72.7-74.2% for software products, compared to 31.8-45.4% for Google. On the general web, multiple industry studies have found earned media accounts for 82-85% of all AI citations.

What is the impact of AI Overviews? Agarwal and Sen (2026, 1,065 participants) conducted the first randomized field experiment: AI Overviews reduce organic clicks by 38% and increase zero-click searches from 54% to 72%. Seer Interactive (2025-2026, 5.47M queries) found that being cited in an AI Overview recovers some of that loss:

ScenarioClick-through rateClicks per 1M impressions
No AI Overview3.35%33,500
AI Overview, brand cited2.07%20,700
AI Overview, brand not cited0.94%9,400

Table 1. AI Overview click-through impact (Seer Interactive, 2025-2026).

Being cited in an AI Overview recovers approximately 62% of the click volume that existed before AI Overviews (20,700 / 33,500). Not being cited recovers only 28% (9,400 / 33,500). Even the best-case scenario (cited) still underperforms the no-AI-Overview baseline by 38%. Citation is damage mitigation, not growth.

Pew Research (2025, 68.9K searches) found that question-format queries trigger AI summaries 60% of the time and only 1% of users click source links within AI summaries.

LinkedIn as a B2B AI citation source. Semrush (325K prompts, 89K LinkedIn URLs) found LinkedIn appearing in approximately 11-14% of AI search responses, with the highest rates on ChatGPT Search (14.3%) and Google AI Mode (13.5%). Profound (1.4M citations) found LinkedIn surged from #11 to #5 on ChatGPT between November 2025 and February 2026, and is the #1 most-cited domain for professional queries across all six major AI platforms. Posts and articles grew from 27% to 35% of LinkedIn citations while profile citations dropped from 34% to 15%.

What about B2B SaaS specifically? Animalz (2021, 60K+ SaaS articles) found the median SaaS article is 1,200 words, the median number of internal links per article is zero, and the median number of backlinks is 1. Organic traffic share for established SaaS blogs reached 82.9% in 2021, with the median SaaS blog experiencing negative growth (-1.64% annually). Prior studies have examined these dimensions separately for the general web but not in combination for B2B SaaS content. No study has measured what percentage of B2B SaaS articles actually contain the credibility signals (statistics, expert quotes, source citations) that both Google and AI engines reward.


3. Methodology

3.1 Keyword Selection

This study was conducted in May 2026, focusing on the US market using English-language queries. We collected approximately 138,000 raw B2B SaaS keywords using four methods: (1) G2 category pages across 52 B2B SaaS categories (CRM, analytics, dev tools, marketing automation, project management, cybersecurity, HR software, customer support, and others), (2) analysis of leading B2B SaaS company blogs (HubSpot, Salesforce, Monday.com, Notion, and others) to identify the keywords they target, (3) related-term expansion via a commercial keyword data provider, and (4) query templates applying proven search patterns (“X vs Y,” “best X software,” “how to Y”) across all categories. All keywords were filtered to monthly search volume of 50 or higher. After removing duplicates and quality filtering, 10,382 keywords remained across 52 categories, classified into six intent categories:

Intent categoryKeywords (approx.)Example
Comparison~2,350“HubSpot vs Salesforce”
Question-format~2,480“how to automate onboarding”
Best-of~2,160“best project management software”
Use-case~1,290“employee scheduling app for restaurants”
Feature~1,230“CRM email tracking”
Category~880“content marketing platform”

Table 2. Keyword distribution by intent category.

The distribution skews toward comparison and question-format queries, which matches the B2B SaaS keyword landscape.

3.2 Data Collection

Google. For each of the 10,382 keywords, we collected the top 20 organic search results via a third-party search data API, capturing URL, title, position, and snippet. This produced 194,821 search result positions pointing to 143,117 unique articles. Roughly three-quarters of URLs were unique; the remaining quarter appeared for multiple keywords.

AI engines. For each keyword, we queried ChatGPT (web search enabled), Claude (web search), Perplexity, and Google AI Overviews. Each query was run once per engine with a standardized prompt. Across 10,382 keywords and four engines, we collected 487,982 individual article references: ChatGPT returned 86,453 (averaging 8.3 per response), Perplexity returned 243,615 (23.5 per response), Claude returned 64,312 (6.2 per response), and AI Overviews returned 93,602 (9.0 per response). After removing duplicates within each engine (ChatGPT: 67,319 unique; Claude: 52,119; Perplexity: 174,604; AI Overviews: 64,112) and then across engines, 301,604 unique AI-cited article URLs remained. Roughly 16% of articles appeared in multiple engines, driven by concentration on a small number of high-authority domains (Wikipedia, G2, major publications) that recur across many responses.

Article corpus. The combined URL set across Google and AI results totaled 402,135 unique URLs. After removing URLs that could not be scraped (paywalls, broken links, anti-bot protections, non-article pages), approximately 350,000 articles remained with full feature extraction. Of the 301,604 AI-cited articles, 42,586 (14.1%) also appeared in Google’s top 20. Looking the other direction, 42,586 of the 143,117 Google articles (29.8%) were also cited by at least one AI engine.

The analyses below use different subsets depending on the comparison: Google analyses use articles appearing in Google’s top 20 (143,117); AI analyses use articles cited by at least one AI engine (301,604); overlap analyses compare the two sets.

3.3 Feature Extraction

We extracted 17 content features from each article. Features requiring interpretation, such as counting statistics or identifying expert quotes, used AI-powered extraction. Structural features such as word count, heading structure, and URL patterns were parsed automatically. The full feature list:

#FeatureMethodValidation accuracy
1Statistics countAI extraction + pattern matching90.2%
2Expert quote countAI extraction86.6%
3External source citation countAI extraction + link parsing89.7%
4Word countAutomated (visible text)98.1%
5Readability grade (Flesch-Kincaid)Automated98.0%
6Section countHTML heading parser95.2%
7Average section lengthHTML heading parser93.8%
8Keyword densityAutomated96.1%
9FAQ section presentHeading + schema parser96.1%
10Publish dateHTTP header + meta tags + visible date85.2%
11Content formatAI classification85.1%
12Schema markup presentHTML parser99.8%
13URL lengthAutomated100%
14Keyword in URLString match94.8%
15Keyword in title tagString match99.6%
16Domain typeAI classification93.2%
17On B2B review platformDomain match (G2, Capterra, Trustpilot, TrustRadius, Gartner Peer Insights)98.9%

Table 3. Feature extraction methodology and validation accuracy.

In addition to the 17 content features, we recorded each article’s Google ranking position (1-20), whether the keyword triggered an AI Overview, and whether each of the four AI engines cited the article.

Validation was performed on 200 randomly sampled articles with manual human review. For each article, a human reviewer independently assessed all 17 features and compared results to the automated extraction. Per-field accuracy ranged from 85.1% to 100%. Two features, expert quote detection (86.6%) and content format classification (85.1%), showed lower accuracy than other features, which we note as a limitation.

3.4 Analysis Approach

Google analysis. For each keyword, we compared content features across four position buckets (1-5, 6-10, 11-15, 16-20). This within-keyword comparison controls for keyword difficulty and topic, isolating the content-feature gradient across positions for the same query.

AI analysis. For each keyword, we compared content features of articles cited by at least one AI engine against articles appearing in Google’s top 20 but not cited by any AI engine. This same-keyword comparison controls for topic relevance.

Combined analysis. We identified features that appear elevated in both Google positions 1-5 and AI-cited articles, relative to the overall baseline. Features showing a consistent positive signal on both Google and AI represent the highest-priority content investments.

3.5 Limitations

No domain authority control. We do not measure backlinks, Domain Rating, or any authority metric. All comparisons are observational and confounded by domain authority. Where Princeton’s causal findings exist, we cite them as the benchmark; our B2B SaaS numbers describe the observed direction but cannot establish causation.

Single-run AI queries. Each AI engine was queried once per keyword. Profound’s research shows that 40-60% of AI-cited sources change month to month, so our snapshot does not capture every article that could be cited for a given keyword. Across 10,382 keywords, the noise averages out, but individual keyword-level citation data should be treated as directional.

Observational design. Correlations between content features and ranking or citation do not establish causation, except where we reference Princeton’s controlled experiments.


4. Results

4.1 What Winning B2B SaaS Content Looks Like

The table below shows mean values for count features and percentages for prevalence features across five article groups: all articles in the corpus, Google positions 1-5, Google positions 11-20, AI-cited articles, and articles not cited by any AI engine.

FeatureAll articlesGoogle pos 1-5Google pos 11-20AI-citedNot AI-cited
Mean word count1,3921,6421,2841,6901,305
Mean statistics1.93.61.94.21.2
Mean expert quotes0.61.50.41.60.2
Mean source citations3.25.33.26.22.3
% with at least 1 expert quote21%44%25%52%12%
% with 3+ statistics29%63%36%64%28%
Readability grade (mean)10.59.910.79.610.8
Mean section count9118128
Mean section length (words)139142136141139
Keyword-in-title %73%74%71%69%74%
Schema markup %71%72%70%69%72%
FAQ section %19%22%16%24%17%

Table 4. Content feature baselines across article groups.

Three patterns emerge immediately.

Credibility signals separate winners from losers. Only 21% of B2B SaaS articles include a named expert quote, and only 29% include 3 or more statistics. Among Google top-5 articles, those numbers jump to 44% and 63%. Among AI-cited articles, they reach 52% and 64%. The mean statistics count tells the same story: 1.9 across all articles, 3.6 for Google top-5, and 4.2 for AI-cited. No prior study has measured these baselines for B2B SaaS content specifically. The direction is reinforced by Princeton’s controlled experiments, which found that adding statistics and quotes causally improves AI visibility, but the B2B SaaS-specific prevalence rates (only 21% with quotes, only 29% with 3+ statistics) are new.

Readability grades are higher in B2B SaaS than the general web, but lower is still better. The overall B2B SaaS mean is grade 10.5, but the best-performing articles trend lower: Google top-5 at grade 9.9, AI-cited at grade 9.6. The B2B SaaS sweet spot appears to be grade 9-10: accessible enough for AI engines to extract and cite, technical enough for a B2B buyer to take seriously.

Schema markup and keyword-in-title are table stakes, not differentiators. Keyword-in-title prevalence is high across all groups (69-74%) with no meaningful gradient. Schema markup prevalence is similarly flat (69-72%). Having schema markup does not help you rank higher or get cited more, but not having it could keep you out of the top results entirely. Think of it like wearing a suit to a job interview: if everyone wears one, yours does not make you stand out. But showing up without one might disqualify you.

4.2 What Works for Google Specifically

Breaking Google’s top 20 into four position buckets reveals a gentle gradient, not a cliff.

FeaturePos 1-5Pos 6-10Pos 11-15Pos 16-20
Mean word count1,6421,4631,3381,230
Mean statistics3.62.82.21.6
Mean expert quotes1.50.90.60.2
Readability grade9.910.310.610.8
Keyword density1.3%1.2%1.4%1.3%
Mean article age (months)23221814
% with at least 1 expert quote44%34%28%22%

Table 5. Google ranking position gradient.

Word count shows a gradient but not causation. Top-5 articles average 1,642 words versus 1,230 at positions 16-20. But within the top 10, the difference narrows to 1,642 versus 1,463. Longer articles in top positions likely reflect more comprehensive treatment of topics that happens to correlate with other ranking signals (more backlinks, higher domain authority), not a direct ranking benefit from word count.

Expert quotes show a steep gradient. 44% of articles in Google positions 1-5 include at least one named expert quote, dropping to 34% at positions 6-10, 28% at positions 11-15, and 22% at positions 16-20. The mean expert quote count follows the same curve: 1.5 at positions 1-5 down to 0.2 at positions 16-20.

Article age confirms the incumbency advantage. The mean top-5 article is 23 months old; the mean position 16-20 article is 14 months old. Only 9% of top-20 articles in our dataset were less than 12 months old, compared to Ahrefs’ 13.7% for the general web. B2B SaaS skews even older: established articles on established domains dominate.

Keyword density is flat. Across all position buckets, keyword density ranges from 1.2% to 1.4% with no meaningful gradient. The top-performing positions actually have slightly lower keyword density than bottom positions. Repeating the target keyword more often does not help.

AI Overview trigger rates by query type. We checked whether each keyword triggered an AI Overview:

Query typeAIO trigger rateExample
Comparison87%“HubSpot vs Salesforce”
Question-format83%“how to automate onboarding”
Best-of72%“best project management software”
Category73%“content marketing platform”
Use-case58%“CRM for restaurants”
Pricing/evaluation42%“HubSpot pricing”
Feature-specific41%“CRM email tracking”
Transactional8%“buy Salesforce”

Table 6. AI Overview trigger rates by query type.

Comparison and question-format queries face near-universal AI Overview competition (83-87%). Even pricing queries trigger AI Overviews 42% of the time when there is pricing structure to synthesize. Only pure transactional queries (“buy X,” “X free trial”) remain largely unaffected.

4.3 What Works for AI Specifically

Comparing AI-cited articles to non-cited articles for the same keywords reveals what AI engines value beyond Google ranking signals.

FeatureAI-citedNot AI-citedDifference
Mean statistics4.21.2+3.0
Mean expert quotes1.60.2+1.4
% with at least 1 quote52%12%+40 pts
Readability grade9.610.8-1.2 grades
Mean word count1,6901,305+385 words
Earned media %61%45%+16 pts
On review platform %19%9%+10 pts

Table 7. AI-cited vs non-cited article features.

The credibility gap is substantial. 52% of AI-cited articles include at least one named expert quote, compared to just 12% of non-cited articles. AI engines cite heavily from earned media (review sites, publications, analyst reports), which routinely includes expert quotes, explaining the gap. AI-cited articles average 4.2 statistics compared to 1.2 for non-cited, and 64% of AI-cited articles contain 3 or more statistics versus 28% of non-cited.

Domain type distribution.

Domain typeGoogle top 20AI-citedDifference
Earned media49%61%+12 pts
Brand-owned42%29%-13 pts
Reference (Wikipedia, other)9%10%+1 pt

Table 8. Domain type distribution across Google and AI citation.

Earned media includes review sites, industry publications, analyst reports, and professional content platforms. LinkedIn is now the #5 most-cited domain on ChatGPT and the #1 most-cited domain for professional queries, making it a significant and growing source of AI citations for B2B content.

61% of AI-cited articles for B2B SaaS keywords come from earned media sources, compared to 49% in Google’s top 20. Brand-owned content holds 29% of AI citations. On the general web, earned media accounts for 82-85% of AI citations and brand content for roughly 8%. Our B2B SaaS figures are meaningfully different: brand-owned content holds 29% of AI citations, over three times the general-web rate. This is because B2B SaaS includes a high proportion of feature-specific and use-case queries (“CRM email tracking,” “employee scheduling app for restaurants”) where the vendor’s own documentation and blog is the authoritative source. No journalist at Forbes wrote a deep dive on configuring your CRM pipeline. Only the vendor did.

For context, Reddit accounts for roughly 40% of all AI citations on the general web (46.7% on Perplexity, 21% on AI Overviews), making it the single most-cited domain across AI platforms for consumer queries. B2B SaaS queries behave differently: AI engines surface structured product comparisons and expert analysis rather than forum discussions.

Review platform presence. 19% of AI-cited articles are hosted on B2B review platform domains (G2, Capterra, TrustRadius), compared to 9% for non-cited articles. In practical terms: when AI engines choose which articles to cite for a B2B SaaS keyword, they are roughly twice as likely to pick one from a review platform domain than you would see in a random sample of Google’s top 20 results. This reflects the outsized role review platforms play as authoritative sources for comparison and best-of queries.

Citation volatility. Profound (80K prompts per platform) found that 40-60% of cited sources change every month, with Google AI Overviews showing the highest drift (59.3%) and Perplexity the lowest (40.5%). A single ChatGPT entity update in October 2025 wiped 31% of brand visibility overnight, affecting 85%+ of tracked brands. Any AI visibility measurement should be treated as a moving target, not a stable position.

4.4 What Doesn’t Matter

Several features showed no meaningful impact on Google ranking or AI citation.

FeatureGoogle ranking impactAI citation impact
Schema markupNone (69-72% prevalence, no gradient)None
URL lengthNegligible (9-char difference pos 1 vs 10)None
Keyword-in-URLNone within top 20Inverse (likely confounded by authority)
Keyword stuffingNone within top 20Negative: ~9% (Princeton, causal)
FAQ schemaN/AWeaker than actual FAQ content (SE Ranking)

Table 9. Features with no meaningful impact.

Schema markup is baseline infrastructure, not a competitive advantage. Schema markup prevalence was 69-72% across all position buckets with no gradient. Ahrefs tracked 1,885 pages that added structured data (JSON-LD) between August 2025 and March 2026 and found no boost to AI citations. AI engines read visible page content, not metadata. Schema helps Google understand your page for rich results and knowledge graph disambiguation, but it does not make AI engines more likely to cite you.

Keyword stuffing actively hurts AI citation. Keyword density ranged from 1.2% to 1.4% across all Google position buckets with no positive gradient. Princeton’s controlled experiment confirmed that keyword stuffing hurts AI visibility by approximately 9%.

4.5 How Google and AI Differ

The most strategically important finding is how little overlap exists between the two source pools.

Overlap rates:

ComparisonOverlap %
AI-cited URLs also in Google’s top 2014%
Google top-20 URLs also cited by AI30%
ChatGPT vs Perplexity (same keywords)10%
ChatGPT vs Claude (same keywords)8%
Perplexity vs AI Overviews17%

Table 10. Google-AI and cross-engine citation overlap.

Only 14% of AI-cited URLs for B2B SaaS keywords also appeared in Google’s top 20. But the reverse tells a different story: 30% of Google’s top-20 articles were cited by at least one AI engine. The asymmetry exists because the AI pool (301,604 unique articles) is more than twice the size of the Google pool (143,117). AI engines draw from a much wider set of sources than Google’s top 20. If you already rank on Google, you have a roughly 1-in-3 chance of also being cited by AI. But if you’re optimizing for AI citation, ranking on Google is far from sufficient: only 1 in 7 AI-cited articles ranks on Google at all.

Cross-engine overlap is equally low. ChatGPT and Perplexity share only 10% of cited URLs when given the same keywords. ChatGPT and Claude share just 8%. Even the highest overlap pair (Perplexity and AI Overviews at 17%) means over 83% of sources differ. Optimizing for one AI engine does not transfer to another.


5. Discussion

5.1 Comparison to Prior Research

For each major finding, we compare our B2B SaaS numbers to the general-web source study.

FindingGeneral web (source)B2B SaaS (our data)Verdict
Statistics improve AI citation+23-33% (Princeton, causal)AI-cited mean 4.2 vs not-cited 1.2; 64% vs 28% with 3+ statisticsDirection and magnitude consistent with Princeton’s controlled range.
Quotes improve AI citation+28-43% (Princeton, causal)52% of AI-cited have quotes vs 12% of not-cited; mean 1.6 vs 0.2B2B SaaS starts from a lower baseline (21% prevalence) but the gap is proportionally large.
Source citations improve AI citation+13-28% (Princeton, causal)AI-cited mean 6.2 vs not-cited 2.32.7x gap in B2B SaaS.
Word count predicts rankingZero within page 1 (Backlinko, 11.8M)1,642 vs 1,230 across top 20, flat within top 10Gradient exists across top 20 but flattens within page 1.
Schema helpsZero (Backlinko, Ahrefs)69-72% prevalence, no gradientSchema is baseline infrastructure in B2B SaaS, as on general web.
Keyword stuffing hurts AI~-9% (Princeton, causal)Slight negative trend in our dataDirection matches; we cannot measure the precise effect size.
AI cites earned media82-85% general web; 72-74% software (Toronto)61% (incl. LinkedIn)Calibrated lower. B2B SaaS keyword diversity favors brand content more than generic queries.
Brand content share in AI~8% general web (Stacker/Scrunch)29%B2B SaaS is different. Brand content holds 3x the general-web share.
AI-Google overlap11.9% (Ahrefs, 15K queries)14% from AI direction, 30% from Google directionAsymmetric: AI pool is 2x larger. 30% of Google articles get AI citations.
Cross-engine overlap6-16% (Profound, 100K prompts)8% to 17%Within Profound’s range.
Readability helps AIGrade 6-8 optimal (SE Ranking)Grade 9.6 AI-cited vs 10.8 not-citedCalibrated higher. B2B SaaS optimal is grade 9-10 due to technical audience.
Review platforms boost AI citation3-3.5x (SE Ranking, uncontrolled)19% vs 9% (2.1x)Lower ratio in B2B SaaS; cannot control for domain authority.

Table 11. Comparison of B2B SaaS findings to general-web prior research.

Where B2B SaaS is different from the general web:

  1. Brand content is 3x more viable. On the general web, brand-owned content captures roughly 8% of AI citations. In B2B SaaS, it holds 29%. This is the single most strategically important divergence from the general-web research. It means that creating high-quality content on your own domain is not a lost cause in B2B SaaS, unlike on the general web where earned media dominates almost entirely.
  2. Earned media is lower. 61% in B2B SaaS versus 82-85% on the general web. B2B SaaS keywords include feature-specific queries (“CRM email tracking”) and use-case queries (“employee scheduling app for restaurants”) where the vendor is the authoritative source. The general-web studies use broad informational queries where thousands of publications compete.
  3. The optimization gap is massive. Only 21% of B2B SaaS articles have expert quotes, and only 29% have 3+ statistics. The majority of B2B SaaS content has not been adapted for AI search. This gap represents an opportunity for teams willing to invest in credibility signals while most competitors have not.

5.2 Implications for B2B SaaS Content Strategy

Based on these findings, seven action items emerge:

  1. Add credibility signals to every article. The single highest-leverage change is adding statistics, expert quotes, and source citations. Only 21% of B2B SaaS articles include expert quotes; among AI-cited articles, that number reaches 52%. Only 29% have 3+ statistics; among AI-cited articles, 64% do. This is the only intervention with causal evidence behind it.
  2. Invest in earned media alongside brand content. AI engines cite earned media at 61% versus brand-owned at 29%. Getting mentioned in G2, industry publications, and analyst reports is now a search strategy, not just a PR strategy. But brand-owned content still captures over a quarter of AI citations, particularly for feature-specific and use-case queries where the vendor is the authoritative source.
  3. Maintain existing content rather than publishing new content. Article age in top positions (mean 23 months) suggests that updating existing articles is associated with higher performance than publishing new ones for both Google and AI visibility.
  4. Write at grade 9-10 readability for B2B SaaS. Not grade 6-8 (too simplified for a technical buyer) and not grade 11+ (lower AI citation rates). The sweet spot is accessible expert prose.
  5. Structure articles in 100-150 word sections. SE Ranking found this range optimal for AI Mode citations (+9% versus shorter sections). This maps to clearly labeled subsections that each make a complete, extractable point.
  6. Ensure review platform presence. Having profiles on G2, Capterra, and TrustRadius correlates with 2.1x higher AI citation rates in our data (SE Ranking found 3-3.5x on the general web). This is a low-cost action with potentially significant AI visibility upside.
  7. Treat Google and AI as separate channels. Only 14% of AI-cited URLs appear in Google’s top 20 (though 30% of Google articles do get AI citations), and cross-engine overlap ranges from 8% to 17%. A single SEO strategy will not cover AI visibility. Content teams need a parallel effort targeting earned media, AI-friendly formatting, and credibility signals.

These are the first B2B SaaS-specific content baselines, establishing a benchmark against which future content investments can be measured.

5.3 What We Could Not Test

Several features produced data we could not confidently analyze:

Content format. We classified articles by format (guide, listicle, comparison, case study, etc.) but the distribution was too skewed to draw conclusions. Comparison articles dominated comparison keywords, guides dominated question-format keywords, creating a confound between format and intent.

FAQ content (not schema). While we measured FAQ section presence (19% of all articles), the low prevalence made subgroup comparisons unreliable. FAQ sections were slightly more common in AI-cited articles (24% vs 17%) but the sample size within each keyword was too small for confident claims.

Internal linking. Animalz found the median SaaS blog has zero internal links per article, and our data confirmed this. With so little variance, there was nothing to compare.


6. Limitations

No domain authority control. We do not measure backlinks, Domain Rating, or domain traffic. Every comparison in this study is confounded by domain authority. Articles in Google’s top 5 likely rank there because of domain authority, not because they have more statistics. The credibility-signal gradient could be entirely explained by high-authority domains producing higher-quality content. Princeton’s causal findings remain the only clean evidence that credibility signals themselves drive AI citation.

Single-run AI queries. Each of the 10,382 keywords was queried once per AI engine. AI citation sources shift month to month (Profound found 40-60% monthly drift), so our snapshot does not capture every article that could be cited. Across 10,382 keywords, directional trends are reliable, but individual keyword-level citation data should be interpreted with caution.

Extraction accuracy. Per-field accuracy ranged from 85.1% to 100%. Two features (expert quote detection at 86.6% and content format classification at 85.1%) showed lower accuracy. Statistics extraction (90.2%) required careful tuning to distinguish data points from year references and other numerical content.

B2B SaaS keyword selection. Our 10,382 keywords cover 52 categories but do not equally represent all B2B SaaS verticals. The dataset skews toward marketing, sales, and project management software due to keyword volume distribution. Highly vertical categories (healthcare SaaS, fintech SaaS) are underrepresented.

Observational design. With the exception of Princeton’s experimental findings, everything in this study is correlational. We can describe what winning content looks like, but we cannot prove that making content look this way will cause it to win.


7. Conclusion

B2B SaaS articles that rank well on Google and get cited by AI engines share one trait: credibility signals. Only 21% of B2B SaaS articles include expert quotes and 29% include 3 or more statistics, but among AI-cited articles, those numbers jump to 52% and 64%. This gap is the clearest signal in the data: most B2B SaaS content lacks the features that both Google and AI reward, and the teams that close this gap first will have a measurable advantage.

Two findings set B2B SaaS apart from the general web. First, brand-owned content holds 29% of AI citations in B2B SaaS, more than three times the ~8% measured on the general web. Creating content on your own domain is not a lost cause in B2B SaaS, especially for feature-specific and use-case queries. Second, earned media accounts for 61% of AI citations, significantly below the 82-85% measured on the general web, because B2B SaaS keyword diversity favors vendor content more than generic queries do.

Google and AI draw from largely separate source pools: only 14% of AI-cited URLs overlap with Google’s top 20. B2B SaaS companies need two content strategies operating in parallel: one for Google (where domain authority and content maintenance dominate) and one for AI (where credibility signals and earned media dominate).


References

  1. Backlinko / Semrush. (2020). “We Analyzed 11.8 Million Google Search Results. Here’s What We Learned About SEO.” backlinko.com/search-engine-ranking
  2. Animalz. (2020-2021). “The Animalz Content Marketing Benchmark Report.” animalz.co/blog/benchmark-report-2021
  3. Fubel, F., Groll, N. M., Gundlach, P., Han, Q., & Kaiser, M. (2023). “Beyond Rankings: Exploring the Impact of SERP Features on Organic Click-through Rates.” Nova School of Business and Economics / Universitat Hamburg. arxiv.org/abs/2306.01785
  4. Liu, N., Zhang, T., & Liang, P. (2023). “Evaluating Verifiability in Generative Search Engines.” Stanford University. arxiv.org/abs/2304.09848
  5. Aggarwal, P., Murahari, V., et al. (2024). “GEO: Generative Engine Optimization.” Princeton University / Allen Institute for AI. KDD 2024. arxiv.org/abs/2311.09735
  6. Fishkin, R. (2024). “2024 Zero-Click Search Study.” SparkToro / Datos / Semrush. sparktoro.com
  7. Ahrefs. (2025). “Insights from 55.8 Million AI Overviews.” ahrefs.com/blog/insights-from-56-million-ai-overviews
  8. Ahrefs. (2025). “Top Brand Visibility Factors in ChatGPT, AI Mode, and AI Overviews (75k Brands Studied).” ahrefs.com/blog/ai-brand-visibility-correlations
  9. Ahrefs. (2025). “AI Search Traffic Conversions at Ahrefs.” ahrefs.com/blog/ai-search-traffic-conversions-ahrefs
  10. Ahrefs. (2025). “Only 12% of AI Cited URLs Rank in Google’s Top 10 for the Original Prompt.” ahrefs.com/blog/ai-search-overlap
  11. Ahrefs. (2025). “How Long Does It Take to Rank in Google? And How Old Are Top Ranking Pages?” ahrefs.com/blog/how-long-does-it-take-to-rank-in-google
  12. Pew Research Center. (2025). “Google users are less likely to click on links when an AI summary appears in the results.” pewresearch.org
  13. Profound / tryprofound.com. (2025-2026). “Profound AI Citation Research (multi-study corpus).” tryprofound.com/blog/research
  14. Seer Interactive. (2025-2026). “AIO Impact on Google CTR, v1-v3.” seerinteractive.com/insights/aio-impact-on-google-ctr-2026-update
  15. Semrush. (2026). “Does AI content rank well in search? [Survey + Data study].” semrush.com/blog/does-ai-content-rank-in-search-data-study
  16. Semrush. (2025-2026). “The Most-Cited Domains in AI” and “LinkedIn AI Visibility Study (89K URLs).” semrush.com/blog/most-cited-domains-ai
  17. Chen, M., Wang, X., Chen, K., & Koudas, N. (2025). “Generative Engine Optimization: How to Dominate AI Search.” University of Toronto. arxiv.org/abs/2509.08919
  18. Agarwal, S., & Sen, A. (2026). “Google AI Overviews and Publisher Traffic: Evidence from a Field Experiment.” Indian School of Business / Carnegie Mellon University. SSRN Working Paper. papers.ssrn.com
  19. SE Ranking. (2025). “How to Optimize for AI Mode” and “How to Optimize for ChatGPT.” seranking.com/blog/how-to-optimize-for-ai-mode

This study was conducted by the Citera research team. Citera is an organic growth platform for B2B SaaS companies. The data, methodology, and findings are described in full to allow independent evaluation.