This Week in Brief
AI search fragmentation is accelerating: Google AI Mode now claims over 1 billion daily queries processed and 100 million monthly active users, while Perplexity's pre-built index architecture means conventional real-time SEO assumptions do not apply. Research from Princeton/Allen Institute reinforces that semantic completeness — not Google rank — is the primary predictor of AI citation, with a 0.87 correlation coefficient. Practitioners who have not audited their content against retrieval-oriented signals are already losing ground on multiple citation surfaces simultaneously.
AI Lab Signals
Google AI Mode Surpasses 100 Million Monthly Active Users, Processes Over 1 Billion Daily Queries
Google AI Mode, launched May 2025, has reached 75 million daily users and 100 million monthly active users as of February 2026, processing over 1 billion queries per day. Unlike Google AI Overviews, AI Mode is a distinct ranking surface that blends entity recognition, natural language understanding, and multimodal signals to generate synthesised answers with citations. Practitioners targeting B2B high-intent queries must treat AI Mode as a separate optimisation channel from both traditional organic search and AI Overviews.
Anthropic's ClaudeBot Selects Sources Based on Entity Authority and Structured Data Signals
According to practitioner documentation, Claude's web retrieval pipeline weights entity authority, factual accuracy, structured data, and content clarity when selecting cited sources — it does not replicate a traditional search ranking. ClaudeBot can be configured via robots.txt directives, and llms.txt deployment is documented as a signal for AI crawler access permissions. Practitioners optimising for Anthropic's platform should prioritise entity disambiguation and structured markup ahead of keyword density.
Training Data & Crawl
Perplexity Runs Its Own Crawler and Pre-Built Index — Real-Time Fetch Assumptions Are Wrong
An analysis of 818 Perplexity citations across 19,556 queries (Lee, 2026a — cited within source, details unverified independently) finds that Perplexity does not perform live web fetches at query time; it maintains its own crawler (PerplexityBot), its own index, and its own ranking signals independent of Google or Bing. This architectural distinction means that if PerplexityBot has not crawled and indexed your content before a query is issued, your page cannot appear as a citation regardless of its Google rank. Practitioners should audit PerplexityBot crawl access in server logs and ensure robots.txt does not inadvertently block it.
AI Search & ASO
AI Overviews Appear on 15–30% of Google Searches; Perplexity Exceeds 500 Million Monthly Queries
As of early 2026, Google AI Overviews appear on approximately 15–20% of searches (this source) — a separate BrightEdge figure cited elsewhere puts the figure at ~30%, and the two should be treated as a range rather than a confirmed single data point. Perplexity processes over 500 million queries per month and ChatGPT Search reaches over 100 million monthly users. The three platforms reward materially different content signals: Google AI Overviews favour pages already indexed in Google's core index; Perplexity favours freshness and pre-indexed structured content; ChatGPT Search performs closer to live retrieval. Brands cannot rely on a single optimisation approach across all three surfaces.
Google Rank Does Not Predict AI Citation — Query Intent Does, Per 19,556-Query Analysis
An internal analysis of 19,556 queries across 8 industry verticals and 479 crawled pages (methodology details not independently verified) concludes that traditional Google ranking position is not a reliable predictor of AI citation. Query intent alignment and semantic completeness are identified as primary citation drivers, consistent with the Princeton/Allen Institute finding of a 0.87 correlation between semantic completeness and AI citations. Practitioners running citation-tracking should segment performance by intent category, not page rank position.
Research Radar (arXiv)
GraphRAG: Combining Vector Search with Knowledge Graphs for Relationship-Aware Retrieval
Standard RAG retrieves semantically similar text chunks but cannot resolve dependency chains between concepts — a known failure mode when queries involve relational reasoning (e.g., 'how does component A affect component B'). GraphRAG addresses this by combining vector similarity search with knowledge graph traversal, enabling topology-aware retrieval that surfaces provenance and entity relationships standard embeddings miss. For GEO practitioners, this signals that AI platforms using GraphRAG-style retrieval will increasingly favour content with explicit entity relationships and structured provenance — reinforcing the case for knowledge graph markup and well-linked entity architecture.
Building a Semantic Research Assistant: A Production RAG Pipeline Over 120 arXiv Papers
A practitioner benchmark comparing three transformer embedding models against BM25 across 120 arXiv papers reports a 7.7× answer quality gap in favour of transformer-based semantic retrieval over keyword matching. The finding (Pre-publication / unreviewed) reinforces that retrieval pipelines powering AI answer engines are weighting semantic intent over keyword presence — content structured around conceptual completeness will outperform keyword-optimised content in retrieval scoring. ASO practitioners should note this as further evidence against keyword-stuffing approaches in AI-targeted content.
Practitioner Takeaway
Audit PerplexityBot access in your server logs this week. Confirm the crawler is not blocked by robots.txt or rate-limiting rules, then verify at least your top 20 revenue-relevant pages are present in Perplexity's index by querying them directly in the platform. Perplexity's pre-built index architecture means no crawl equals no citation — regardless of your Google ranking position or content quality.
The 6-phase framework used to structure this newsletter is available as a complete methodology guide — including audit tools, templates, and implementation checklists.
Get the Framework — $299New to AI knowledge publication? Download the free briefing flyer — the data case for why your organisation cannot wait.