How do I get cited by Claude?

Five actions: (1) allow ClaudeBot, Claude-User, and Claude-SearchBot in robots.txt, (2) add specific statistics with named sources, (3) include expert quotations with full attribution, (4) structure content with clear headings, FAQ blocks, and tables, and (5) implement Schema.org structured data. The Princeton GEO study found these signals boost AI visibility by 28–41%.

AI Platforms

Claude Search: How It References Sources & Optimization Tips

Q: How does Claude cite sources in its answers?

Claude references sources with inline citations that link to the original pages. Claude's web search capability uses the Claude-SearchBot crawler to build a fresh index, and Claude synthesizes answers from retrieved passages with inline references. Claude's 200K token context window allows it to process longer passages than most other AI engines.

Q: What crawlers does Anthropic use for Claude?

Anthropic operates three crawlers: ClaudeBot (general web crawling for training and indexing), Claude-User (fetches pages on-demand when a user includes a link in a prompt, with the user's IP visible), and Claude-SearchBot (dedicated to building the Claude web search index). All three must be explicitly allowed in robots.txt for full Claude search visibility.

Q: Is Claude-SearchBot different from ClaudeBot?

Yes. Claude-SearchBot is Anthropic's dedicated web search crawler, distinct from ClaudeBot (general crawling) and Claude-User (user-initiated fetches). Allowing ClaudeBot does not automatically allow Claude-SearchBot. For full Claude search visibility, all three must be explicitly allowed in robots.txt.

Claude uses three crawlers (ClaudeBot, Claude-User, Claude-SearchBot) and a 200K context window. Learn how to optimize for Claude's citations.

8 min read·Updated 2025-06-22

Claude by Anthropic is distinguished by its 200K token context window — significantly larger than most other AI engines. This allows Claude to ingest longer passages and synthesize more comprehensive answers, which creates a different GEO opportunity: longer, well-structured content can be quoted in depth rather than reduced to short snippets.

Anthropic operates three separate crawlers: ClaudeBot (general), Claude-User (user-initiated fetches), and Claude-SearchBot (web search index). Each must be explicitly allowed in robots.txt. Allowing one does not allow the others — a common configuration mistake that eliminates sites from Claude citations.

Claude at a glance: 200K token context window (much larger than most engines) · Three crawlers: ClaudeBot, Claude-User, Claude-SearchBot · Citation format: inline references with source links · AI search referral traffic grew 527% YoY in 2025 (Previsible) · Pages with statistics or citations get 30–40% higher visibility.

How Claude references sources

When Claude uses web search to answer a query, it retrieves passages from the Claude-SearchBot index, processes them within its 200K context window, and synthesizes an answer with inline citations linking back to the original pages. The large context window means Claude can quote longer, more detailed passages than engines with smaller contexts.

Claude's generation stage uses the same five citation factors as other RAG engines: factual density, source authority, information uniqueness, content structure, and semantic consistency. The difference is that Claude can hold more source material in context simultaneously, which tends to reward content that develops a topic thoroughly rather than stating a single fact.

"Claude's 200K context window changes what 'quotable' means. Other engines pull snippets; Claude can ingest and synthesize longer passages. For publishers, this rewards comprehensive content with clear section structure — not just short fact-dense paragraphs."
— Observed Claude product behavior 2025, contextualized by the Princeton GEO study (KDD 2024)

The three Claude crawlers you must allow

Anthropic operates three crawlers with distinct purposes. All three must be explicitly allowed for full Claude search visibility:

Crawler	Purpose	IP visibility
ClaudeBot	General web crawling for training and indexing	Anthropic IP
Claude-User	Fetches pages on-demand when a user includes a link in a prompt	User IP (not Anthropic)
Claude-SearchBot	Dedicated crawler for Claude web search index	Anthropic IP

Source: Anthropic platform documentation, observed 2025.

The minimum robots.txt configuration to be eligible for Claude search citations is:

User-agent: ClaudeBot
Allow: /

User-agent: Claude-User
Allow: /

User-agent: Claude-SearchBot
Allow: /

Many sites that block ClaudeBot for training concerns also accidentally block Claude-SearchBot, eliminating themselves from Claude web search citations. The three crawlers serve different purposes — you can selectively allow Claude-SearchBot and Claude-User while blocking ClaudeBot if you want citation visibility without contributing to model training.

Why the 200K context window changes GEO

Most AI search engines have context windows of 32K–128K tokens, which limits them to extracting short passages. Claude's 200K window can hold the equivalent of roughly 500 pages of text — meaning Claude can ingest long-form content in full and quote from any section.

For GEO, this has three implications:

▸ Long-form content is not penalized — Claude can quote from a 3,000-word guide as easily as a 300-word blog post.
▸ Internal cross-references work — Claude can synthesize across multiple sections of a single page, which rewards well-structured long-form content.
▸ Original research wins — detailed data sections, methodology explanations, and case studies can be quoted at length, not just summarized.

5-step optimization guide for Claude

Based on the Princeton GEO study (KDD 2024) and observed Claude behavior, these are the five highest-leverage optimizations:

1.
Allow ClaudeBot, Claude-User, and Claude-SearchBot in robots.txt
All three must be explicitly allowed. Verify with server logs that the crawlers are fetching your pages. Allowing ClaudeBot alone does not enable Claude search citations.
2.
Add specific statistics with named sources
Statistics addition boosts visibility by +33%. Claude's large context means it can ingest and verify detailed data sections — give it numbers with sources, not just claims.
3.
Include expert quotations with full attribution
Expert quotations give the largest lift at +41%. Use blockquotes, name the speaker, and identify the source. Claude extracts these as discrete, citable units.
4.
Develop topics comprehensively with clear section structure
Claude's 200K context rewards depth. Use H2/H3 headings, FAQ blocks, numbered steps, and tables to organize longer content. Each section becomes a potential extraction target.
5.
Implement Schema.org structured data
Article, FAQ, HowTo, and Organization schema help Claude parse your content. Use JSON-LD and validate with the Schema.org validator. FAQ schema is particularly effective for Claude because the Q&A format maps directly to user queries.

What Claude rewards and penalizes

▸ Expert quotations — +41% visibility (Princeton GEO study)
▸ Statistics with named sources — +33% visibility
▸ Fluent, well-structured prose — +29% visibility
▸ Cited external sources — +28% visibility
▸ Comprehensive long-form content — + (Claude-specific advantage from 200K context)
▸ Keyword stuffing — −8% visibility (harmful)
▸ Blocked Claude-SearchBot — −100% (eliminates Claude search citations)

Source: Aggarwal et al., "GEO: Generative Engine Optimization," arXiv:2311.09735, KDD 2024. Claude-specific behavior observed 2025.

Common Claude citation mistakes

▸ Allowing ClaudeBot but blocking Claude-SearchBot — the most common mistake. The three crawlers serve different purposes and must be configured separately.
▸ Short, snippet-style content — Claude's 200K context rewards depth. Very short pages offer less for Claude to synthesize from.
▸ Walls of text with no section breaks — even with a large context, Claude extracts clean units. Use headings, paragraphs, and lists to make extraction reliable.
▸ Vague claims without sources — Claude rarely cites "many experts believe." Replace with named experts and specific numbers.
▸ No Schema.org — without structured data, Claude has to infer content type, which reduces citation probability.

Frequently asked questions

How does Claude cite sources in its answers?

Claude uses inline citations linking to source pages, built from the Claude-SearchBot index. Claude's 200K token context window allows it to process and quote longer passages than most other AI engines.

What crawlers does Anthropic use for Claude?

Three crawlers: ClaudeBot (general crawling), Claude-User (user-initiated fetches, user IP visible), and Claude-SearchBot (web search index). All three must be explicitly allowed in robots.txt for full Claude search visibility.

What is Claude's context window and why does it matter for GEO?

Claude has a 200K token context window, much larger than most engines. This allows Claude to ingest longer passages and quote from longer content — rewarding comprehensive, well-structured long-form content.

Is Claude-SearchBot different from ClaudeBot?

Yes. Claude-SearchBot is dedicated to building the Claude web search index, distinct from ClaudeBot (general) and Claude-User (user-initiated). Allowing ClaudeBot does not automatically allow Claude-SearchBot.

References: Aggarwal, P., Dugan, L., et al. "GEO: Generative Engine Optimization." arXiv:2311.09735, KDD 2024. · Anthropic platform documentation: ClaudeBot, Claude-User, Claude-SearchBot. · Previsible 2025 AI Search Traffic Report. · Gartner Search Traffic Forecast 2026.

Want to check your site's GEO readiness?

Run the 27-point GEO audit