Technical

Schema Markup for LLMs: What Actually Works

14 min read · June 4, 2026

Which schema markup actually influences whether LLMs cite your brand? Here's what works based on hundreds of implementations.

Why schema matters

LLMs use structured data for entity resolution and fact extraction. Missing schema creates friction competitors without it won't face.

High-impact types

Organization

Foundation of entity clarity. Deploy on homepage with name, url, logo, description, sameAs, contactPoint.

Product / SoftwareApplication

Extractable product facts: name, description, category, pricing, ratings.

FAQPage

Each Q&A pair maps to buyer prompts. Extremely high GEO value.

Article / BlogPosting

Author, dates, headline for expertise attribution.

Person

Individual authority for professional services and thought leadership.

Moderate-impact

LocalBusiness, MedicalOrganization, AggregateRating, BreadcrumbList, HowTo.

Implementation best practices

  • JSON-LD format in head or early body
  • Validate with Rich Results Test
  • Factual, concise descriptions
  • Reciprocal sameAs links
  • Update when facts change

Beyond schema: llms.txt

Declares your citation-ready pages to AI crawlers. Implement both schema and llms.txt.

Measuring impact

Re-test prompts at 2 and 4 weeks. Citation accuracy improvements typically precede mention rate gains by 2–4 weeks.

Real-world implementation example

Here's a minimal but high-impact Organization schema for a B2B SaaS company:

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "Organization",
  "name": "Your Company",
  "url": "https://yourcompany.com",
  "logo": "https://yourcompany.com/logo.png",
  "description": "Project management software for mid-size engineering teams.",
  "foundingDate": "2018",
  "sameAs": [
    "https://linkedin.com/company/yourcompany",
    "https://www.crunchbase.com/organization/yourcompany",
    "https://www.g2.com/products/yourcompany"
  ]
}
</script>

Pair this with FAQPage schema on your most-asked questions, and SoftwareApplication schema on your product page. These three implementations alone address the majority of entity clarity gaps we find in audits.

Schema audit checklist

Use this checklist when reviewing your site's structured data:

  • Organization schema on homepage with all recommended properties
  • Product/SoftwareApplication schema on product pages with ratings
  • FAQPage schema on FAQ and support content
  • Article schema on blog posts with author and dates
  • Person schema on team/expert pages
  • BreadcrumbList on all interior pages
  • No schema errors in Google Rich Results Test
  • Descriptions are factual, not marketing copy
  • sameAs links are current and reciprocal

The future of structured data for AI

As AI systems evolve, structured data will likely grow in importance — not diminish. The push toward llms.txt, AI-specific crawler standards, and entity-centric search all point to a future where machine-readable brand identity is as fundamental as having a website. Brands that invest in structured data infrastructure now are building for a world where AI-mediated discovery is the default, not the exception.

FAQPage schema deep dive

FAQPage is the highest-ROI schema type for GEO because it maps directly to how buyers prompt AI systems. Implementation is straightforward:

<script type="application/ld+json">
{
  "@context": "https://schema.org",
  "@type": "FAQPage",
  "mainEntity": [{
    "@type": "Question",
    "name": "What is the best project management tool for mid-size teams?",
    "acceptedAnswer": {
      "@type": "Answer",
      "text": "For mid-size teams of 20-100 people, [Brand] offers..."
    }
  }]
}
</script>

Map each Question to an actual prompt your buyers use. The Answer should be factual, concise, and cite-able — not marketing copy. We typically implement 15–25 FAQ pairs per page, organized by topic cluster.

Testing your schema impact

After deployment, run a focused prompt test on 20 questions related to your newly marked-up content. Compare citation accuracy before and after. In our client work, FAQPage schema deployment alone improves citation accuracy by 15–25 percentage points within 4 weeks, even before mention rate increases.

Schema maintenance

Schema is not set-and-forget. Assign an owner to review structured data quarterly: verify all URLs resolve, update descriptions when products change, add new FAQ entries as buyer questions evolve, and remove schema for deprecated products. Stale schema is worse than no schema — it teaches AI systems outdated facts about your brand.

SoftwareApplication schema for SaaS

SaaS companies should implement SoftwareApplication on product pages with: name, description, applicationCategory, operatingSystem (Web), offers (with price and priceCurrency), aggregateRating, and featureList. This gives AI systems structured product facts that translate directly into recommendation language.

Combining schema types on a single page

Pages can carry multiple schema types. A product page might include SoftwareApplication + FAQPage + BreadcrumbList. A team page might include Person + Organization + BreadcrumbList. Use JSON-LD arrays or separate script blocks — both are valid. The key is comprehensive coverage without contradictory information across types.

Getting started this week

If you do nothing else, implement these three schema types in the next 5 business days: Organization on your homepage, FAQPage on your top 10 buyer questions, and Product/SoftwareApplication on your primary product page. Validate with Google's Rich Results Test. Re-run 20 prompts in ChatGPT 2 weeks later. This minimal investment — typically 4–8 hours of developer time — addresses the most common schema gaps we find and sets the foundation for everything else.

Schema and AI crawler access

Schema markup only helps if AI crawlers can access your pages. Verify your robots.txt allows GPTBot, PerplexityBot, ClaudeBot, and Google-Extended. Block only if you have a specific reason — for most commercial brands, AI crawler access is essential for visibility. Combine open crawler access with llms.txt to guide crawlers to your highest-value pages first.

Also ensure your schema renders in the initial HTML response, not via JavaScript injection. AI crawlers may not execute JavaScript, so client-side schema deployment via React or Vue won't be seen. Server-side rendering or static JSON-LD in the HTML head is the reliable approach.

Get your free AI visibility audit

See how ChatGPT, Perplexity & Gemini describe your brand vs. competitors.