Why Schema.org Structured Data Matters for LLM Discoverability
Structured data — specifically JSON-LD using the schema.org vocabulary — is a method of embedding machine-readable facts directly in a webpage's HTML, allowing both search engines and large language models to identify, classify and verify information about the entity the page describes.
Structured data — specifically JSON-LD (JavaScript Object Notation for Linked Data) using the schema.org vocabulary — is a method of embedding machine-readable facts directly in a webpage's HTML, allowing both traditional search engines and large language models to identify, classify and verify information about the entity the page describes.
The gap between human-readable and machine-readable
Consider a typical company homepage. It might say: "We're a team of passionate experts helping businesses grow through innovative digital solutions based in Zurich."
A human reading this understands it is a consulting company in Zurich. A language model reading the raw HTML sees the same text — but without reliable structure to extract meaning from. Was "Zurich" the company's location, or mentioned in a case study? What kind of consulting? Who are the founders?
Structured data answers these questions in a format that machines understand without any inference required.
What JSON-LD looks like and where it lives
JSON-LD is embedded in the <head> of an HTML page inside a <script type="application/ld+json"> tag. It doesn't affect the visual appearance of the page — it's invisible to human readers and exclusively for machines.
A minimal Organization schema looks like:
{
"@context": "https://schema.org",
"@type": "Organization",
"name": "Meridian",
"description": "GEO agency for AI-era search",
"url": "https://meridianai.ch",
"foundingDate": "2026",
"address": {
"@type": "PostalAddress",
"addressLocality": "Zurich",
"addressCountry": "CH"
}
}This block gives a machine unambiguous answers to: what type of thing is this? What is it called? What does it do? Where is it? When was it founded?
How LLMs use structured data
LLMs interact with structured data in two ways:
During training — Documents containing schema markup can have their structured facts extracted and weighted more reliably than facts buried in prose, because the schema format is unambiguous. A foundingDate property is always a date, always belonging to the entity being described.
During retrieval (RAG) — When a retrieval-augmented system fetches a page, it can extract JSON-LD blocks directly. This gives the model an instant, authoritative summary of the page's entity without needing to parse prose. Retrieval systems that support structured data cite brands more accurately when that structured data is present.
The most important schema types for GEO
Organization / ProfessionalService
The foundational type. Establishes your company's name, URL, description, location, founding date and personnel. The sameAs property — linking to your LinkedIn, X/Twitter and other profiles — is particularly valuable for entity disambiguation.
FAQPage
A list of question-and-answer pairs. This is one of the highest-impact schema types for GEO because the Q&A format is exactly how RAG systems extract quotable content. An FAQ section with proper schema gives models pre-structured content they can cite verbatim.
Service
Describes a specific service offered by an organization, including its name, description and provider. Useful for making individual service pages clearly machine-readable.
Article / BlogPosting
For blog content and editorial pages. Establishes the author, publication date and the main entity the article is about. Helps models understand that a piece of content is informational rather than transactional, and attribute authorship correctly.
Person
For team members and founders. Linking Person schema to your Organization schema via the founder or employee properties helps models build a clear picture of who is behind the brand.
Common structured data mistakes in GEO
- Using incorrect @type values — Using
Organizationfor a page that is actually aBlogPostingcreates conflicting signals - Omitting sameAs links — These cross-references are how models verify entity identity; skipping them leaves the entity ambiguous
- Structured data that contradicts page content — If your JSON-LD says one thing and your visible text says another, models flag the inconsistency
- Forgetting FAQPage schema — The single most underused high-impact schema type for GEO purposes
- No structured data at all — The most common mistake, and the one with the most consistent improvement impact when fixed
Frequently asked questions
Does structured data still matter for SEO as well as GEO?
Yes. For traditional search, JSON-LD schema unlocks rich results in Google: FAQ dropdowns, organization knowledge panels, star ratings, article rich snippets. These are separate from GEO benefits but provide additional value from the same implementation effort.
Do I need to hire a developer to add structured data?
For JSON-LD, basic implementation involves adding a <script> tag to your page's HTML. In modern CMS platforms like WordPress, Webflow or Squarespace, this can often be done via a custom code field without developer involvement. For complex implementations involving multiple schema types, a developer familiar with schema.org is helpful.
Can I use microdata or RDFa instead of JSON-LD?
All three formats are technically supported by schema.org, but JSON-LD is the format recommended by Google and most widely understood by AI systems. Use JSON-LD unless you have a specific technical reason not to.
How do I validate my structured data?
Google's Rich Results Test validates JSON-LD against known schema types and flags errors. Schema.org's own validator covers a broader range of types. Run both before publishing.