Structured data used to be an SEO concern. You added it so Google could render star ratings next to your listing or show your FAQ as a rich result. That era is over. In 2026, the bigger question is whether Claude, ChatGPT, Perplexity, and Google’s AI Overviews can correctly identify what your business is, who runs it, and what you sell. Structured data is now the most direct input you give an AI model about your entity, and it shows up in the answer every time someone asks about you.
The shift is not subtle. A year ago, your blog post’s SEO score depended on title tags, headings, and backlinks. Today, your AI visibility depends on whether a language model can parse your page, extract a clean set of facts, and cite you as the source. Schema markup is the bridge between your website and that extraction process. Skipping it is like submitting a resume without a name at the top.
What AI search engines actually do with structured data
When Perplexity or ChatGPT Search fetches your page, it does two things in parallel. It reads the visible HTML body, and it parses any JSON-LD schema in the head or inline. The schema is machine-ready, which means the model does not have to guess at structure. If you tell it your page is an Article with an author named Sarah Chen and a publish date of March 4, 2026, the model logs those facts directly. If you leave the schema out, the model extracts the same facts from the visible content, but with lower confidence.
That confidence gap matters when the model has to choose between you and a competitor. An answer about best project management software for construction firms might cite Procore, Autodesk, and three smaller vendors. The vendors with clean structured data get quoted in full. The vendors without it get a single-line mention or nothing at all.
A study from Profound in late 2025 tracked 400 B2B software pages and found that pages with complete Organization, Product, and Article schema appeared in ChatGPT Search citations 3.4 times more often than pages with only basic Open Graph tags. The gap for Perplexity was even wider at 4.1x. These are not edge cases. They are the new baseline.
The schema types that matter most in 2026
Most businesses need four or five schema types, not fifty. The trick is choosing the right ones and filling them in completely rather than scattering half-finished markup across the site.
Organization schema is the foundation. It tells a model your company name, logo, founding date, address, founder, social profiles, and parent or subsidiary relationships. This is the schema that anchors your entity in an AI model’s internal knowledge graph. Without it, the model has to assemble your identity from scraps of visible copy, press mentions, and social profiles, and it often gets the shape wrong.
Person schema does the same work for individuals. If you are a consultant, executive, author, or public speaker, Person schema on your about page and author bylines connects your name to your credentials, affiliations, and body of work. Claude and ChatGPT both weight this schema heavily when answering questions about individual experts.
Article schema marks up your blog posts, guides, and editorial content. The core fields are headline, author, datePublished, dateModified, publisher, and mainEntityOfPage. Add an about field that lists the topics the article covers, and the model can find and quote your article when it matches a query. Skip the schema and the article gets ranked by pure relevance, which is a lottery.
FAQPage schema remains one of the highest-leverage markup types for AI search. When you mark up a list of questions and answers with FAQPage schema, you are writing directly into the training data format that ChatGPT and Claude were built on. The answer field of a FAQPage node often shows up verbatim in AI responses. This is the cleanest pipeline from your content to a quoted citation that exists.
Product schema carries the commercial weight. For ecommerce, SaaS, and service businesses, Product schema captures the name, description, SKU, brand, offers, price, availability, rating, and review count. AI shopping queries like find me a CRM for solopreneurs under $30 per month cannot answer without Product schema on enough pages in the market.
LocalBusiness schema is essential for any business with a physical location. It carries address, phone, opening hours, geo coordinates, and accepted payment methods. Google’s AI Overviews and the “near me” variants of ChatGPT Search both rely on LocalBusiness schema when they recommend a specific shop or office.
How to implement schema without breaking things
The biggest mistake with structured data is treating it as a checklist. You add every schema type, fill in the minimum required fields, and move on. The model reads it, notices the thin data, and assigns a low confidence score. A page with two complete schema types beats a page with six skeletal ones.
Start with JSON-LD in the head. Forget Microdata and RDFa. Google, Perplexity, and every major AI search engine preferred JSON-LD by 2023, and by 2026 it is the only format you should ship. Put your schema inside a script tag with type application/ld+json. Do not mix it into your HTML body markup.
Use Schema.org as the source of truth, not a tutorial from three years ago. The Schema.org vocabulary has grown. Properties like sameAs, knowsAbout, and hasOfferCatalog have become load-bearing for AI extraction. The Schema.org docs are dense but current. Read them.
Fill every field you can truthfully fill. If your Organization has a real founder, include founder with a Person object. If you have social profiles, include all of them in sameAs. If you publish articles, include publisher with a link back to the Organization node. A model reading fully-populated schema has enough structure to build a clean entity card. A model reading skeletal schema builds a fuzzy one.
Reference nodes across pages. If your homepage has an Organization schema with a stable @id, every article, product, and author page can reference that same @id in their publisher or brand field. The model stitches the nodes together into a graph. This is how Wikipedia and Wikidata accumulate authority, and AI models use the same pattern.
Validate before you ship. Run the Rich Results Test and the Schema Markup Validator on every page before it goes live. A trailing comma or a typo’d property name will invalidate the entire block, and the model ignores invalid JSON. Schema errors are silent failures. Nothing tells you the schema was ignored unless you check.
The connection between structured data and AI citations
Perplexity publishes a clear statement in its developer documentation. The model uses structured data to verify facts when multiple sources conflict. That single sentence has enormous implications. If ten sources say your product costs $79 per month and five of them have Product schema that matches, the model treats the schema sources as authoritative. The other five get deprioritized.
This plays out in every pricing query, every founder lookup, every competitive comparison. Structured data is the fastest way to become the authoritative source. The model does not need to trust you more than your competitor. It needs to see that your facts are machine-readable and consistent across your schema, your visible content, and your third-party mentions.
ChatGPT’s retrieval layer behaves similarly. When it fetches a page, the first pass parses JSON-LD. Properties found in schema are tagged with higher trust than properties extracted from prose. A price in a Product offer field is more trusted than a price buried in a paragraph. An author in an Article byline field is more trusted than a name in the footer.
Claude, when used with web access, is more cautious about schema, but still uses it for entity disambiguation. If your name matches five other people on the internet, the schema that connects you to a specific company, city, or publication is what decides which one the model picks.
Common implementation mistakes
Inconsistent data between visible and hidden content is the fastest way to lose trust. If your page lists the price as $49 per month and the Product schema says $79, AI models flag the page as unreliable. The same applies to author names, founding dates, and business addresses. The two layers must match.
Stale dateModified fields cause silent ranking drops. Many sites set dateModified once at publish and forget it. AI models track freshness, and a four-year-old dateModified signals that the page is outdated. Update the field when the content actually changes. If the content has not changed, leave it alone.
Duplicate or conflicting schema across templates is a common problem in WordPress and Shopify sites. A plugin adds its own schema, the theme adds different schema, and a SEO tool adds a third layer. The model sees three Organization nodes and cannot tell which is canonical. Audit the rendered HTML, not just the plugin settings. The source of truth is what the page actually outputs.
Missing image fields in Article and Product schema cost citations. AI models use images in summary cards, shopping comparisons, and knowledge panels. A page without a clean Article.image or Product.image property loses visual representation in AI answers. The image should be a direct URL to a high-resolution JPG or PNG, not a placeholder or a tracking pixel.
Overuse of the “Thing” catch-all schema type is the refuge of plugin developers who do not want to think hard. Thing tells the model nothing specific. Use the most specific type available. A dentist is a Dentist, not a LocalBusiness or a Thing. A podcast episode is a PodcastEpisode, not a CreativeWork.
How to think about schema for emerging AI platforms
The AI search space will keep splitting. Perplexity, ChatGPT, Claude, Google AI Overviews, Gemini, and specialized verticals like You.com and Arc Search are not identical. Each has its own retrieval stack, its own parsing quirks, and its own trust signals. Writing schema for each platform individually is a losing battle.
The winning move is to write schema for Schema.org and trust the platforms to converge on the standard. Schema.org is governed by Google, Microsoft, Yahoo, and Yandex, which covers most of the web. Every major AI search engine parses it. Chasing platform-specific markup is like optimizing your email for Outlook 2007.
The second move is to treat schema as living infrastructure. New types are added every quarter. The MediaObject family expanded in 2025 to handle interactive demos and AI-generated content. The HowTo schema got a new isRecipe extension. Keep up with the Schema.org releases. Run an annual audit of your schema to confirm you are using current types and properties, not ones deprecated two years ago.
A practical starting point for a small business
If you run a small business and want to ship schema in a week, the priority order is simple. Implement Organization schema on your homepage with all the fields you can truthfully fill. Add Person schema to every team member’s profile page. Add Article schema to every blog post. Add FAQPage schema to your most-trafficked informational pages. Add Product or Service schema to every offer page. Add LocalBusiness schema to your contact page if you have a physical location.
That single week of work will change how AI models see your site. Queries that previously returned zero citations will start returning yours. Queries that returned thin mentions will return quoted paragraphs. The ranking shift is measurable within 30 to 60 days for most businesses.
Schema is not a silver bullet. If your content is bad, your structured data will mark up bad content. If your business has no differentiation, the schema will help the model articulate that lack of differentiation. But if you have something worth saying, structured data is the most direct channel you have to make sure an AI model hears it, parses it, and repeats it back to the people asking.