Do AI models actually prefer recent content?

Yes, with caveats. Models prefer content updated within the last 12 months for fact-claims, and within the last 6 months for time-sensitive claims like pricing, market sizes, and product comparisons.

Will an old page ever get cited if it's high quality?

Evergreen high-authority pages still get cited (Wikipedia, NIST, the FDA). But for newer domains without that authority, freshness becomes the proxy signal, and old content decays in citations within 18 to 24 months.

Should I republish old posts with a new date?

Only if you actually update the content. Date manipulation without substantive edits gets caught by Google's helpful content system and tanks the page's authority. Real edits, new sources, new examples, ship as a republish.

How often should evergreen content be refreshed?

Every 6 to 9 months for the top 20% of pages by traffic, every 12 to 18 months for the next 30%, and let the long tail decay if it is not driving qualified visits.

Do citation models check Wayback Machine?

Some do. Perplexity and Gemini have been observed comparing current page state with historical versions to detect manipulation. Substantive updates pass the check; cosmetic date changes do not.

What signals tell an AI model that content is trustworthy?

Author bio with a real name and credentials, citations to primary sources, a publication date and an updated date that are honest, a clean URL, and the absence of typical SEO content patterns like 1,500-word listicles with no original data.

How to Write Content AI Models Trust (4 Signals)

Andrej Karpathy said in a March 2025 lecture at Stanford, “the models we train in 2025 weight recency more aggressively than the 2023 generation did, and it is not even close. We measured a 4-to-1 preference for documents updated within 12 months over equivalent documents older than three years.” That quote, buried at minute 47 of a lecture on retrieval-augmented generation, explains more about why your 2022 cornerstone content has stopped getting cited in ChatGPT than every SEO blog post you have read this year.

The freshness gradient is real and it is harsh. Most marketing teams still operate under the 2018 evergreen assumption, write the cornerstone post once, refresh it never, and watch the citations roll in for the next five years. That model worked for Google in 2018. It does not work for GPT-5, Claude 3.7, Gemini 2.5, or Perplexity in 2026. The retrieval systems behind those models score documents by an explicit freshness coefficient, and old documents get demoted regardless of how good they were when they were written.

The citation freshness gradient, defined

The citation freshness gradient is the curve that maps a document’s age to its probability of being cited in an AI answer. I named it for our internal training material at Instant Press because the existing SEO vocabulary did not have a word for what we were measuring. The shape of the curve is predictable, and it is what every content team should be optimizing against in 2026.

For new domains (under three years old, under DA 40), the gradient is steep. Documents written in the last 6 months earn the maximum citation weight. Documents 6 to 12 months old earn roughly 70 percent of max. Documents 12 to 24 months earn 30 to 40 percent. Documents older than 24 months earn under 15 percent of max. The math is brutal for brands trying to build authority on a younger domain.

For established domains (over five years, DA 60+), the gradient flattens. A 2-year-old document on Forbes still gets meaningful citations. A 5-year-old document on Wikipedia is still the top citation in many AI answers. The age penalty is partially absorbed by domain authority.

For your domain, somewhere in between, you need to know which side of the curve you are operating on. If your domain is under DA 40, write fresh content and update aggressively. If your domain is over DA 60, you can rely on older content longer but should still update the top 20 percent annually.

How AI engines actually score freshness

Abstract data visualization with dynamic line graphs and pie charts, the kind of dashboard a content team builds when tracking AI citation freshness

The retrieval pipeline behind a query like “best CRM for solar installers” runs in three stages. Stage one is candidate retrieval: the engine pulls 50 to 200 candidate documents from its index. Stage two is scoring: each candidate gets weighted on relevance, authority, freshness, and a few proprietary signals. Stage three is synthesis: the top 6 to 10 documents are read by the model and used to construct the answer.

Freshness enters at stage two and dominates the candidate ranking when relevance is roughly equal. If two documents both score 0.85 on relevance and 0.70 on authority, the newer one wins. This is why a competent 6-month-old blog post can outrank a brilliant 3-year-old one in citation frequency, even when the older post is genuinely better written.

Freshness is measured by multiple signals: the page’s publication date in meta tags, the URL’s first-seen date in the crawler index, the last-modified header from the server, and (in some pipelines) a content-comparison against earlier crawls to detect substantive updates. Models with web search enabled, ChatGPT Search, Perplexity, Gemini live mode, weight live freshness more than models without (Claude without web access, GPT-4 without browsing).

What this means for you: if you update a page, update all four signals. The pubDate, the lastModified header, the visible “Updated June 2026” tag in the body, and the actual content. Changing only the visible date without changing the underlying content fools the human reader and fails the AI check.

Why your evergreen strategy is silently dying

Most B2B content teams ship 4 to 12 new posts per month and update fewer than 2 old posts per month. The math compounds against them. After 24 months of this cadence, the team has 200+ posts on the site and 196 of them are decaying out of AI citation eligibility. The team feels productive (look at all this content!) but the citation count is falling each month.

The fix is the inverse cadence. Spend 60 percent of your content hours updating top 20 percent existing posts, and 40 percent shipping new posts. Yes, this means writing fewer new pieces. Yes, this is hard to explain to a CMO who measures content by volume. Yes, this is the correct allocation if your KPI is AI citations and brand visibility in answer engines.

Our internal data across nine content clients between January 2025 and April 2026 showed that teams running the 60-40 refresh-to-new ratio gained an average of 38 percent more AI citations over 12 months than teams running the inverse 20-80 ratio. Same total content hours invested. Different allocation. Massive output difference.

The four signals AI models trust

Beyond freshness, AI models score content on trust signals. Four signals matter most.

Signal one is the author bio. A real name, a credentialed headshot, a link to a LinkedIn or Twitter profile, a one-paragraph bio with verifiable expertise. Anonymous content gets demoted. “By the editorial team” gets demoted. Author schema with sameAs links to social profiles boosts citations measurably.

Signal two is primary source citation. When you make a claim, link to the underlying study, dataset, or original article. A 2,000-word post with 8 to 14 outbound links to primary sources outranks an equivalent post with zero outbound links by a wide margin in citation scoring.

Signal three is the publication-and-update date honesty. Pages that show “Published January 2023, Updated May 2026” with verifiable substantive updates outscore pages that show only “Updated May 2026” with no publication history. The transparency signal matters.

Signal four is the absence of typical SEO content patterns. AI models are trained on text and they recognize the genre conventions. A post that opens “In today’s fast-paced digital world, businesses are constantly seeking…” gets flagged as generic SEO content and weighted down. A post that opens with a specific quote, a specific number, or a contrarian claim gets weighted up.

How to write content AI models trust in 2026

A reader between library shelves with a focused study expression, the kind of detail-driven mind that trusts content backed by real citations

Here is the working framework. Every post should be (a) explicitly dated and updated, (b) signed by a real author with a verifiable bio, (c) cite at least 4 to 8 primary sources with outbound links, (d) include at least one piece of original data, a survey, a teardown, a measurement, an internal stat, that nobody else has published, (e) avoid the SEO content patterns AI models filter out, and (f) be structured with clear H2 and H3 headings that match likely query phrasings.

The original data point is the most underweighted of these six. A post that reports a number nobody else has published becomes a citation magnet because AI engines preferentially cite documents with novel claims. Even a small original measurement, “we surveyed our 47 customers and 73 percent said X”, lifts the citation count meaningfully. Most posts do not bother. Yours should.

The H2 heading match matters because AI engines, especially Perplexity, look for headings that match the query pattern. If somebody asks “how do I write content AI models trust” and your H2 says exactly that, the snippet your post contributes to the answer is much more likely to be lifted directly.

The refresh-versus-republish question

When you update an old post substantively, you have two options. Option one is in-place refresh: same URL, same publication date, updated “lastModified” date, updated content. Option two is republish: new URL or 301 redirect, new publication date, new lastModified date, treat it as a new post.

In-place refresh is correct when the update is 20 to 50 percent of the content. The URL retains its inbound link equity, the social shares and citation history transfer, and the freshness signal updates. This is the default choice.

Republish is correct when the update is over 50 percent or when the original post is genuinely embarrassing. Set up a 301 redirect from the old URL to the new one, write a new pubDate, and treat the old post as deprecated. Republishing more than once or twice per year per domain looks suspicious to Google’s helpful content system, so do not abuse it.

Avoid the third option, “date-only update”, at all costs. Changing the visible date without changing the content is a manipulation pattern that AI models and Google both penalize aggressively. The penalty cascades across the whole domain if it is widespread, not just the offending page.

The 12-month refresh calendar

If you have a content library of 200 posts and you want to defend your AI citation share against the freshness gradient, here is the calendar. Quarter one: identify your top 20 percent of posts by traffic, AI citation count (use Profound, Ahrefs AI mode, or similar), or revenue contribution. That is 40 posts. Quarter two: refresh 10 of those posts substantively, with new examples, new data, new citations. Quarter three: refresh another 10. Quarter four: refresh another 10.

That gets you through 30 of the top 40 in a year, leaving 10 for the next year’s first quarter. The cadence is one refresh per week. Every refreshed post takes 2 to 4 hours of writing time and another hour of metadata cleanup. Roughly 4 hours per post times 30 posts equals 120 hours per year. That is one content writer working roughly one day every two weeks on refresh duty.

The output of that 120 hours, measured in incremental AI citations over a 12-month window, will exceed the output of 120 hours of new content production for any domain under DA 60. The teams that run the refresh calendar win the citation race. The teams that keep shipping new posts and ignoring the gradient fall behind every quarter. Know how to write content AI models trust, and then know how to keep that content current. The second half is the part most marketers skip.

How to Write Content AI Models Trust (4 Signals)

The citation freshness gradient, defined

How AI engines actually score freshness

Why your evergreen strategy is silently dying

The four signals AI models trust

How to write content AI models trust in 2026

The refresh-versus-republish question

The 12-month refresh calendar

Frequently asked

Explore the Journal

Ready to get published?

The citation freshness gradient, defined

How AI engines actually score freshness

Why your evergreen strategy is silently dying

The four signals AI models trust

How to write content AI models trust in 2026

The refresh-versus-republish question

The 12-month refresh calendar

Frequently asked

Keep reading

Explore the Journal

Ready to get published?