Generative Engine Optimization

Key takeaways

46% of high school seniors now use AI tools in their college search, and 18% have already removed a school from consideration based on an AI-generated answer (EAB, 2026).
The discovery surface shrank from ten blue links to two-to-seven cited sources inside a synthesized answer.
Generative engine optimization (GEO) is the practice of structuring content so AI engines cite, quote, and recommend your institution — citation, not ranking, is now the competition.
GEO sits on top of SEO: 76% of AI Overview citations come from pages already ranking in Google’s top 10 (Profound, 2025).
Perplexity cites .edu domains only 3.2% of the time; Wikipedia accounts for 47.9% of ChatGPT’s top citations (Search Engine Journal, 2024; Profound, 2025).
R1s outside the elite tier are structurally under-cited and exposed at the worst possible moment.
The institutions that move on findability over the next twelve months will define the next decade.

In this post

A high school senior opens ChatGPT at 11 p.m. and asks for the best psychology programs in the Pacific Northwest. She reads five names. She picks three to research. She closes the app. Your university is not on the list. This is the new front door, and it opens onto a discovery surface that collapsed from ten blue links to a handful of citations inside a synthesized answer. As of early 2026, 46% of high schoolers use AI in their college search and 18% have already removed a school based on an AI answer (EAB, 2026). The demographic cliff and this search shift are arriving at the same moment. The rest of this post explains what changed, what GEO actually is, and where R1s are losing ground.

§ 01The great decoupling

Search volume kept climbing while clicks cratered. Roughly 60% of searches now end without a click to a destination website (Frankel Agency, 2025). For higher ed, that is a strategic emergency, not a metrics quibble. The channel that drove the top of the enrollment funnel for fifteen years has fractured into something fundamentally different.

The pattern is consistent. Students ask the same questions they always asked. Which schools have the best program in their field. Which campus culture fits them. What financial aid looks like. The questions are unchanged. The answers now arrive synthesized, with two to seven cited sources surfaced inside a single response. If your institution is not in those citations, you are not in the consideration set. Traditional analytics will not show this. The traffic you lost never reached you in the first place.

A second pattern is sharper. AI referral traffic is currently small — around 1.08% of total site traffic — but converts at 4.4x the rate of traditional organic (Semrush, 2025). These are not casual browsers. They are prospective students who already trust the answer and arrive ready to take a next step. The volume will grow. The conversion advantage is the leading indicator.

The governance problem compounds the visibility problem. A typical R1 operates more than 100 subdomains across three or more CMS platforms, managed by thousands of casual editors with competing priorities. That would be like building a separate, unmapped physical structure for every office and cubicle on campus — we would never do that physically, but most R1s do it online every day. The result is duplicate content, orphaned pages, and diluted domain authority that confuses search crawlers and AI retrieval systems alike.

The 11 p.m. ChatGPT query is now the operational reality at the top of the funnel. Cabinet leaders who haven’t watched their institution disappear from those answers should run the test this week.

A deeper breakdown of the recommendation gap and elite-school over-indexing is in development for this hub.

§ 02What GEO actually means

Generative engine optimization (GEO) is the practice of structuring content so AI engines cite, quote, and recommend your institution. It is not a rebranding of SEO. It is a shift in what you are competing for. SEO competes for rank position on a search results page. GEO competes for citation inside a synthesized answer. Rank wins clicks; citation wins consideration.

The academic foundation for this field is the Princeton/IIT Delhi paper that introduced the term (Aggarwal et al., 2024). That study tested optimization techniques systematically and found three moves that lifted citation visibility by 30–40% without adding new substantive content: inline citations to original studies, government data, and peer-reviewed papers; direct quotations from named, credentialed humans; and statistics with specific numbers and attribution. None of those moves require new research, new programs, or new positioning. They require new content discipline.

What that adds up to is unflattering for most higher ed websites. The brochure-style program page — three paragraphs of generic positioning, a bulleted list of “what you’ll learn,” and an unnamed staff byline — is virtually invisible to AI engines. The pages that get cited look more like reference documents than marketing pages. They have credentialed authors. They cite primary sources inline. They include real numbers. They answer the question first.

I’ll say it plainly: the marketing instincts that built the modern university website are the wrong instincts for citation-based search. The fix is not better copywriting. The fix is restructuring how content is produced and what it has to contain to be eligible for citation in the first place.

A dedicated piece on citation-driving content patterns is in development for this hub.

§ 03Why R1s outside the elite tier are structurally under-cited

LLMs over-index on a small set of elite schools. Ask any major AI engine to recommend the best psychology programs, or the strongest engineering schools, or the best public research universities, and the same eight to twelve names come back. Harvard. Stanford. MIT. Berkeley. Michigan. Often the same handful in nearly the same order. Aspirational R1s, regional R1s, and most public flagships outside the top tier are largely absent unless explicitly named in the prompt.

The mechanics are not mysterious. Two structural facts drive most of the gap.

First, AI engines barely cite the .edu domain directly. Perplexity cites .edu sources only 3.2% of the time (Search Engine Journal, 2024). The single most-cited source in ChatGPT’s top citations is Wikipedia, at 47.9% (Profound, 2025). Universities whose research centers, signature programs, and senior faculty lack accurate Wikipedia entries are ceding the most-cited surface in generative AI to whatever the encyclopedia has on file — which is often outdated, incomplete, or absent entirely.

Second, earned media is the actual GEO infrastructure. Brands in the top quartile for off-site mentions earn 10x more AI citations than the next quartile (LLMvlab, 2026). Elite institutions accumulate that media weight passively, through national press coverage, prestige reputation, and decades of network effects. For everyone else, citation-worthy earned media has to be built deliberately — through faculty visibility, research promotion, signature program storytelling, and digital PR work that most higher ed comms shops have de-prioritized over the last decade.

This is the structural exposure: R1s outside the elite tier face the steepest pipeline pressure from the demographic cliff and the deepest visibility gap in the channel where students are searching. I think this is the most under-discussed strategic problem in higher ed digital right now.

A full treatment of earned media and digital PR for AI visibility is in development for this hub.

§ 04Why GEO sits on top of SEO, not beside it

Traditional SEO is the foundation for GEO, not its replacement. Profound’s analysis of AI Overview citations finds that 76% of citations come from pages ranking in Google’s top 10 (Profound, 2025). The pages AI engines cite are, with rare exception, the pages that already rank well in conventional search. The “SEO is dead” narrative is wrong in the most expensive direction possible.

This makes intuitive sense once you look at the signals. The same factors that earn a top-10 Google ranking — clean information architecture, credentialed authors, structured data, inline citations to primary sources, content freshness, semantic HTML — are exactly what feed the retrieval pipelines inside LLMs. A page that ranks fifty-eighth in Google is not going to be the citation Perplexity returns. The competition for citation runs through the competition for rank.

One contrary data point deserves honest treatment. A separate LLMvlab analysis claims that 93.7% of AI Overview links come from pages outside the top 10. The two findings disagree sharply, and the methodologies are not equivalent. The Profound data covers a far larger citation sample, and the directional finding — that top-ranking pages are heavily over-represented in AI citations — is corroborated by separate work from AirOps and Foundation Marketing. Treat the LLMvlab number as a single-source outlier until more evidence emerges.

The practical implication is uncomfortable: institutions that have neglected technical SEO over the past three years cannot leapfrog into GEO. The foundation has to be repaired first. Crawlability, page speed, semantic markup, internal linking, content freshness — the unglamorous infrastructure of search — is the prerequisite for citation visibility, not an alternative to it.

A companion pillar on technical SEO foundations is in development for this site.

§ 05What good looks like

The institutions that earn AI citations consistently do five things well. None of these is a feature; each is a sustained operating capability. Below is the rough diagnostic I’d run on any R1 in 2026.

Lever	What “good” looks like	Where to start
Content patterns	Body-level inline citations, named credentialed authors, statistics with attribution on every consequential page	Audit your top 10 program pages against the Princeton GEO triad
Schema markup	EducationalOccupationalProgram, FAQPage, and Person schema enforced at the CMS template level — not optional, not editor-by-editor	Run Google’s Rich Results Test on flagship program pages
Earned media and Wikipedia	Top-quartile off-site mentions; accurate Wikipedia entries for signature programs, research centers, and senior faculty	Map the citation footprint of three peer institutions and find the gap
Tracking	Citation rate, share of voice, and share of model measured quarterly across a stable prompt set	Build a 25-prompt audit using a free spreadsheet and incognito browser tabs
Governance	One web council with real authority, one published policy, schema and accessibility enforced at template level across 100+ subdomains	Stand up a governance council within 90 days

Each row is a workstream, not a project. Real citation visibility comes from running all five in parallel for several quarters, not from picking one. Institutions that try to bolt GEO onto a broken governance model — which is most institutions — discover quickly that the rate-limiting step is operational, not technical.

The most foundational decision sits underneath all five: whether to allow AI crawlers to access your content at all. That choice carries implications for IP protection, faculty research, training-data inclusion, and citation visibility — and most digital teams are making it by default.

A decision framework on AI bot access policy is in development for this hub.

§ 06Frequently asked questions

Is GEO the same as SEO?

No. SEO competes for rank position on a search results page; GEO competes for citation inside an AI-generated answer. The signals overlap heavily — clean architecture, credentialed authors, structured data, inline citations — but the optimization target is different. SEO earns clicks. GEO earns consideration in a zero-click environment.

Should we still invest in SEO if AI search is taking over?

Yes, more than before. 76% of AI Overview citations come from pages that already rank in Google’s top 10 (Profound, 2025). The foundation of GEO is strong SEO; institutions with broken technical SEO cannot leapfrog directly into citation visibility. The cost of neglecting traditional search infrastructure now compounds in the channel where students are actually searching.

How do we measure whether AI engines are citing our university?

Start with a free quarterly audit: 25 stable prompts run in incognito mode across ChatGPT, Perplexity, Gemini, and Google AI Overviews. Score each prompt on four dimensions — presence, position, framing accuracy, and sentiment. Enterprise tools (Profound, AthenaHQ, Conductor, Ahrefs Brand Radar) add scale and historical trending, but a spreadsheet-based audit is the right starting baseline.

Do we need to block AI crawlers to protect our content?

That decision is more nuanced than the technical configuration suggests. Blocking GPTBot, ClaudeBot, and Google-Extended protects your content from training-data inclusion but also reduces your visibility in the systems that use real-time retrieval. The right answer depends on institutional IP posture, faculty research protection, and how aggressively you intend to compete for AI citations. It is a general counsel-level decision most digital teams are making by default — usually by doing nothing.

§ 07The twelve-month window

This is the front door now. The 17-year-old at 11 p.m. is not a future scenario; she is the current reality at the top of the enrollment funnel. The institutions that move on findability over the next twelve months — repairing the SEO foundation, instrumenting schema, enforcing governance, building citation-worthy content, and tracking presence in AI answers — will define the competitive geometry for the next decade.

The institutions that wait will not lose the channel dramatically. They will lose it quietly, one consideration set at a time, in conversations they cannot see and traffic they never receive. The cabinet question is whether to act before the demographic cliff and the citation cliff compound, or after.

Start this week. Run the 11 p.m. test on your strongest disciplines — ask ChatGPT what it recommends and write down which institutions appear. That is your baseline. Convene your web governance council before the end of the month, and audit schema markup on your top five academic program pages. From there, the levers reinforce each other: schema needs governance to hold at scale, content needs earned media to compound, tracking needs all of them to mean anything.

The blog will spend the coming months walking each lever in depth. The pillar pages are the map; the cluster posts are the playbook.

Sources: WICHE (2024), EAB (2026), Frankel Agency (2025), Semrush (2025), Profound (2025), Search Engine Journal (2024), LLMvlab (2026), Aggarwal et al. (2024), Search Influence/UPCEA (2026).

The Front Door Moved: Generative Engine Optimization for R1 Universities