943 words
5 minutes
This isn't a new game, it's just the next level

In our last discussion, I pulled back the curtain on “AI Search,” revealing it’s not a magical oracle but a specific, two-part pipeline: a Retriever finds relevant documents, and a Generator synthesizes them into an answer.

This framework, known as Retrieval-Augmented Generation (RAG), might feel like a recent breakthrough, but it’s not a beginning.

It’s a destination.

It’s the result of a two-decade-long evolutionary journey that search engines have been on since their inception.

The mission has always been singular: to perfectly understand a user’s intent. Only the tools have changed.

Understanding this history is the key to seeing the present clearly and preparing for the future without panic.

Our strategies as SEOs have always been a reflection of the machine’s capabilities. As it got smarter, so did we.

Era 1: A web of strings#

In the early days, search engines were magnificent, but simple, text-matching machines. They didn’t understand ideas; they understood strings of characters.

Their job was to find documents that contained the exact keywords you typed.

Algorithms like BM25, a classic still respected in information retrieval, were brilliant at this, but they were fundamentally performing sophisticated keyword counting.

Our work as SEOs directly mirrored this mechanical reality. That’s how the era of keyword density started.

Era where, everyone was meticulously placing search terms in titles, H1s, and meta tags.

The optimization was done for a machine that could read, but not yet comprehend. We matched strings because the machine was matching strings.

Era 2: The semantic leap, when search learned to read between the lines#

The first true revolution was the shift from matching words to understanding meaning. This was the dawn of semantic search, powered by a technology called vectorization, or embeddings.

Suddenly, the word “king” was no longer just a string of four letters. It became a point on a vast, multi-dimensional map, located near “queen” and “monarchy” but far from “cabbage.”

This allowed search engines to grasp that a page about “royal succession in England” was deeply relevant to a query about the “next English king,” even if the exact keywords were missing.

This capability was supercharged by transformer models like BERT. This is not ancient history; it’s the very engine inside the modern systems we discussed.

When the RAG paper details its Retriever component, it’s not describing some futuristic tech.

The authors state it is based on a “bi-encoder architecture… where d(z) is a dense representation of a document produced by a BERTBASE document encoder… and q(x) a query representation produced by a query encoder, also based on BERTBASE” (Lewis et al., 2020, p. 3).

The modern Retriever is a BERT-based semantic search engine.

This technological leap forced our hand as SEOs, for the better. “Keyword stuffing” became a relic.

We started talking about “semantic SEO,” “user intent,” and “topical authority.”

Our focus shifted from optimizing a single page for a single keyword to building comprehensive content clusters that answered a whole ecosystem of related user questions.

We had to optimize web entiry for a machine that could finally understand context.

Era 3: The agent era, when search learned to ask its own questions#

This brings us to today’s frontier.

The latest evolution is that the AI is no longer just a passive recipient of information.

It is becoming an active, autonomous agent that knows what it doesn’t know. This is the central, powerful insight from the 2023 Toolformer paper from Meta AI.

They proved that a language model could be trained to recognize the limits of its own knowledge and proactively seek out missing facts using external tools, like a Wikipedia search API.

The goal, in the researchers’ own words, was to introduce “a model trained to decide which APIs to call, when to call them, what arguments to pass, and how to best incorporate the results into future token prediction” (Schick et al., 2023, p. 1).

Let that sink in. The AI is generating an answer, realizes it needs a specific piece of data it doesn’t have, and autonomously executes an internal search to get it.

It then integrates that fact into the text and continues. The AI is now a user of search, just like your customers.

This has profound implications for our work. Our content is no longer just the answer to a human’s question.

It is now also the answer to the AI’s own, self-generated questions.

For our content to be selected as the definitive “tool” for the AI agent, it must be more than just topically relevant. It must be structured for machine consumption:

  1. Factually Unambiguous: It needs a clear, reliable answer.
  2. Easily Parsable: The AI needs to extract the specific piece of data it’s looking for with minimal effort. A well-defined data point in a table, a clear definition under an H2, or a concise answer to a direct question is far more “tool-friendly” than a dense, narrative paragraph.

The lesson from evolution is clarity#

The historical trend is undeniable.

Each era of search evolution has pushed us further away from technical loopholes and closer to fundamental, inarguable quality.

  • Era 1 (Strings): You could win with clever tricks.
  • Era 2 (Semantics): You had to win with more comprehensive, authoritative content.
  • Era 3 (Agents): You must win by being the most reliable, clear, and citable source of truth.

The machine has gotten progressively smarter, and at every stage, it has rewarded those who were already focused on providing the best possible answer for the user.

AI Search isn’t a frightening new paradigm; it’s just the next, higher level of the game we’ve been playing all along. The bar for quality has simply been raised.

Stick with me for the next post, I’ll look under the hood of systems like Bing Chat and Perplexity to see exactly how they use these Retriever and Generator components in a live environment, and show you how the classic architecture of search still forms the unshakable foundation for it all.

This isn't a new game, it's just the next level
https://melky.co/next-level/
Author
Myriam
Published at
2025-07-29
License
CC BY-NC-SA 4.0