Leaked Google documents spill the secrets behind its mighty search engine (2024)

Leaked Google documents spill the secrets behind its mighty search engine (1)

Update (May 29, 5:44 pm ET): Google issued a statement, cautioning against assumptions based on "incomplete information."

What you need to know

  • Rand Fishkin of SparkToro received and published documents detailing Google Search's internal APIs, search ranking factors, and Google's data collection practices.
  • Some leaked information contradicts Google's public statements about search algorithms and ranking factors.
  • The documents were accidentally made public on GitHub from March 27 to May 7 and later indexed by a third-party service.

A massive leak of what seems to be thousands of internal documents offers a rare glimpse into the inner workings of Google Search, suggesting that Google may have been misleading the public about its search engine operations for years.

The documents were handed over to Rand Fishkin of SparkToro, a software company, who then made them public. Fishkin, a seasoned SEO expert with over a decade of experience, says a source gave him 2,500 pages of documents, hoping to debunk the "lies" Google employees had said about how the search algorithm actually works (via The Verge).

The documents spill the beans on internal APIs and break down what impacts search results. From these leaked papers, you can get a general sense of what works and what doesn't for ranking on Google, highlighting the key elements that matter most.

These leaks cover a wide range of topics, such as Google's data collection, which sites get a boost for sensitive issues like elections, and how Google treats small websites.

Interestingly, some information conflicts with what Google has publicly said. For example, Google has denied treating subdomains differently in rankings and claimed they don't use click-centric signals for content indexing, yet the leaks suggest otherwise, according to Fishkin.

Other surprises include using a sandbox for new sites, giving sites an "authority score" to bump them up in search results, and more.

Be an expert in 5 minutes

Get the latest news from Android Central, your trusted companion in the world of Android

Google has yet to respond to Android Central's request for comments, but we'll update this article when we do.

It looks like Google accidentally made these documents public on GitHub around March 27, and they were taken down by May 7. However, a third-party service indexed them, so they're still accessible.

Even though these documents reveal potential ranking factors, they don't specify the importance of each one in the final ranking, as SEO expert Mike King highlighted in his overview.

Earlier this year, Google launched a major Search update that prioritizes "helpful" content. The new algorithms are designed to determine if a webpage is made for search engines or real people.

Update

In an emailed statement to Android Central, a Google representative cautions the public not to jump to conclusions without all the facts.

"We would caution against making inaccurate assumptions about Search based on out-of-context, outdated, or incomplete information," the spokesperson said. "We’ve shared extensive information about how Search works and the types of factors that our systems weigh, while also working to protect the integrity of our results from manipulation."

Google also mentioned that it does not traditionally comment on the specifics of its ranking systems. Sharing such sensitive information could help spammers and bad actors manipulate the results, as per the company.

Search is always changing, and Google says it's constantly tweaking its systems to provide the best results. The spokesperson added that while Google's core ranking principles stay the same, individual signals can change often, be dropped, or just be tested and never used.

The search giant also reiterated its commitment to providing accurate information while protecting the integrity of search results. Finally, Google highlighted the potential for misinterpretation of the leaked documents.

Leaked Google documents spill the secrets behind its mighty search engine (2)

Jay Bonggolto

News Writer & Reviewer

Jay Bonggolto always keeps a nose for news. He has been writing about consumer tech and apps for as long as he can remember, and he has used a variety of Android phones since falling in love with Jelly Bean. Send him a direct message via Twitter or LinkedIn.

More about apps software

Overreacting is easy, but it will never fix any real problemsHow to edit texts in Google Messages

Latest

Samsung's next update might include a feature you'll probably never use
See more latest►

1 CommentComment from the forums

  • cknobman

    Search engine?

    I think it should be reworded to say "Propaganda and Manipulation" engine.

    Reply

Most Popular
Google TV introduces another set of ads for advertisers through its network
Max raises the prices of its Ad-Free and Ultimate plans
Nothing will prioritize AI over Phone 3, but is that the right call?
You can no longer fix Samsung phones at Best Buy
Our favorite last-gen smartwatch is $100 OFF right now—but who knows for how long
Galaxy AI experience will be further optimized for Samsung’s upcoming foldables
Google might bring AR directions and UWB support to Android's Find My Device
Google Keep is finally gaining the most requested window resize feature
This Best Buy deal carves a whopping 41% off the already affordable Moto G Play (2023)
Google brings 911 RCS texting capabilities to Messages across the US
Google Chrome blows competition away in Speedometer 3 tests
Leaked Google documents spill the secrets behind its mighty search engine (2024)

References

Top Articles
Latest Posts
Article information

Author: Horacio Brakus JD

Last Updated:

Views: 5571

Rating: 4 / 5 (51 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Horacio Brakus JD

Birthday: 1999-08-21

Address: Apt. 524 43384 Minnie Prairie, South Edda, MA 62804

Phone: +5931039998219

Job: Sales Strategist

Hobby: Sculling, Kitesurfing, Orienteering, Painting, Computer programming, Creative writing, Scuba diving

Introduction: My name is Horacio Brakus JD, I am a lively, splendid, jolly, vivacious, vast, cheerful, agreeable person who loves writing and wants to share my knowledge and understanding with you.