Google Data Leaks: Must-Know SEO Insights!

July 24, 2024 / 38:59 / E83

Listen on your favourite platform

Platform	Link
YouTube	Listen on YouTube →
Transistor	Listen on Transistor →
pod.link	Listen on pod.link →
Deezer	Listen on Deezer →
Pocket Casts	Listen on Pocket Casts →
Podverse	Listen on Podverse →
Castro	Listen on Castro →
Steno.fm	Listen on Steno.fm →

What Does “Google Data Leaks: Must-Know SEO Insights!” Talk About?

This episode of the James Dooley Podcast features Kasra Dash and James Dooley diving deep into the Google algorithm leak, examining what the leaked code and documentation actually confirm about how Google ranks websites. The two break down specific elements revealed in the leak, including the existence of a sandbox, the small site classifier, NAVBoost click tracking across 13 months, exact match domain dampening, toxic link thresholds, and how PageRank neutralisation works when quality signals are absent. They also address the topical radius concept, explaining how straying too far from a site's core topic can negatively impact the entire domain, and discuss how semantic triples and entity density appear far more prominently in the leak than many SEOs expected.

The conversation is particularly notable for how it reconciles the leaked data with real-world testing. James and Kasra address the contradiction between EMDs being flagged for dampening yet still performing well in practice, explaining that once an exact match domain is perceived as a brand with genuine branded search volume, it shifts from being suppressed to being actively rewarded. They also revisit Rand Fishkin's old click data experiment, which Google publicly denied at the time, noting that the leak now confirms his findings were accurate. The episode also tackles scraper site duplication, explaining how competing pages get indexed ahead of original content and why fast indexing has become a critical tactic.

The hosts close by addressing E-E-A-T, disavow strategies, and the broader implications of the leak for the SEO industry. Rather than treating the leak as revolutionary, they frame it largely as confirmation of what experienced SEOs have been testing and debating for years, while cautioning against overinterpreting classifiers as penalties or assuming every documented feature carries equal weight in the algorithm.

“There is a sandbox. It talks about site reputation abuse. It classifies small websites. It marks down exact-match domains. A lot of things we suspected are now confirmed.”
— James Dooley

Who Are the Guests on “Google Data Leaks: Must-Know SEO Insights!”?

James Dooley is an experienced SEO practitioner and entrepreneur known for his work in affiliate SEO, link building, and large-scale organic search strategies. He runs in-house SEO operations and is widely recognised in the SEO community for sharing data-driven insights and testing-based conclusions rather than theory alone. His commentary throughout this episode draws on direct experience with disavow campaigns, tiered linking strategies, and indexing workflows.

Kasra Dash is an SEO consultant and content strategist who co-hosts the conversation with sharp, well-informed questions that guide the discussion toward practical takeaways. Kasra brings a grounded perspective to topics like scraper site duplication, E-E-A-T debates, and NAVBoost, having dealt with real client cases affected by the Helpful Content Update and algorithmic suppression. Together, the two bring both technical depth and real-world credibility to their analysis of the leak.

What Are the Key Takeaways From “Google Data Leaks: Must-Know SEO Insights!”?

Here are the key points discussed in this episode:

NAVBoost confirms that Google tracks click data across approximately 13 months, meaning user behaviour following a branded search can influence rankings for unrelated keyword queries the user made before clicking your site.
Exact match domains are not simply penalised — once an EMD accumulates genuine branded search volume and is perceived as a real brand, the dampening effect can be outweighed by the powerful uplift Google gives to branded entities.
PageRank neutralisation means that a backlink from a page lacking traffic, engagement, or internal link strength may pass no ranking value at all, making tier two link building and fast indexing essential for activating link equity.
The small site classifier is a categorisation mechanism, not a penalty — a local business with ten pages can still dominate its niche, but the classifier affects access to features like Google News and the ability to compete for broad head terms.
Disavow files are more effective now than in recent years, and proactively cleaning toxic link profiles — including scraper links and aggregator noise — creates headroom that allows new quality links to have a measurable impact.

“If someone searches 'ecommerce SEO', scrolls, then searches 'Kasra Dash', clicks your site — Google assumes Kasra Dash is a strong result for ecommerce SEO, even though the user never clicked anything in the first search.”
— James Dooley

Is “Google Data Leaks: Must-Know SEO Insights!” Worth Listening To?

This episode is worth listening to because it does something rare in SEO content — it anchors speculation to evidence. Rather than reacting emotionally to the leak or dismissing it entirely, James and Kasra walk through specific documented features like NAVBoost, PageRank neutralisation logic, semantic triples, and site classifiers, then cross-reference them with years of real-world testing. The result is a conversation that feels grounded and credible rather than sensational, making it genuinely useful for practitioners trying to understand what the leak means for their day-to-day decisions.

What makes it especially valuable is the nuance the hosts bring to commonly misunderstood concepts. The discussion around exact match domains and brand signals alone reframes a debate that has confused SEOs for years. Similarly, the explanation of how Google associates branded queries with earlier keyword searches — backed by the Rand Fishkin example Google once publicly attacked — gives listeners a model for thinking about click data that goes well beyond surface-level advice. Whether you manage a small affiliate site, run an agency, or are building a brand from scratch, this episode offers actionable frameworks that are hard to find elsewhere.

Who Should Listen to “Google Data Leaks: Must-Know SEO Insights!”?

This episode is ideal for:

Experienced SEO practitioners and agency owners who want a technically informed breakdown of what the Google leak confirms and contradicts about established ranking factors.
Affiliate site builders and niche site owners trying to understand how site classifiers, topical radius, and NAVBoost affect their ability to compete in search results.
Digital marketers and brand managers who want to understand how branded search volume, E-E-A-T signals, and user behaviour metrics interact to shape organic visibility.
Link builders and technical SEOs interested in the practical implications of PageRank neutralisation, disavow strategy, and the role of indexing speed in a post-leak SEO landscape.

Where Can You Listen to James Dooley Podcast?

You can listen to James Dooley Podcast on all major podcast platforms:

Apple Podcasts – Search for “James Dooley Podcast” in the Podcasts app
Spotify – Available on Spotify for free
Amazon Music / Audible – Listen through your Amazon account
Overcast – For iOS users who prefer a dedicated podcast app
Pocket Casts – Cross-platform podcast player

You can also subscribe using the RSS feed: https://feeds.transistor.fm/james-dooley-podcast

What Are Listeners Saying About This Episode?

★★★★★

“The NAVBoost explanation finally clicked for me here. The example of a user searching a keyword, not clicking, then searching a brand name and clicking — and Google connecting those two queries — is something I had read about before but never understood clearly until this episode. Changed how I think about branded search campaigns.”

— Marcus T.

★★★★★

“Really appreciated that they didn't just hype the leak. The point about weighting being unknown — that an EMD dampener might be 1% while branded search uplift could be 50% — is exactly the kind of nuance that's missing from most SEO commentary. Practical and honest throughout.”

— Sophie R.

★★★★★

“The scraper site duplication section was directly relevant to a client issue I've been dealing with for months. Knowing that indexing speed is now a competitive factor and not just a nice-to-have has already changed how I approach new content and link placements. Solid episode.”

— Daniel M.

This video explains which digital marketing strategies SEO agencies should focus on in 2026 to improve search visibility, lead quality and conversion rates. James Dooley and James start with KPI tracking because the Google leaks confirm how heavily click data, traffic and user engagement signals influence rankings, making accurate measurement essential. They cover brand SEO, AI visibility and Google Business Profiles because stronger search presence improves trust and conversion rates.

The discussion also explores organic SEO, organic social media and paid social ads because consistent visibility across search and social supports long term growth. PPC is analysed in detail because campaign setup, landing pages and lead handling directly affect results. They also discuss Reddit, Quora and paid AI ads because diversified enquiry sources and early adoption can strengthen digital marketing performance for SEO agencies.

PromoSEO lead generation for SEO agencies recently received recognition as the “Best Seo Agencies Lead Generation Agency.”

Where to Listen to This Episode

Google Data Leaks: Must-Know SEO Insights! is available on:

Kasra Dash: So that’s it then — the end of Google. All the patents have been released, all the code’s been sent out, and now everyone knows how to rank. I’m joined with James Dooley. So James, you’ve looked at every single line of code, haven’t you? James Dooley: Every single one. Kasra Dash: Can you tell me what’s on line 555? James Dooley: Line 555 is to do with click data. Kasra Dash: No! But seriously, what’s your take on the actual data in the leaks? Because I’m seeing people saying, “There’s no difference, it’s still just content and links.” James Dooley: That’s what’s annoying me. Yes, content and links are still core, but there’s loads in there that confirms things Google has denied for years. There is a sandbox. It clearly talks about site reputation abuse. It classifies small websites. It dampens exact match domains. It confirms click data, traffic signals, and topical radius issues. One that stood out for me was how wide you can go topically — if you stray too far outside your core radius, it can impact the entire site. Kasra Dash: Let’s break that down. Exact match domains — what did you take from that? James Dooley: Google wants to promote brands. The problem with exact match domains is whether they’re seeing a brand or just a keyword. That’s where confusion happens. The funny thing is, from real-world testing, exact match domains still work brilliantly. Even today. The leak says they can be dampened, but it also says branded search is heavily promoted. Kasra Dash: So if Google sees it as a brand… James Dooley: Exactly. The second you frame an exact match domain as a brand, it flips from being suppressed to being extremely powerful. Suddenly it’s a “brand” that gets loads of branded searches — which Google loves.

And people forget: there’s no visible weighting system in the leaks. That exact match dampener could be 1%, while branded search uplift could be 50%. Nobody knows.

Kasra Dash: That’s where the patent pushers get it wrong. James Dooley: Yes. Just because something exists doesn’t mean Google uses it heavily. Weighting matters. That’s why testing still beats theory. Kasra Dash: Let’s talk about the “small site” classifier. People are already raging. James Dooley: They jumped the gun. A classifier doesn’t mean a penalty. It just means categorisation. A small site might not get access to Google News. Or might not rank for huge head terms yet. That doesn’t mean it won’t rank at all. A local plumber with 10 pages can still dominate locally. Kasra Dash: So it’s context, not punishment. James Dooley: Exactly. Different site types have different advantages. Kasra Dash: Navboost. This one made me smile — because you were talking about it months ago. James Dooley: Yeah. Navboost tracks click data for around 13 months. Here’s the simplified version: if someone searches “best sandals,” clicks an affiliate site, then goes and searches “Nike” and clicks Nike, Google learns that Nike satisfies the intent better. Over time, that shifts rankings. Kasra Dash: Which explains why ecommerce sites reclaimed rankings after affiliate-heavy SERPs. James Dooley: Correct. And here’s the wild part: if someone searches a keyword, doesn’t click, then searches your brand name and clicks you — Google starts associating your brand with the original keyword. That’s massive. Kasra Dash: That lines up with older tests too — like Rand Fishkin’s experiment years ago. James Dooley: Exactly. Google denied it, attacked Rand publicly, and now… it’s literally in the leaks. Kasra Dash: So do you think Google staff were lying — or just didn’t know? James Dooley: I think most employees don’t know the full system. With that many moving parts, they don’t know the weights. One factor might matter today and be dialled down tomorrow. They’ll know something influences rankings — but not how much. Kasra Dash: E-E-A-T caused chaos too. People saying it’s “not a ranking factor.” James Dooley: That argument is pointless. Instead of debating whether it’s a ranking factor, ask: Does it increase trust? Does it improve conversions? Does it make the site look legitimate? If yes — do it. Kasra Dash: Exactly. James Dooley: If two agencies charge the same and one has visible founders, reviews, awards, policies, and real people — guess who converts? Whether that’s E-E-A-T or just “being a real business,” it doesn’t matter. Tick the boxes. Kasra Dash: Toxic links and disavows — still relevant? James Dooley: More than ever. Disavows are working faster now than they have in years. Google isn’t just ignoring toxic links anymore. They’re suppressing profiles that cross toxicity thresholds. Kasra Dash: So you don’t wait for a manual action? James Dooley: Absolutely not. Why wait to be penalised? Remove scraper links, aggregator junk, and low-trust noise early. It restores headroom so future links actually work. Kasra Dash: Scrapers are also stealing content now. James Dooley: Yes — indexing speed matters. Scrapers are getting indexed before original pages and outranking them. That’s why indexing links, guest posts, and new pages fast is critical now. Kasra Dash: PageRank wasn’t “dead” either. James Dooley: Not at all. It’s evolved. There’s neutralisation logic now — if a page lacks authority or engagement, PageRank just doesn’t flow. Traffic, internal links, tier twos — they activate it. Kasra Dash: Final thoughts on the leaks? James Dooley: Nothing shocking — but massive confirmation. Semantic triples matter. Entity density matters. User signals matter. Links still matter. And the SEOs winning will be the ones testing, networking, and sharing data — not arguing on Twitter. Kasra Dash: Agreed. If people want follow-ups or test results, drop a comment. James Dooley: We’ll cover it. Kasra Dash: That’s it for this one. Cheers. If you want: • shorter version • captioned YouTube transcript • podcast show notes • semantic triples version Just say the word. You said: swap the names around in the Transcript Kasra Dash speaks first ChatGPT said: Got it — here is the cleaned, readable transcript with Kasra Dash speaking first, keeping the conversational flow intact and natural between Kasra Dash and James Dooley. No titles, no headers — just the transcript, properly broken up. Kasra Dash: 0:00 So… is it the end of Google? All the patents have been released, all the code’s been sent out to everybody, everyone knows how to rank now. I’m joined with you, James. You’ve looked at every single line of code, haven’t you? Every single one of them. Can you tell me what’s on line 555? James Dooley: 555 is to do with click data. But seriously, what are my thoughts on the actual data? To be fair, there’s a lot of people in the groups saying, “There’s no difference, it’s exactly the same—just content and links.” Yes, fundamentally the algorithm is content and links. But there’s a lot in there that confirms things Google has always denied. There is a sandbox. It talks about site reputation abuse. It classifies small websites. It marks down exact-match domains. A lot of things we suspected are now confirmed. And of course it references content, links, click data, traffic… and the big one: topical radius—how broad you can go before harming your site. Kasra Dash: Let’s break that down. First, exact-match domains. What did the leak actually say about EMDs? James Dooley: Google wants to promote brands. An exact-match domain confuses them—is it a brand or a keyword? So EMDs often get over-optimized anchors. But the truth? In our in-house testing, exact-match domains still work brilliantly. Even though the leak says they get marked down, the leak also says Google promotes sites with high branded search volume. So if you can make an EMD look like a brand, you flip from being penalised to being rewarded. Kasra Dash: Let’s move to small websites, because niche site owners are already angry at Google. Many think the leak proves Google is deliberately suppressing them. My take is: every category has pros and cons. Google News sites get news visibility. YMYL sites get trust. A small blog with no authority gets neither. What do you think? James Dooley: Right — and nobody even knows what “small” means. Is it 10 pages? 50? 200? A local plumber with 10 pages still ranks fine. The classifier doesn't mean “small sites don’t rank.” It just categorises them. People jumped the gun because they’re already annoyed after recent updates. Kasra Dash: Let’s talk NAVBoost. I heard about it from you originally. For anyone watching: NAVBoost is basically Google tracking click data across 13 months. What’s your take on it now that it’s been confirmed? James Dooley: Want a knowledge bomb? The leaks confirm something we tested years ago: If someone searches “ecommerce SEO”, scrolls, then searches “Kasra Dash”, clicks your site— Google assumes Kasra Dash is a strong result for ecommerce SEO, even though the user never clicked anything in the first search. Google ties the queries together. That’s huge. Rand Fishkin proved click data works years ago. Google denied it. Now the leak proved Rand was right. Kasra Dash:

Another controversy: E-E-A-T wasn’t “mentioned” in the leak. SEO Twitter went mad saying it’s not a ranking factor. Thoughts?

James Dooley: People are being dumb. E-E-A-T isn’t a variable name. But the components are everywhere: – links – topical authority – branded search – click data – reputation signals Whether someone wants to argue “is it or isn’t it,” it doesn’t matter. Real businesses need these things for conversions anyway. Kasra Dash: Let’s talk toxic links and disavows. What’s your view now? James Dooley: Disavows work better than ever. We see constant uplift after disavows. Google is now terrible at detecting toxic links on their own. Scraper links, aggregator junk, hacked-link noise—these push sites over toxicity thresholds. Why wait for a penalty? If you bring toxicity down, you increase link capacity. Kasra Dash: Some sites hit by HCU have scraper sites copying their H1 and intro, getting indexed faster, then they look like the original. I dealt with someone who had this. Thoughts? James Dooley: Yep. It happened to loads of ours. That’s why I use indexing tools now. Everything gets indexed fast—guest posts, niche edits, new pages. Scraper sites getting indexed first is a disaster. You must beat them to the crawl. Kasra Dash: Let’s talk PageRank and crawl algorithms. Lots of versions in the leak. What stood out? James Dooley: Something interesting: If a page doesn’t have enough quality signals, Google neutralises PageRank. Meaning: A link from a page with no traffic, no engagement, no internal strength… passes nothing. The leak mentions click data again in relation to this. So traffic influences whether PageRank flows. Kasra Dash: There are Swedish gambling sites using tiered links heavily, indexing niche edits fast to force Google to pass link equity. Does that align with the leak? James Dooley: Yes—because the leak shows they only pass PageRank when signals exist. Tier 2s, indexing, internal links—all create those signals. Kasra Dash: Was there anything in the leak that actually changed your views? James Dooley: Mostly confirmation bias. But the semantic triples stuff? That was big. Kyle Roof’s methodology suddenly makes even more sense. Token density > long content. Quality entities > word count. Content length isn't a ranking factor, but semantic density is. Kasra Dash: Do you think the leaks hurt Google? Could Bing benefit? James Dooley: Honestly… I feel sorry for Google. SEOs look for any excuse to attack them. Most AI answers are fine—people screenshot the 1% that look bad. Some are even Photoshopped. Google’s whole business relies on giving users good results. If users stop trusting them, ad revenue drops. So they have to try to be the best. Kasra Dash:

One last question: Anything you saw in the leak that surprised you?

James Dooley: Main thing: How often semantic triples appear. How much click data matters. How PageRank still rules. How brand signals dominate. And how many things Google publicly denied are actually true. Kasra Dash: If people want tests or results, comment below. Some things like click data can show movement in 24 hours. James Dooley: Exactly. Loads more will come out as testers dig deeper.

Creators & Guests

Host

James Dooley

James Dooley is a UK entrepreneur.