SEO Scanner — Godrank

features

What We Analyze

Every scan runs your page through 12 ranking signals extracted from Google's leaked Content Warehouse API, three patents, and the Quality Evaluator Guidelines.

content_effort.rs

🔎

Content Effort Score

contentEffort

Measures word count depth, lexical diversity, original media, and information density. Maps directly to Google's LLM content effort classifier.

panda_classifier.rs

🐼

Panda Quality Check

pandaDemotion

N-gram filler phrase detection from Panda patent US9031929B1. Identifies template language, thin content patterns, and quality demotion triggers.

ai_detector.rs

🤖

AI Pattern Detection

humanFingerprint

Detects LLM-generated vocabulary patterns, sentence uniformity, and predictable structure. Flags content that reads like AI output.

eeat_analyzer.rs

🎓

E-E-A-T Signals

authorReputationScore

Checks for author credentials, first-person experience, citations, and data points. Maps to Google's Experience, Expertise, Authority, Trust framework.

navboost.rs

🚀

NavBoost Readiness

goodClicks / lastLongestClicks

Evaluates user engagement signals: conclusion presence, FAQ sections, and structural elements that drive dwell time and reduce pogo-sticking.

info_gain.rs

🧠

Information Gain (AI)

US11354342B2

AI-powered analysis of content uniqueness vs. existing SERP results. Measures original insight, novel data, and topical depth beyond competitors.

intelligence sources

Our analysis is grounded in real signals from Google's internal systems — not guesswork.

🔒

Content Warehouse API Leak

2,596 modules · 14,014 attributes

The largest leak of Google's internal ranking systems. Includes contentEffort, pandaDemotion, siteAuthority, navDemotion, OriginalContentScore, and NavBoost signals.

📜

Panda Quality Classifier

Patent US9031929B1

Google's patent for scoring page quality using n-gram phrase analysis, filler detection, and template content patterns that trigger quality demotions.

📜

Information Gain Patent

Patent US11354342B2

Measures how much new information a page contributes beyond what's already in search results. Higher gain = higher rankings.

📖

Quality Evaluator Guidelines

Google QEG · Sept 2025

Official evaluation criteria used by Google's human raters. E-E-A-T framework, YMYL classification, Page Quality scoring, and Needs Met assessment.

process

How It Works

Three steps. Under 60 seconds. Full signal analysis.

Enter Target URL

Paste any URL or HTML source code. Our multi-proxy engine bypasses CORS, cookie walls, and anti-bot protection.

$ fetch --url target.com

AI Analyzes Signals

7 local classifiers + 5 AI-powered factors run in parallel. Claude or GPT-4o deep-analyzes content effort, information gain, and YMYL compliance.

$ analyze --signals 12

Get Actionable Report

Detailed report with scores, issues, and copy-paste fix prompts for each factor. Download as PDF or scan another URL.

$ report --format pdf

use cases

Built for Professionals

Whether you're optimizing a single page or auditing an entire site.

💻

SEO Professionals

Audit client pages against actual Google signals, not vanity metrics.

✍️

Content Teams

Get fix prompts to improve content before publishing.

🏢

Agencies

White-label PDF reports for client deliverables.

📰

Publishers

Ensure editorial content meets Google's quality bar.

about

What Is the Godrank SEO Scanner?

Godrank SEO Scanner is a reverse-engineering tool that evaluates any URL against real signals extracted from Google's leaked Content Warehouse API, three published patents, and the official Quality Evaluator Guidelines. Unlike traditional SEO audits that rely on proxy metrics and vanity scores, Godrank analyzes the same internal attributes that Google's own classifiers use to determine rankings.

In May 2024, thousands of internal API documents from Google's Content Warehouse leaked publicly — revealing 2,596 modules and 14,014 attributes that power Google's ranking systems. Signals like contentEffort, pandaDemotion, siteAuthority, navDemotion, and NavBoost were exposed for the first time. Godrank maps these signals to actionable checks that run against your content in under 60 seconds.

Every scan combines 7 local classifiers (word count analysis, Panda filler detection, AI pattern recognition, E-E-A-T signal mapping, structure checks, meta analysis, NavBoost readiness) with 5 AI-powered deep factors (Content Effort, Information Gain, YMYL Compliance, Topical Depth, Search Intent Match) powered by Claude or GPT-4o. The result is a comprehensive ranking signal audit that no other free tool offers.

intelligence

Understanding the Leaked Signals

Google's Content Warehouse API leak confirmed what SEOs long suspected — Google tracks far more signals than publicly disclosed. Here are the core systems that Godrank analyzes:

contentEffort — Google's internal classifier that measures how much genuine effort went into creating a page's content. It looks at depth, originality, media richness, and information density. Pages scoring low on contentEffort risk being filtered out of top results entirely, regardless of backlink strength.

pandaDemotion — Derived from the Panda Quality Classifier (Patent US9031929B1), this signal identifies pages with template language, filler phrases, and thin content patterns. Unlike the original Panda update which operated at site-level, the leaked API reveals a per-page demotion score that can surgically suppress individual URLs.

NavBoost / goodClicks / lastLongestClicks — Google's user engagement system that tracks click-through behavior from search results. Pages that earn goodClicks (long dwell time, no pogo-sticking) get boosted. Pages with poor engagement signals get demoted. Godrank checks structural elements that correlate with engagement: conclusion sections, FAQ blocks, table of contents, and scannable formatting.

Information Gain (Patent US11354342B2) — This patent describes a scoring method that measures how much new information a page contributes beyond what already exists in search results. Pages that simply rehash the same points as every other ranking result score low. Pages with original data, unique perspectives, or novel analysis score high. Our AI compares your content against the topical baseline to estimate your information gain.

OriginalContentScore — An internal metric that evaluates whether your content is genuinely original versus scraped, spun, or AI-generated commodity text. Google cross-references content fingerprints to detect duplication and derivative content at scale.

framework

E-E-A-T Through the Lens of Leaked Data

E-E-A-T — Experience, Expertise, Authoritativeness, and Trust — is the quality framework from Google's Quality Evaluator Guidelines. The API leak revealed that Google doesn't just rely on human raters for E-E-A-T. Automated classifiers map specific on-page patterns to each dimension. Godrank checks for these patterns:

Experience

First-person language, anecdotes, original photography, and "I tested" phrasing. The API tracks authorReputationScore which factors in demonstrated real-world involvement.

Expertise

Citations, data references, technical vocabulary appropriate to the topic, and credential mentions. The topicEmbeddings signal measures topical coherence and depth.

Authoritativeness

External link equity, brand mentions, and siteAuthority from the leaked API. This is the hardest E-E-A-T signal to build — it requires genuine off-page recognition.

Trust

The most critical signal. Covers HTTPS, privacy policy, contact information, transparent authorship, and absence of deceptive patterns. Trust failures tank all other scores.

scoring

How the Score Works

Godrank produces a weighted score from 0–100 across 12 ranking factors. Seven local classifiers analyze your page's HTML structure, text patterns, and metadata in the browser. Five AI-powered factors use Claude or GPT-4o to perform deep semantic analysis of content quality, information novelty, and intent alignment. Each factor is scored independently and weighted by its importance in Google's ranking stack.

A+

80–100 pts

65–79 pts

50–64 pts

30–49 pts

0–29 pts

Unlike SEO tools that give you a generic "SEO score" based on meta tags and page speed, Godrank evaluates content quality signals — the factors that actually determine whether Google considers your page worthy of ranking. A page with perfect technical SEO can still score poorly on Godrank if the content lacks depth, originality, or information gain.

compliance

YMYL — Your Money or Your Life

YMYL pages cover topics that can significantly impact a person's health, financial stability, safety, or well-being. Google applies substantially stricter quality standards to YMYL content. The leaked API reveals dedicated classifiers like healthScore that evaluate medical content accuracy, and the Quality Evaluator Guidelines mandate that all YMYL claims must meet expert consensus and be properly sourced.

Godrank's AI automatically detects YMYL topics in your content and flags unsourced claims, missing expert credentials, and compliance gaps. If your site covers health, finance, legal, or safety topics, this analysis is critical — YMYL violations are among the fastest paths to ranking demotion.

detection

AI Content Detection

Google's leaked API includes signals related to content authenticity and humanFingerprint analysis. While Google publicly states it doesn't penalize AI content per se, the contentEffort signal strongly correlates with human-generated content that demonstrates genuine research, original analysis, and personal insight.

Godrank scans for five telltale patterns of unedited AI output: LLM vocabulary flags (words like "delve", "tapestry", "leverage" at abnormal frequency), sentence uniformity (AI tends to produce sentences of similar length), hedging patterns ("It's important to note that..."), structural predictability (intro → 5 H2s → conclusion), and missing personal voice (no first-person perspective, anecdotes, or opinion). The goal isn't to flag AI usage — it's to identify content that reads like commodity AI output and will lose to human-crafted alternatives.

faq

Frequently Asked Questions

What is the Google Content Warehouse API leak?▼

In May 2024, over 2,500 internal API documents from Google's Content Warehouse were accidentally published to a public GitHub repository. These documents exposed 14,014 ranking attributes across modules like NavBoost, siteAuthority, contentEffort, and pandaDemotion. The leak confirmed many signals that SEOs had theorized about for years and revealed entirely new systems that Google had never publicly acknowledged. Godrank maps the most impactful of these signals to automated checks.

How is Godrank different from other SEO tools?▼

Traditional SEO tools like Ahrefs, SEMrush, and Moz focus on proxy metrics: backlinks, keyword positions, domain authority, and page speed. These are valuable but don't tell you whether Google considers your content high quality. Godrank analyzes the quality signals themselves — content effort, panda demotion risk, information gain, AI detection patterns, and E-E-A-T indicators. It answers the question other tools can't: "Does Google think this content deserves to rank?"

Is the scanner really free?▼

Yes. Godrank SEO Scanner is completely free with no signup required. You provide your own AI API key (Claude or OpenAI) for the deep analysis portion. The 7 local classifiers run entirely in your browser with zero external calls. Your data never touches our servers — everything is processed client-side.

Do I need an API key to use it?▼

An API key (Anthropic or OpenAI) unlocks the 5 AI-powered factors: Content Effort, Information Gain, YMYL Compliance, Topical Depth, and Search Intent Match. Without a key, the scanner still runs all 7 local classifiers and gives you a partial score. For the full 12-factor analysis, you'll need a key — both providers offer free tiers.

What is the Panda Quality Classifier?▼

Patent US9031929B1 describes Google's system for scoring page quality using n-gram phrase analysis. It identifies filler phrases, template language, and thin content patterns that trigger quality demotions. Unlike the original Panda update (2011) which operated at site-level, the patent describes per-page scoring. Godrank runs a local n-gram analysis that checks for 80+ known filler patterns and template phrases associated with Panda demotions.

What is Information Gain and why does it matter?▼

Information Gain (Patent US11354342B2) measures how much new, unique information your page adds beyond what's already available in existing search results. If your content simply rephrases the same points found on every other ranking page, your information gain is near zero. Pages with original research, proprietary data, unique case studies, or novel analysis score highest. Google uses this signal to decide which pages deserve to rank when content quality is otherwise similar.

What is NavBoost?▼

NavBoost is Google's user engagement ranking system, confirmed in the API leak. It tracks click behavior from search results: which pages users click, how long they stay (lastLongestClicks), and whether they return to search results (pogo-sticking). Pages that consistently earn long, satisfying visits get a NavBoost ranking increase. Godrank checks for structural elements that correlate with good engagement: clear conclusions, FAQ sections, table of contents, and scannable formatting.

Can I use the Paste HTML mode for any website?▼

Yes. If a site blocks our automated fetch (cookie walls, age gates, Cloudflare protection, JavaScript rendering), switch to the Paste HTML tab. Right-click the page, select "View Page Source", copy the full HTML, and paste it into the scanner. This works for any website regardless of anti-bot protection. The analysis is identical to URL mode.

How can I improve my Godrank score?▼

Each factor in your report includes specific fix prompts you can copy directly. Generally: increase content depth beyond 1,500 words with original analysis; eliminate filler phrases and template language; add first-person experience and real data; cite expert sources; ensure strong meta titles and descriptions; include FAQ and conclusion sections for NavBoost; and add author bios with credentials. Focus on the factors where you scored lowest — those represent your biggest ranking opportunities.

>_ Ready to decode Google?

Free. No signup. Scan any URL in under 60 seconds.