Topical Similarity and Theme Detection

Detect a website's primary theme, secondary themes, and content intent, then find similar domains. This uses only the homepage content and structure (HTML + headers). No WHOIS, backlinks, or external SEO databases.

Homepage content only Score 0-100 Same niche / close / adjacent

Analyze a domain for similarity

Enter a domain and jump directly to the similarity report.

If the domain was never checked before, analysis runs in background and the page will refresh when ready.

What we extract

  • - Title, meta description, H1, first H2
  • - Hero text and CTA patterns
  • - Image alt texts
  • - Language and basic tech fingerprints

How similarity is scored

  • - Semantic embedding cosine similarity (about 45%)
  • - Keyword overlap and structure
  • - CTA patterns and tech signals

Get started

Run a domain report and open the similarity section.

Check a domain

Recent similarity reports

Last checked domains that already have 2-5 similar sites detected.

Browse all domain reports

Privacy and limitations

We do not use WHOIS and do not rely on external SEO databases. We also sanitize extracted text to avoid storing personal data. Pages with very small amount of content can produce low-confidence results.