seo-audit: Add warning about web_fetch unable to detect JS-rendered schema

web_fetch strips <script> tags during HTML→markdown conversion, which
silently discards JSON-LD schema blocks. Additionally, many CMS plugins
(AIOSEO, Yoast, RankMath) inject schema via client-side JavaScript,
making it invisible to both web_fetch and curl.

This has led to false audit findings in production (reporting 'zero
structured data' on sites with extensive schema implementation).

Adds a prominent warning in the Audit Framework section and a note
in the Tools section recommending browser rendering, Rich Results Test,
or Screaming Frog for accurate schema detection.
This commit is contained in:
willscott-v2 2026-02-15 07:12:46 -06:00
parent a04cb61a57
commit 170af13f24

View file

@ -34,6 +34,19 @@ Before auditing, understand:
## Audit Framework
### ⚠️ Important: Schema Markup Detection Limitation
**`web_fetch` and `curl` cannot reliably detect structured data / schema markup.**
Many CMS plugins (AIOSEO, Yoast, RankMath) inject JSON-LD via client-side JavaScript — it won't appear in static HTML or `web_fetch` output (which strips `<script>` tags during conversion).
**To accurately check for schema markup, use one of these methods:**
1. **Browser tool** — render the page and run: `document.querySelectorAll('script[type="application/ld+json"]')`
2. **Google Rich Results Test** — https://search.google.com/test/rich-results
3. **Screaming Frog export** — if the client provides one, use it (SF renders JavaScript)
**Never report "no schema found" based solely on `web_fetch` or `curl`.** This has led to false audit findings in production.
### Priority Order
1. **Crawlability & Indexation** (can Google find and index it?)
2. **Technical Foundations** (is the site fast and functional?)
@ -364,10 +377,12 @@ Same format as above
- Google Search Console (essential)
- Google PageSpeed Insights
- Bing Webmaster Tools
- Rich Results Test
- Rich Results Test (**use this for schema validation — it renders JavaScript**)
- Mobile-Friendly Test
- Schema Validator
> **Note on schema detection:** `web_fetch` strips `<script>` tags (including JSON-LD) and cannot detect JS-injected schema. Always use the browser tool, Rich Results Test, or Screaming Frog for schema checks. See the warning at the top of the Audit Framework section.
**Paid Tools** (if available)
- Screaming Frog
- Ahrefs / Semrush