seo
>99% Success Rate
Schema Markup Scraper & SEO Auditor
Extract JSON-LD, Microdata, RDFa, Open Graph, and Twitter Cards from any URL with a comprehensive SEO audit scoring system.
TypeScript
Crawlee
Global
Key Features
Structured data extraction — JSON-LD, Microdata, and RDFa
Social meta tags — Open Graph, Twitter Cards, Dublin Core
SEO analysis with 0-100 scoring
Canonical URL and hreflang validation
Author extraction for EEAT signals
LocalBusiness detection with 80+ subtypes
Image alt text audit
Breadcrumb schema validation
Geo tags and NAP extraction
Use Cases
- Technical SEO auditing at scale
- Structured data validation for websites
- Competitive SEO analysis — compare schema markup across competitors
- EEAT signal assessment for content sites
- Local SEO auditing for businesses
- Pre-launch SEO checklist validation
Input Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
startUrls | array | Yes | URLs to analyze |
proxy | object | No | Proxy configuration |
maxRequestsPerCrawl | number | No | Limit total URLs to audit |
maxConcurrency | number | No | Parallel requests |
extractMetaTags | boolean | No | Extract meta tags (default: true) |
extractSeoAnalysis | boolean | No | Run SEO analysis (default: true) |
computeSeoScore | boolean | No | Calculate 0-100 SEO score (default: true) |
extractGeoData | boolean | No | Extract geo tags and NAP data |
Output Example
1{
2 "url": "https://example.com/product/...",
3 "title": "Example Product Page",
4 "linkedData": [
5 { "@type": "Product", "name": "..." }
6 ],
7 "openGraph": {
8 "og:title": "Example Product",
9 "og:type": "product"
10 },
11 "twitterCard": {
12 "card": "summary_large_image"
13 },
14 "seoAudit": {
15 "score": 78,
16 "issues": [
17 "Missing alt text on 3 images",
18 "No hreflang tags detected"
19 ]
20 },
21 "headings": {
22 "h1": ["Example Product"],
23 "h2": ["Description", "Reviews"]
24 }
25}
Frequently Asked Questions
What structured data formats are supported?
JSON-LD, Microdata, and RDFa. The scraper also extracts Open Graph, Twitter Cards, and Dublin Core metadata.
How is the SEO score calculated?
The 0-100 score evaluates title tags, meta descriptions, heading hierarchy, image alt text, canonical URLs, mobile viewport, structured data presence, and more.
Can I audit multiple pages at once?
Yes, provide multiple URLs in startUrls. The scraper processes them in parallel for fast bulk auditing.
Related Tools
⚡
testingGlobal API Load Tester
Simulate 10K+ RPS with geo-distributed load testing.
Learn more🖨️
mediaUniversal Web Printer
Convert URLs and HTML to PDF, PNG, JPEG, or WebP.
Learn more