Validate any XML sitemap — or just enter your domain and the tool auto-discovers your sitemap via robots.txt. Check XML structure, URL count, lastmod freshness, sitemap type, and get a health score out of 100.
Enter your domain to auto-discover the sitemap via robots.txt, or enter the direct sitemap.xml URL.
Enter just your domain (e.g. example.com) and the tool automatically finds your sitemap by checking robots.txt and trying common sitemap paths. Or enter the direct sitemap URL (e.g. example.com/sitemap.xml) for a direct check.
The tool parses your XML sitemap and checks: is the XML well-formed, is the namespace correct, how many URLs are listed, do URLs use HTTPS, are lastmod dates present and recent, is it a sitemap index or regular sitemap, and does it stay within Google's 50,000 URL limit.
The health score (0-100) summarises all checks. Common fixes include: adding lastmod dates, replacing HTTP with HTTPS URLs, splitting oversized sitemaps, and correcting malformed XML. After fixing, re-submit to Google Search Console and re-run this validator to confirm.
An XML sitemap is a file that lists the URLs on your website for search engines to crawl. It helps Google discover all your important pages, especially new content and pages with few internal links. Without a sitemap, search engines may miss pages that are not well linked internally.
Each XML sitemap file can contain a maximum of 50,000 URLs and must be smaller than 50MB uncompressed. If your site has more pages, use a sitemap index file that references multiple individual sitemaps. This checker warns you if your sitemap approaches these limits.
A sitemap index file is a special XML file that lists references to multiple individual sitemap files rather than URLs directly. It allows large sites to split their URLs across multiple sitemaps while submitting a single index file to search engines. This checker detects whether your sitemap is an index or a regular sitemap.
lastmod should be a valid date in W3C datetime format: YYYY-MM-DD (e.g. 2025-03-15) or full datetime with timezone (e.g. 2025-03-15T14:30:00+00:00). Dates in the future or more than 10 years in the past may be ignored by Google. Providing accurate lastmod dates helps Google prioritise recrawling of recently updated pages.
No — only include canonical, indexable pages. Do not include: pages with noindex meta tags, pages blocked by robots.txt, paginated pages (unless they have unique content), session-based or parameterised URL variants, 404 pages, 301 redirect pages, or duplicate content pages with canonical tags pointing elsewhere.
Submit through Google Search Console: go to Sitemaps in the left panel, enter your sitemap URL, and click Submit. Google will show the submission date, number of URLs discovered, and any errors. Re-submit after making significant changes to your sitemap. You can also add your sitemap URL to robots.txt with a Sitemap: directive.