Sitemap Parser API - Extract URLs from XML Sitemaps
Fetches a sitemap URL and parses either <urlset> (regular sitemap) or <sitemapindex> (index of sub-sitemaps). Returns each entry's loc, with lastmod, changefreq, and priority when present. The type field indicates which format was detected.
Code examples
curl -X POST https://api.botoi.com/v1/sitemap \
-H "Content-Type: application/json" \
-d '{"url":"https://vercel.com/sitemap.xml","limit":3}'When to use this API
Crawl a site respecting its sitemap
Fetch /sitemap.xml, recurse into any sitemapindex children, and queue each URL for crawling. Much more efficient than following hyperlinks, and it surfaces pages that nothing links to from the homepage.
SEO content audit
Pull the full sitemap for your site and cross-reference lastmod against the actual last-modified date of each page. Stale lastmod values hurt crawl budgets; fix them in your sitemap generator.
Diff sitemaps between deploys
Capture the sitemap during each deploy and compare new URLs added or removed vs. the previous deploy. Catches accidental unpublishing or inadvertent page exposure.
Frequently asked questions
What is the difference between urlset and sitemapindex?
Does this follow sitemapindex children automatically?
What is the URL limit per request?
Can I parse compressed (.gz) sitemaps?
What timeout applies to the fetch?
Get your API key
Free tier includes 5 requests per minute with no credit card required. Upgrade for higher limits.