LAGIC
Lead Audience Growth Intelligence Computing

Find Sitemap from url — Website | Lagic

Built for: SEO Agencies, E-commerce, Market Research

Find every XML sitemap for any website to prepare for SEO audits and data scraping.

Curated by Lagic · Verified working

Configure Agent

The full URL of the website where you want to find sitemaps. Must include the protocol (http:// or https://). The Agent will check robots.txt, common sitemap locations, and HTML content to discover all XML sitemaps.
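As a rough illustration of the protocol requirement, the check below accepts only URLs with an `http` or `https` scheme and a host. The function name `has_valid_protocol` is hypothetical, and this is a sketch of one reasonable check, not the Agent's actual validation logic.

```python
from urllib.parse import urlparse


def has_valid_protocol(url: str) -> bool:
    """Return True if the URL has an http/https scheme and a host part."""
    parsed = urlparse(url.strip())
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


print(has_valid_protocol("https://www.example.com"))
print(has_valid_protocol("www.example.com"))
```

A bare hostname such as `www.example.com` fails the check because it has no scheme, which matches the requirement stated above.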

When enabled, the Agent searches for ALL sitemaps on the website including nested sitemap indexes. When disabled, it returns only the primary sitemap. Enable this to get a comprehensive list of all available sitemaps.
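To make "nested sitemap indexes" concrete: in the sitemaps.org protocol, a sitemap index is an XML file whose `<sitemapindex>` root lists further sitemaps in `<sitemap><loc>` entries. The sketch below extracts those child URLs from an index document already held in memory. The helper name `child_sitemaps` and the sample URLs are assumptions for illustration; how the Agent itself handles indexes is not documented here.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def child_sitemaps(index_xml: str) -> list[str]:
    """Return the <loc> URLs listed in a sitemap index document."""
    root = ET.fromstring(index_xml)
    if root.tag != f"{SITEMAP_NS}sitemapindex":
        return []  # a regular <urlset> sitemap has no nested sitemaps
    return [
        loc.text.strip()
        for loc in root.iterfind(f"{SITEMAP_NS}sitemap/{SITEMAP_NS}loc")
        if loc.text
    ]


# Hypothetical sample index with two nested sitemaps.
sample_index = (
    '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    "<sitemap><loc>https://www.example.com/sitemap-posts.xml</loc></sitemap>"
    "<sitemap><loc>https://www.example.com/sitemap-pages.xml</loc></sitemap>"
    "</sitemapindex>"
)
print(child_sitemaps(sample_index))
```

A full crawl would fetch each returned URL and repeat the check, since an index may point at further indexes.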

When enabled, the Agent skips XML format validation of discovered sitemaps. Use this option if sitemaps are being rejected due to minor formatting issues. When disabled, only properly formatted XML sitemaps are returned.
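For a sense of what XML format validation could involve, the sketch below treats a response body as a sitemap only if it parses as well-formed XML and its root element is `urlset` or `sitemapindex`. This is an assumed, simplified check (the name `looks_like_sitemap` is hypothetical); the Agent's real validation rules may be stricter or looser.

```python
import xml.etree.ElementTree as ET

# Root element names allowed by the sitemaps.org protocol.
SITEMAP_ROOTS = {"urlset", "sitemapindex"}


def looks_like_sitemap(body: str) -> bool:
    """Return True if body is well-formed XML with a sitemap root element."""
    try:
        root = ET.fromstring(body)
    except ET.ParseError:
        return False  # not well-formed XML
    # Drop the "{namespace}" prefix ElementTree puts in front of the tag.
    local_name = root.tag.rsplit("}", 1)[-1]
    return local_name in SITEMAP_ROOTS


good = '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"></urlset>'
broken = "<urlset><url><loc>https://www.example.com/?a=1&b=2</loc></url></urlset>"
print(looks_like_sitemap(good), looks_like_sitemap(broken))
```

The second sample fails only because of an unescaped `&`, the kind of minor formatting issue for which skipping validation may be useful.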

Maximum time in seconds to wait for each HTTP request to complete. Increase this value for slow websites or decrease for faster execution. Valid range: 1-60 seconds.
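The 1-60 second range can be pictured as a simple bounds check applied before each request. The sketch below is an assumption about how such a limit might be enforced with Python's standard library; `validate_timeout` and `fetch` are hypothetical names, and whether the Agent rejects or silently adjusts out-of-range values is not stated.

```python
import urllib.request

MIN_TIMEOUT_S = 1   # lower bound of the documented valid range
MAX_TIMEOUT_S = 60  # upper bound of the documented valid range


def validate_timeout(seconds: float) -> float:
    """Return seconds unchanged, or raise ValueError if outside 1-60."""
    if not MIN_TIMEOUT_S <= seconds <= MAX_TIMEOUT_S:
        raise ValueError(
            f"timeout must be between {MIN_TIMEOUT_S} and {MAX_TIMEOUT_S} seconds"
        )
    return seconds


def fetch(url: str, timeout_s: float) -> bytes:
    """Fetch a URL, giving up after timeout_s seconds (per request)."""
    with urllib.request.urlopen(url, timeout=validate_timeout(timeout_s)) as resp:
        return resp.read()
```

Because the limit applies per request, a run that probes many candidate locations on a slow site can take several multiples of this value in total.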

When enabled, the Agent outputs detailed logs about the sitemap discovery process including checked URLs, response times, and found links. Useful for debugging. When disabled, only essential information is logged.

Results to deliver

6,800 credits

This agent actively searches live listings — results may vary. You are only charged for what is delivered, up to this number.

Lagic Proxy

Country auto-rotated. Need a specific region? Contact support.

Pricing

68 credits per result
✓ 30 free credits on signup
✓ Refund if 0 results
✓ No card required
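The pricing arithmetic can be sketched as follows, assuming the charge is simply the number of delivered results multiplied by 68 credits and capped at the requested maximum. Under that assumption, the 6,800-credit figure shown above corresponds to a cap of 100 results. The function name `run_cost` is hypothetical and the actual billing rules may differ.

```python
CREDITS_PER_RESULT = 68  # rate quoted on this page


def run_cost(results_delivered: int, max_results: int) -> int:
    """Credits charged for a run: delivered results only, capped at the maximum."""
    billable = min(results_delivered, max_results)
    return billable * CREDITS_PER_RESULT


print(run_cost(100, 100))  # full cap delivered
print(run_cost(12, 100))   # fewer results than the cap
print(run_cost(0, 100))    # nothing delivered, nothing charged
```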

Sample Data Preview

| A list of all discovered sitemap URLs, including those from sitemap indexes. | The total count of sitemaps found for the specified domain. | The original website URL that was scanned. |
| --- | --- | --- |
| https://... | 305 | https://... |
| https://... | 258 | https://... |
| ... | ... | ... |
Exports as: CSV, XLSX, JSON

Overview

Enter any website URL to automatically discover all its XML sitemaps. The tool checks robots.txt, common sitemap locations, and HTML to provide a complete list of sitemap URLs.

A sitemap is the official roadmap to a website's content, created for search engines and other automated tools. Before you can analyze a competitor's content or scrape a site for data, you need to know what pages exist. This tool automates the discovery process, saving you the manual effort of hunting for sitemap files.

### How it Works

Simply provide a website URL, and the tool intelligently searches for sitemaps using multiple methods:

1. **robots.txt Check:** It first checks the `robots.txt` file, which is the standard place to declare sitemap locations.
2. **Common Locations:** It then probes for over a dozen common sitemap filenames and paths (e.g., `sitemap.xml`, `sitemap_index.xml`, `sitemap.xml.gz`).
3. **Homepage Scan:** It also inspects the website's homepage for any links pointing to a sitemap.

This multi-pronged approach ensures you find all sitemaps, including those that are nested within a sitemap index file.

### Who It's For

- **SEO Specialists:** Quickly gather a complete list of a client's or competitor's indexable pages for a technical audit.
- **Data Analysts & Researchers:** Use the sitemap as the starting point for a large-scale web scraping project to ensure complete data coverage.
- **Web Developers:** Verify that sitemaps are correctly generated and accessible after a site migration or major update.
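The three discovery methods described under "How it Works" can be sketched in Python as pure functions that operate on already-downloaded text. Everything here is illustrative: the function names are hypothetical, `COMMON_PATHS` contains only the three filenames named above (the tool's full list is not published on this page), and the homepage scan uses a simple "href contains sitemap" heuristic that is an assumption rather than the tool's documented behavior.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Only the filenames mentioned in the text; the real list is longer.
COMMON_PATHS = ["sitemap.xml", "sitemap_index.xml", "sitemap.xml.gz"]


def sitemaps_from_robots(robots_txt: str) -> list[str]:
    """Method 1: collect URLs from 'Sitemap:' lines in robots.txt."""
    found = []
    for line in robots_txt.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() == "sitemap" and value.strip():
            found.append(value.strip())
    return found


def candidate_urls(base_url: str) -> list[str]:
    """Method 2: build URLs for common sitemap filenames at the site root."""
    return [urljoin(base_url, "/" + path) for path in COMMON_PATHS]


class _SitemapLinkParser(HTMLParser):
    """Collects href values on <a>/<link> tags that mention 'sitemap'."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.links: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag not in ("a", "link"):
            return
        href = dict(attrs).get("href")
        if href and "sitemap" in href.lower():
            self.links.append(urljoin(self.base_url, href))


def sitemaps_from_homepage(html: str, base_url: str) -> list[str]:
    """Method 3: scan homepage HTML for links that point at a sitemap."""
    parser = _SitemapLinkParser(base_url)
    parser.feed(html)
    return parser.links


robots = "User-agent: *\nDisallow: /admin\nSitemap: https://www.example.com/sitemap.xml\n"
print(sitemaps_from_robots(robots))
print(candidate_urls("https://www.example.com"))
```

In a complete pipeline, each candidate from methods 2 and 3 would still need to be fetched and checked, and results from all three methods would be de-duplicated before reporting.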

Key Capabilities

  • A list of all discovered sitemap URLs, including those from sitemap indexes.
  • The total count of sitemaps found for the specified domain.
  • The original website URL that was scanned.
  • Discover all sitemaps for a competitor to analyze their site structure and content strategy.
  • Generate a comprehensive list of URLs to feed into a web scraper for market research.
  • Perform a technical SEO audit by finding the official list of pages a website wants indexed.
  • Create a complete content inventory for a website before a migration or redesign project.
  • Verify that a newly deployed website has a correct and accessible sitemap.
  • Find sitemaps for multiple websites in bulk to prepare for a large-scale data extraction campaign.
  • Monitor a website's structure over time by regularly checking for changes in its sitemaps.

Field Dictionary

How To Run This Extractor

1. Enter the full URL of the target website.
2. Enable 'Find all sitemaps' to discover nested sitemaps within a sitemap index.
3. Optionally, enable 'Skip verification' for websites with minor sitemap formatting errors.
4. Adjust the 'Request timeout' if the target website is slow to load.
5. Run the tool to start the discovery process.
6. Download the results as a JSON file containing the list of found sitemap URLs.

Frequently Asked Questions

What do I need to use this tool?
You only need the full URL of the website you want to check, for example, `https://www.example.com`.
What format is the output?
Is it legal to find a website's sitemap?
Can I check many websites at once?
How is this different from just typing `domain.com/sitemap.xml` in my browser?
What does the 'Find all sitemaps' option do?
Why would I need to 'Skip verification'?
How up-to-date is the sitemap information?
Can I schedule this tool to run periodically?
Why would I increase the request timeout?