Sitemap (sitemap.xml) is a file with a list of website pages. Sitemap tells search engines what pages you'd like them to index when they were last updated, how often they should be re-crawled and their priority against other pages on your website. Sitemap.xml helps web crawlers scan all pages on your website and correctly determine the webpage priority.
To submit sitemap.xml to search engines, type in your file URL in search engine webmaster tools dashboards (Google Search Console, Bing - Webmaster Tools, etc.) and add a string in robots.txt as follows:
Sitemap tool includes a Sitemap generator, a sitemap loader, and a scheduled autorefresh.
• Utilize a Sitemap generator if your CMS doesn't support an up-to-date complete sitemap.xml creating. Keep a generated sitemap updated by refreshing it from time to time.
• Use Sitemap loader and a sitemap autorefresh combined with the related audit tools: Website Audit tool, Index checker, and Website Watcher.
How do I generate a sitemap?
If you wish to generate a sitemap, complete as follows:
- Create project or pick one from the list.
- Move to Sitemap tool page and press Generate sitemap.
- Pick website page count.
- Press Generate.
To download a sitemap to your computer, press Download.
You can download a sitemap in XML or CSV formats.
Notice that a generated sitemap contains only XML mandatory tags.
||required||References the current protocol standard.|
A parent XML tag for each URL.
The remaining tags are child elements of this tag.
A URL of a webpage.
It must begin with the protocol (for example, HTTP) and end with a trailing slash if a
webserver requires it. This value must be less than 2,048 characters.
The last modified date of a sitemap. The date should be in W3C Datetime format. This
format allows to omit the time and use YYYY-MM-DD.
Notice that this tag has nothing to do with an "If-Modified-Since (304)" header that
a server can return, search engines can use information from both sources differently.
How frequently the page is likely to change. This value provides general information to
search engines and may not correlate exactly to how often they crawl the page.
Valid values are:
The value "always" should be used to describe documents that change each time they
are accessed. The value "never" should be used to describe archived URLs.
Please note that the value of this tag is considered a hint and not a command. Even
though search engine crawlers may consider this information when making decisions,
they may crawl pages marked "hourly" less frequently than that, and they may crawl
pages marked "yearly" more frequently than that. Crawlers may periodically crawl
pages marked "never" so that they can handle unexpected changes to those pages.
The priority of this URL relative to other URLs on your site. Valid values range from 0.0
to 1.0. This value does not affect how your pages are compared to pages on other sites—
it only lets the search engines know which pages you deem most important for the
The default priority of a page is 0.5.
Please note that the priority you assign to a page is not likely to influence the position
of your URLs in a search engine's result pages. Search engines may use this information
when selecting between URLs on the same site, so you can use this tag to increase the
likelihood that your most important pages are present in a search index. Also, please
note that assigning a high priority to all of the URLs on your site is not likely to help you.
Since the priority is relative, it is only used to select between URLs on your site.
How can I utilize a sitemap?
Related tools (Website Audit tool, Index checker, Website Watcher) can use generated sitemaps to pull pages to scan and crawl or to calculate the report cost. You can import pages from a generated sitemap in the tools settings.
Sitemap autorefresh tool
Sitemap autorefresh tool crawls sitemap pages to detect possible changes and refresh a sitemap when those are discovered. The tool sends alerts to the associated Email address or displays in the Notification center.
To schedule a sitemap autorefresh, complete as follows:
- Move to the Sitemap tool page and switch to the Settings page (1).
- Check box near Autorefresh sitemap from file (2).
- Enter the path to your sitemap file.
- Pick days of month (one, several or daily) (3).
- Set time to refresh (4).
- Pick the preferable alert option: Notification center or Email (5).
Auto-import fresh data
Topvisor can automatically import and submit sitemap changes to lists of pages in the related audit tools settings. Hence, you'll be able to track all newly added sitemap pages and/or stop tracking removed pages.
- Pick a tool (1).
- Add a mask to refresh (2).*
- Add a max page count (3).**
*The default mask is '/' ('slash' refers to the root directory). If not set otherwise, the tool will crawl all website pages to discover removed and added pages and to update the list of pages assigned to audit tools accordingly.
You may wish to track only certain webpages or website sections ( e.g. a product line). To make the tool track only specific sections, submit a mask to refresh selectively.
**CMS crash may result in new pages (e.g. 100 at once) being accidentally added to your sitemap. These pages will be imported to the tool with the sitemap autorefresh turned on. If the tool has scheduled tasks, it will process all new pages on auto-pilot. As the result, the total report cost will be calculated as follows: the number of pages multiplied by 1-page check cost.
Max page count will prevent possible money losses. Please pay close attention to this option.