SEO
35 Minutes

XML sitemaps and SEO: Your comprehensive guide

Stefanie Sommer

9.4.2025

Have you ever wondered how you can help search engines like Google find all the important pages on your website? Get right here XML sitemaps into the game. An XML sitemap is basically nothing more than a table of contents for your website — but for search engines instead of for people. In the past, websites often had HTML sitemaps that visitors could click through, but these were hardly exciting for human users. The XML sitemap, on the other hand, is a machine-readable format that lists all relevant URLs on your website.

Why is that important? Imagine that Google is visiting your website. Of course, Google can find many pages via internal links. But no website is perfectly linked, and especially with large or new websites, there is a risk that some pages will remain “under the radar.” According to Google, XML sitemaps are the second most important Source to discover new content.

In other words, a good sitemap ensures that Google & Co. does not overlook any important content. It provides search engines with valuable additional information — for example, which pages are particularly important to you and when they were last updated. In short: XML sitemaps speed up and improve the indexing of your content in search engines.

In this guide, you will find out in a practical and understandable way what an XML sitemap is, and when you do it Really needhow to create and optimize them, and what typical mistakes you should avoid. Whether you're an SEO professional, marketing manager, or interested newbie — here you'll find helpful tips from basics to advanced best practices. So let's get started and make your website sitemap-fit 😉.

What is an XML sitemap?

An XML sitemap is a special file (in XML format) that lists all important pages (URLs) on your website so that search engines a complete overview receive about your content. Think of them as a digital table of contents or a phone book for your website. This file contains the URLs of your pages and, optionally, additional information about each URL, such as when the page was last changed or whether there are alternative language versions. However, these extras are optionally. Google itself explains:

“A sitemap tells Google which pages and files on your website you think are particularly important, and it also provides valuable additional information, such as when the page was last updated or about alternative language versions of the page. “

Important: One XML sitemap is primarily intended for search engines, not for visitors to your site. In contrast to the HTML sitemap, which a person could read in the browser, the XML sitemap is structured in such a way that crawlers (such as Googlebot) can process it easily and quickly. While both types of sitemaps share the same goal — to help search engines discover content — they do so in different ways. The HTML version can also be used by users for navigation, while the XML version is a Feed for search engines is.

<urlset>A typical XML sitemap file starts with a declaration of the XML version and an element with the appropriate namespace. For each URL, there is a <url>block that contains at least the address, for example:

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>https://www.your-domain.com/example-page/</loc>
      <lastmod>2025-03-01</lastmod>
      <changefreq>weekly</changefreq>
      <priority>0.8</priority>
   </url>
</urlset> 

The tags<lastmod>, <changefreq>and <priority>are optional. You can specify when the page was last modified, approximately how often it changes, and what priority it has relative to your other pages. But watch out: search engines Most often ignore this information to a large extent

Google, for example, prefers to rely on its own findings and treat the priority and changefreq fields as clues rather than commands. So you don't have to worry about entering perfect values everywhere — a valid URL list is all you need.

In summary: The XML sitemap is a simple file (usually called sitemap.xml) that contains all important URLs on your website. It helps search engines that entire scope understand your website and discover new or updated pages more quickly In the next section, we'll look at whether any website requires such a site  or when its use is particularly useful.

Sitemap example for an e-commerce website with category, product, blog and about pages.

When is an XML sitemap necessary?

You might be wondering: Does every website really need a sitemap? — In theory no, but it is almost always practical recommendable. First of all: It never hurts to have a correct XML sitemap (as long as you don't make nonsense with it). Google won't punish you without a sitemap, but you could miss out on opportunities

An XML sitemap is particularly useful in the following cases:

  • Very large websites: If you have hundreds or even thousands of pages (e.g. large online stores, news sites, forums), the sitemap helps the crawler immensely. The bigger and more complex the page, the sooner you need a sitemap. Google can also crawl large pages, but a sitemap directly delivers a List of all pages, which makes crawling more efficient.
  • New websites: A new website often still has a few incoming links from outside. Search engines could only discover them slowly. With a submitted sitemap, you are actively telling Google: “Here are my pages, please have a look!” This can speed up indexing. This can give a noticeable boost, especially in the first few weeks of a website.
  • Websites with few internal links/orphaned pages: Do you have pages that are difficult or impossible to access via the navigation (Orphan Pages)? These can be easily overlooked. A sitemap ensures that even such isolated content is known. Of course, you should also improve your internal linking in the long term, but the sitemap will catch many gaps by then.
  • Pages with lots of pictures/videos or special formats: If your business relies heavily on image galleries, videos, or other media, sitemaps can also help. There is Sitemap extensions for images, videos and news to provide additional information (e.g. image subtitle, video duration, news release date). In this way, search engines not only find the page itself, but also understand embedded media better.
  • Frequently updated websites: Do you regularly publish new content or often change pages (e.g. news sites, blogs)? A sitemap with correct last-modified data can show Google which content is fresh. Although Google crawls a lot of its own accord, the sitemap can be used as signal serve to ensure that new URLs exist or that there are changes.

And when is a sitemap lesser urgent? With small websites With a manageable structure (e.g. the classic one-pager or a company website with 5-10 pages), Google can find everything even without a sitemap. If all pages are well linked together, the crawler will get through easily. But: Since modern CMS often generate a sitemap automatically, there is nothing wrong with having one even for a small page — safe is safe. The XML sitemap acts as an assurance that nothing is overlooked. Google itself says that sitemaps are particularly useful for large, frequently updated websites be recommended.

Our tip: invest a few minutes to set up a sitemap, no matter how small your site is. You don't lose anything — quite the opposite. And as pages grow or become more complex, you'll be happy that you've maintained the sitemap from the start.

Best practices for XML sitemaps

An XML sitemap is only useful if it cleanly created and maintained will. In the worst case scenario, a bad sitemap can send false signals or waste crawler resources. To make your sitemap work optimally, you should follow the following best practices:

  • Include indexable pages only: list only URLs that should be indexed by search engines! Sounds banal, but it is often done wrong. Avoid “Garbage URLs” such as pages with a noindex tag, 404 error pages, redirect URLs, or duplicate content variants. Having URLs like this in the sitemap is counterproductive—you're telling Google that they're important pages even though they're not. Better: Only genuine, 200-OK pages without a noindex and with canonical URL Belong on yourself. A clean XML sitemap contains only indexable pages.
  • Keep up to date: The sitemap should always reflect the current status of your website. When you publish new content, it must appear in the sitemap promptly. In the same way, deleted or moved pages must not remain in the sitemap for months. An outdated sitemap that lists pages that have long been removed will only confuse Google (and may lead to unnecessary crawls into the void). It is ideal, the sitemap automatically to be generated — e.g. via your CMS or a plugin — so that every change is taken into account immediately. Many content management systems such as WordPress, Joomla & Co. have corresponding functions or plugins that update the sitemap with every change
    Use this automation so you don't have to run after you by hand.
  • No duplicate URLs & no unnecessary parameters: Make sure that each URL is only unodc appears in the sitemap. If your site is accessible via several URL variants (with/without www., with different parameters, etc.), you should only use the canonical List variant. Unwanted parameters (session IDs, tracking parameters like? utm_source=... etc.) have no place in the sitemap. They bloat the file and cause Google to waste time with redundant URLs. Make sure that your sitemap generator filters out such parameters.
  • Structured layout for large websites: If your site has an extremely large number of URLs, it's a good idea to split up the sitemap. Google sets an upper limit of 50,000 URLs resp 50 MB file size (uncompressed) firm.
  • For very large projects, you can therefore create several sitemap files (e.g. divided by subject areas or page types) and share them in a higher-level Sitemap index file summarize. A sitemap index is like a sitemap of sitemaps — it lists the individual sitemap files. This makes administration clearer and avoids size limits. Attention: If you use a lot of sitemaps, remember to log every new file back into Google or add it to the index.

Reference the file in the default location and in robots.txt: You usually put the sitemap.xml in root directory from the website (e.g. https://www.deine-domain.de/sitemap.xml). This allows you to quickly find it at the main address. In addition, you should include the sitemap in your robots.txt Enter a file. example:

Makefile
CopyEdit
User agent: *

Disallow: /internal-directory/

Sitemap: https://www.deine-domain.de/sitemap.xml

  • The sitemap: in robots.txt helps all crawlers (not just Google) get directly to the sitemap. Since robots.txt is usually the first thing a crawler fetches, you make sure that the search engine knows where your table of contents is right from the start
  • Submission to Search Console (and other search engines): Even though Google can find your sitemap via robots.txt should, is it recommended that they also manually submit to Google Search Console. This gives you the submission in black and white and can later see in Search Console exactly how many URLs Google has indexed from your sitemap and whether there are any problems (more about this in the next section).
    The same applies to Bing: In Bing Webmaster Tools, you can also store your sitemap to actively point this out to Bing. Yahoo and DuckDuckGo mostly use Bing data, and Yandex and Baidu have their own webmaster tools — depending on whether these search engines are relevant to you, you can also enter your sitemap there
  • No superfluous information and gimmicks: As mentioned above, fields such as <changefreq>and <priority>in the sitemap are optional. Many webmasters meticulously enter values here in the hope that Google, for example, would give preference to pages with priority 1.0. In practice, you can usually save yourself the trouble — Google largely ignores priority information.
    Even the Change Frequency (“frequency of change”) is often misunderstood. If you stubbornly enter “daily” everywhere, even though the page doesn't change daily at all, Google quickly notices this and doesn't attach any importance to the field. Our advice: Leave these fields out unless you can fill them with real data in a meaningful way. One exception is the <lastmod>date: If you can maintain it reliably automatically (e.g. always the publication/update date for a blog), then it doesn't hurt to specify this — lastmod is particularly helpful for news pages. But avoid “manipulating” Lastmod (such as daily up-dating without actual changes) — you won't gain anything.
  • Hreflang in sitemaps only for professionals: <xhtml:link>In XML sitemaps, it is possible to use so-called tags for Hreflang to use (provide alternative language and country versions). This can be useful for international websites. But be careful: If Hreflang is already specified in the HTML code of the pages, there is a risk of conflicting information if the sitemap contains other information. Contradictions are the last thing we want — Google could then both Ignore information. If you want to use Hreflang in the sitemap, make sure that the data 100% match on-page hreflang tags. If in doubt, leave Hreflang out of the sitemap and manage it only on the site. Our recommendation (for most cases): Really in the sitemap only the URLs lists. All other detailed information (language versions, priorities, etc.) is optional and can also be omitted — this makes the file simpler and less prone to errors.

If you follow these best practices, you've already won a lot. Your sitemap becomes a reliable assistant for crawling, rather than a potential source of error. In the next step, we'll figure out how to correctly submit the sitemap to Google and what you need to consider when indexing it.

Indexing & Google Search Console

The best sitemap is of little use if search engines are unaware of its existence. Therefore, the process of Submitting and monitoring in appropriate tools is important. Here is a short guide on how to proceed and what you should pay attention to:

1. Submit to Google

As soon as your XML sitemap is ready and available online, you should use it in the Google Search Console (GSC) sign in. Sign in to the GSC, select your property (website) and navigate in the menu to “Sitemaps”. There you can find the URL of your sitemap (e.g. https://www.deine-domain.de/sitemap.xml) enter and send. The Search Console will attempt to retrieve the file and give you immediate feedback as to whether the retrieval thriving was. Ideally, the status “Successful” appears with an indication of the number of URLs found. If there are problems (e.g. the file not found, the format is incorrect, etc.), the GSC reports an error, which you should then fix.

2. Status check

After you submit the sitemap, you can see some useful information in the Sitemaps report in Search Console. Among other things, there are:

  • URLs detected: How many pages did Google find in the sitemap file (e.g. “URLs detected: 120”).
  • Indexed URLs: How many of the submitted URLs are already in the index (e.g. “Indexed: 110"). This number is particularly interesting — it shows whether Google has actually included the majority of your pages.
  • Errors/Warnings: If certain URLs couldn't be indexed or the sitemap has problems, you'll see warnings here. Common examples: “URL blocked by robots.txt”, “Alternative site with canonical tag” (i.e. Google has identified the page as a duplicate and is indexing the main version) or “Not found (404) pages in sitemap”.

This information helps you immensely. For example, you can quickly see if all important pages indexed were or whether some may not be in the index even though they are in the sitemap. But remember: Even if you submit a URL via a sitemap, there is no guaranteethat Google is also crawling or indexing them. The sitemap is an invitation, not a command. For example, it may happen that the search engine shows “20 URLs discovered, 18 indexed” — Google has not (yet) included the missing 2 pages, possibly due to quality problems or because they are very similar to other pages. That is normal. The only important thing is that no important Page remains permanently unindexed.

Tip: If the GSC consistently shows significantly fewer indexed URLs than submitted URLs, you should check why (content thin? Duplicate content? Crawling issues?). The sitemap has done its job, Google knows about the page — but perhaps Google does not (yet) consider it worthy of indexing. This means researching the causes.

3. Monitoring and troubleshooting

Check the Search Console regularly to keep an eye on the health of your sitemap and indexing. The GSC also reports, for example, if a submitted sitemap is no longer available later or if new errors occur. A common mistake Is, for example, that you forget to update the sitemap after a domain change or relaunch, and then Google crawls an old sitemap for a while that points to the old domain — this is easy to avoid if you always check the sitemap settings after making changes.

Ways to share the sitemap with Google, we've already mentioned: via robots.txt and the Search Console. You can also use the direct route: Once submitted, Google will regularly retrieve your sitemap by itself. If you're in a hurry (e.g. with an important update), you can go to the GSC on “Resubmit” click — this triggers another crawl of the sitemap. As a rule, however, this is not necessary; Google will drop by from time to time anyway.

4. Submitting to other search engines

In addition to Google — depending on your target group — you shouldn't forget Bing. In the Bing Webmaster Tools There is also a sitemaps area that works in a similar way. Bing's market share is significantly smaller (often only ~2—3% in the B2C sector, possibly slightly higher at ~20-30% in B2B), but when a sitemap already exists, it hardly takes time to submit it there as well. Bing also has an interesting new approach called IndexNow, which is discussed briefly in the next section (FAQ). Yandex and Baidu also offer options, but these are only relevant if you are specifically represented in Russia and China.

In summary: Google Search Console is your most important tool for indexation to monitor your sitemap URLs. It shows you whether your sitemap was successfully processed and what the indexing status of your pages is. Remember that although a sitemap improves discovery, no guarantee of indexing offers — Google still decides for itself what goes into the index. Nevertheless, the sitemap is an indispensable means of keeping Google up to date and quickly identifying potential problems.

Technical requirements and file size

When creating an XML sitemap, you need to follow a few technical rules so that search engines easily accept the file. Here are the most important requirements and limits according to the official Sitemaps log and Google's guidelines:

  • Maximum URLs per file: A single sitemap file can have up to 50,000 URLs included. That sounds like a lot — and that's easily enough for most websites. If you still have more pages (large e-commerce sites, news archives, etc.), distribute the URLs to several sitemaps and use a sitemap index (see next point).
  • Maximum file size: The sitemap file may be uncompressed at most 50MB Be big. The same applies here: larger than 50 MB = divide. By the way, Google and Bing allow you to submit compressed sitemaps (gzip). So if you upload your sitemap.xml packaged as sitemap.xml.gz, the packed The file may be smaller, but the size is decisive upon unpacking (it must be less than 50 MB). Previously, this limit was 10 MB, but was increased to 50 MB to suit modern websites. In practice, 50 MB ~ almost exactly 50,000 average-sized URLs — so you usually reach the URL limit first anyway.
  • Format and coding: The file must be in XML format exist, be UTF-8 encoded and comply with the official scheme. This means, among other things, that all < and & etc. in URLs must be correctly validated/escaped. A typical structure has already been outlined above. If you create a sitemap manually or use an unknown tool, it's best to validate the file with a sitemap validator or directly in the Search Console (if it's incorrect, GSC will report this). Syntax errors can prevent Google from reading the entire file — you should definitely avoid that.
  • Valid tags: <url>The following XML tags are mainly allowed in a URL listing: <urlset>as a container, including several entries. Within <url><loc>(the URL itself) is mandatory. You can optionally add <lastmod>(last change date), <changefreq>(change frequency) and <priority>(priority). There are extensions for special sitemaps, e.g. <image:image>for images or <video:video>for videos. Important: Follow the specification carefully — a common mistake is, for example, to enter date values in the wrong format (must be ISO-8601, e.g. 2025-03-14) or to include your own, prohibited fields. That leads to mistakes.
  • No mixed domains: A sitemap file can only ever contain URLs a domain or subdomain included. So in a sitemap, you can't add entries from various Mix websites. For example, if you had separate subdomains for desktop and mobile, you'd need two sitemaps (or an index that lists them both). However, this is usually not an issue, as most websites simply use a domain. The same applies to protocols: Only list https://URLs if your site is running over HTTPS. HTTP versions of the same pages do not belong (they should point to HTTPS via a redirect anyway).
  • Use the sitemap index: If — for whatever reason — you have multiple sitemap files (whether because of size or for thematic separation), you should use a Index sitemap create. <lastmod>This has the format <sitemapindex>and lists the individual sitemap files with <loc>and optionally. In turn, a maximum of 50,000 sitemaps may be listed in an index. You can also nest: an index can reference other indexes, but you rarely need that. In practice, a single index with all sitemaps is usually sufficient. Rich best only The index in the Search Console — Google automatically extracts all linked sitemaps from this. So you don't have to manage tens of individual files separately.
  • Example structure for large pages: Let's say you run a shop with 200,000 products. For example, you could have four sitemaps (sitemap-prod-A.xml,..., sitemap-prod-D.xml), each with ~50k URLs. In addition, maybe a sitemap-blog.xml for blog articles, a sitemap-kategorien.xml, etc. You can list all these individual sitemaps in a sitemap-index.xml. In robots.txt and GSC, you would then only announce sitemap-index.xml. It keeps everything together nicely.
  • Test and monitor: Last but not least — test your sitemap before you “sharpen” it. As a test, load them in a browser or with a tool like Screaming Frog (which also has a sitemap check function) and see if there are all URLs that should be in and none that don't belong in. After you submit, watch Search Console. If Google reports parsing errors, correct them immediately. Fortunately, search engines are quite tolerant of minor problems when it comes to sitemaps — a certain level of fault tolerance is built in
    . But of course, an error-free file is ideal.

In short: Stick to the standards (max. 50k URLs/50MB, correct XML format) and structure the sitemap sensibly for your website size. Then nothing stands in the way of successful processing. Search engines will thank you by being able to crawl your content more efficiently.

Practical examples and common mistakes

There are also pitfalls with XML sitemaps. Although many websites have a sitemap, they don't make full use of its potential — or even cause problems because the sitemap is incorrectly maintained. Let's take a look at some common mistakes About sitemaps and how you can avoid them:

1: “Broken” URLs in the sitemap

One of the classics is that the sitemap contains links that do not (anymore) work at all or should not be indexed. For example, pages that have a 404 error Throw, pages that redirect via 301, or those with noindex. Having such URLs in the sitemap is counterproductive. Solution: Make sure that your sitemap is cleaned regularly. After a relaunch or major updates, the sitemap should be kept up to date so that, for example, old, deleted URLs are removed. Monitor messages in Search Console—it shows you when URLs appear in the sitemap that are causing an error. Tip: Maintain clean forwarding management and fix 404 errors promptly. If 404 pages end up in your sitemap, either correct the sitemap or — better — fix the broken links. (Find out how to avoid and fix 404 errors in general in our 404 errors and SEO guide.)

2: Unimportant or unwanted pages are included

Sometimes plugins or tools generate sitemaps that also include archive pages, pagination pages (page/2/, etc.) or filter parameters. These are usually unimportant or even disruptive for indexing. A good sitemap should only high-quality, relevant pages included. Solution: If necessary, adjust the configuration of your sitemap generator. Many SEO plugins allow you to exclude certain post types or parameters. Search the sitemap manually or with an SEO tool and filter for example for “? “, “=” or obvious patterns that don't belong (session IDs, etc. as mentioned above). Anything that is not intended for the index should remain outside.

3: Sitemap was never submitted or cannot be found

Some website operators rely on Google to find the sitemap by itself. In many cases, however, this does not happen, especially if the sitemap: notice is missing in the robots.txt. Or the sitemap is in an unusual location or is called something else (e.g. /sitemap_index.xml — which is common — or worse: /abc/def/sitemap.xml). If Google doesn't know the file, the best sitemap won't do anything. Solution: Always reference in robots.txt and enter it in the Google Search Console. This gives you double security. In the GSC sitemaps report, you can also see whether Google was able to retrieve the file — this also tells you whether the path is correct.

4: Static sitemap that doesn't keep pace with the website

This point applies in particular to larger, frequently updated websites. Example: Developers export and upload a sitemap once, but don't update it regularly. As a result, new pages only end up in the sitemap much later or not at all and old ones remain there forever. This is particularly annoying if, for example, you have an online shop with a constantly changing range of products — important new products may then remain undetected for longer. Solution: Automate the process. Use the CMS function or a cron job that, for example, regenerates the sitemap every day. There are also tools (such as Screaming Frog) with which you can crawl and build a sitemap, but this is intended more as a one-time solution or backup, as it is error-prone and time-consuming. Ideal is a dynamic sitemap directly from your system.

5: Conflicting hreflang information

This is a bit more specific: If you use Hreflang in the sitemap (see Best Practices), there is a risk that the information does not match the information in the HTML. We've already touched on this — two different hreflang sets are confusing Google

Solution: Decide which path to take. Either you maintain Hreflang consistently in the source code of every page (then omit it from the sitemap), or you do it in the sitemap and Disable on-page hreflang tags. This is how you avoid conflicting signals.

6: Misuse of Priority/Changefreq

Many believe they could influence Google with certain settings in the sitemap — for example, give all pages priority 1.0 in the hope of a better ranking, or set changefreq=daily so that Google crawls daily. In fact, this information is hardly taken into account. Google evaluates the importance of a page primarily based on content and incoming links, not on the basis of a number entry in the sitemap. An artificially inflated priority can even look suspicious. Solution: Only use priority and changefreq if you have a real reason. For example, you could give very important pages a slightly higher priority than less important pages — within your site This can help to signal what is more important. But don't expect miracles. And only enter realistic Changefreq values (or omit the field). In general, the following applies: Content and internal linking are more important than any sitemap setting.

7: Forgetting to remove old sitemaps

Sometimes the file names or paths of sitemaps change (e.g. due to plugin changes or website relaunches). A common mistake is that the old sitemap file stays online and may still be in the robots.txt while the new one is already being used. This can result in Google seeing two different sitemaps with potentially contradictory content. Solution: Delete or prevent access to outdated sitemap files. Update robots.txt to the latest version. In Search Console, you can mark old sitemaps as “remove” so that they disappear from the report. Keep your setup slim: a current sitemap (or index) per property.

8: Not looking at Search Console

Finally: The biggest mistake is to check the matter off after you submit it. Many problems would be noticed if you occasionally looked at the Google Search Console or Bing Webmaster Tools. There, Google will inform you about errors in the sitemap, the indexing rate, any discrepancies, etc. If you ignore this, you are wasting optimisation potential. Solution: Make it a routine to look at the GSC (indexing & sitemaps section) at least once a month. Especially after major changes or when new content has been added, a quick check is worthwhile: Are the new URLs in the sitemap? Have they been indexed yet? Are there any errors in error messages? — This allows you to react quickly instead of realising that something went wrong months later.

As you can see, many of these mistakes can be avoided with common sense and some regular care. One XML sitemap is not a one-time “set and forget” thing, but should be treated like a living document that evolves with your website. If you follow these best practices and keep an eye on the typical sources of error, your sitemap becomes a valuable asset for your SEO — and no reason for headaches.

FAQ — Common questions about XML sitemaps

Finally, we'll answer a few frequently asked questions about XML sitemaps:

What is an XML sitemap?

An XML sitemap is a structured page overview in XML format that helps search engines efficiently crawl the page structure of a website. It lists all relevant individual documents, such as pages, images or videos, together with metadata (e.g. date of change or priority). The file follows the XML Version 1.0 Encoding UTF-8 specification and can contain various sitemap formats, such as for news, images or a video sitemap. A well-maintained sitemap XML improves the presentation in search results and significantly supports search engine optimisation.

Why is a sitemap important for SEO?

Creating a sitemap ensures that all relevant pages — including deeply linked or new content — are quickly found by search engines. This makes indexing easier and improves visibility in SERPs. This is a decisive advantage, especially for complex websites or large sitemaps, for example with lots of rich media content. A clear page structure in the form of a main sitemap or sitemap index file helps to give crawlers a kind of map of your website — which also benefits website visitors in the long term.

Does every website need an XML sitemap?

Not necessarily — but especially for large or dynamic websites, with frequent updates or complex page structures, a sitemap XML is extremely helpful. It ensures that all content can be crawled and indexed correctly — especially if internal links are not optimally set. A sitemap is therefore not a must, but offers clear advantages for search engine optimisation and technical analysis.

How can I create an XML sitemap?

That depends on which system you're using:

  • Content Management Systems (CMS): Many CMS offer a sitemap function out of the box or there are plugins for it. For example, WordPress from version 5.5 automatically creates a sitemap (/wp-sitemap.xml). Popular SEO plugins such as Yoast SEO or RankMath also create sitemaps and offer configuration options (which content types should be included, etc.). There are similar solutions for Shopify, Joomla, Drupal, etc. So first, check whether your CMS already provides a sitemap or a plugin is available — this is usually the case.
  • Manual creation: If you have a static website or a self-build system without a sitemap function, you can use tools such as Screaming Frog or XML-sitemaps.com use. Screaming Frog can crawl your website and export a sitemap.xml from it. Online generators such as xml-sitemaps.com (often free for smaller sites) also create a sitemap file when you enter the URL. You would then have to upload them to your server.
  • My script: Developers can also write their own script, which, for example, reads all URLs from the database and formats them as XML. This is particularly worthwhile for very large pages or special cases. Many shop systems have something like this running in the background.
  • Important: No matter how you create them — validate the result. Check that all important URLs are included and that the XML is correct. A quick check in the browser or with a validator tool is usually sufficient.

In most cases, the automatic solution via CMS/plugin Preferably because it keeps the sit map up to date. With WordPress, for example, all you need to do is activate the corresponding feature and the plugin will do the rest.

How do I submit my XML sitemap to Google?

You can create your sitemap and then submit it directly to Google Search Console under “Sitemaps.” Make sure that it complies with the XML version 1.0 encoding UTF-8 standard. Large websites should use a main sitemap or a sitemap index file to logically group multiple individual documents. Important: Make sure that the sitemap is displayed correctly and that all URLs are accessible — this makes analysis easier and improves crawling behaviour.

Does a sitemap guarantee that all my pages are indexed?

No An XML sitemap only guarantees that Google learns about the URLs not that it indexes them. Google decides whether a page is indexed based on many factors (content quality, duplication, technical accessibility, user signals, etc.). The sitemap helps with discovery from pages, but it is not an “indexing ticket.” Google itself points out that a sitemap is no guarantee of indexing. It may happen that some pages are reported as “Discovered — currently not indexed” in the Search Console even though they are in the sitemap. Reasons can include thin content, crawling budget, or other SEO problems. Here, you then have to work on the content or website quality.

Note: You can't force yourself to index and rank, certainly not just through a sitemap. The sitemap is a helpful tool, but it's not a miracle cure. It ensures that Google is nothing oversees, but then your side must be convincing on its own. After all: When pages not Being indexed, you can become aware of it more quickly and take action thanks to the GSC and sitemap.

Does an XML sitemap affect my ranking on Google?

Not directly. An XML sitemap is not a ranking factor in the strict sense of the word. The presence of a sitemap does not automatically increase your site's ranking. But what can happen indirectly: Through faster and more complete indexing can Your pages appear earlier in search results, which is of course positive. But Google doesn't rank pages without a sitemap worse as long as it finds them elsewhere. Gary Illyes from Google was once asked whether not having a sitemap was a ranking disadvantage — he said no. It is more important that your website is technically clean and has strong content. Conclusion: A sitemap helps with crawling/indexing, but content, backlinks, user experience, etc. count much more for ranking. So you should have both: good content and a sitemap for optimal conditions.

What do I do if my website has more than 50,000 URLs?

As described above, in this case you must use multiple sitemap files work. Break down your URLs — e.g. by categories, annual archives, product groups, or simply in blocks of 50k. Then create a Sitemap index file, which lists all individual sitemaps. You submit this index file to Google (instead of each individual sitemap). Google will retrieve all sitemaps referenced in it one by one. Make sure that each individual file in turn complies with the 50K/50MB limits. In theory, you could manage up to 50,000 sitemaps with 50,000 URLs each (i.e. 2.5 billion URLs) — in practice, we barely get anywhere. If your website thus is big, you probably have your own indexing management team. For most, it's enough to manage 2-10 sitemaps. For example, news sites often create a new sitemap (for new articles) every day and keep older articles in separate sitemaps per month/year. Important: Don't forget to add new sitemaps to the index as well if you change the scheme. And keep the robots.txt up to date if there are sitemap paths entered there.

Can I also use multiple sitemaps?

Yes! Especially with large pages, it makes sense to divide content into several sitemap formats — for example by language, content type or topicality. You can then summarise them centrally using a sitemap index file. This gives you control over the structure of your websites and gives search engines a clear map. This strategy is also recommended for various rich media content — e.g. via a video sitemap.

What is IndexNow and do I need it?

IndexNow is a relatively new approach (initiated by Microsoft/Bing) that allows search engines upfront to provide information about new or changed content. Basically, you're actively pinging the search engine instead of waiting for it to fetch your sitemap. Bing and Yandex already support IndexNow; Google is still experimenting with it (as of 2024), but has not (yet) officially integrated it. For you, this means that if Bing is important to you, IndexNow can be useful. You can set up a special API URL or use tools/plugins that support IndexNow to immediately inform Bing as soon as you change a page. Does IndexNow replace the sitemap? Not really. It is more of an addition to swifter Informing. You should still maintain your XML sitemap because it serves as a complete reference to all URLs — IndexNow only ever reports changes. Kevin Indig once summed it up this way: “If you want things to be really fast: Forget the sitemap, use IndexNow and ping Bing directly.”

For Google, however, you stick with the classic sitemap, as IndexNow currently plays no role there. Conclusion: Nice-to-have for Bing/Yandex, but not a must — the XML sitemap remains the foundation.

conclusion

XML sitemaps are a important SEO tool to make it easier for search engines to navigate your site. They can noticeably improve indexing, especially for large or complexly structured websites. They may not be necessary for smaller projects, but they never hurt — and they grow with your demands. It is important that you use the sitemap Create and maintain correctly: List only relevant, indexable pages, keep entries up to date and regularly check the Search Console for errors. A good sitemap is always Up to date and free from technical errors.

Remember: The sitemap isn't a guarantee of top rankings, but it does ensure that your content gets a fair chance of even getting into the index. She is, so to speak, yours SEO-Background noise — invisible to users, but essential for crawlers. If you follow the best practices outlined in this guide, you're on the right track: All important pages are reliably indexed by Google without you constantly having to manually help. This allows you to focus on the essentials — great content and a strong website structure — while the sitemap quietly and diligently does its job.

Finally, a tip: Don't get bogged down in perfectionism. It is almost impossible to always display 100% perfect of a huge website in the sitemap.

And you don't even have to. Google forgives minor discrepancies. What is more important is that the big lines Voices: all important pages in, gross nonsense outside. With this in mind: Make your website sitemap-fit and offer Google & Co. an optimal overview!

Do you need assistance? If this all seems too technical for you or you want to make sure that your website is SEO-optimised all around, we can help from panpan.digital I'm happy to continue with you. With our professional SEO Consulting We help you optimise your sitemap and the entire website for search engines. And as part of our digital strategy consulting Let's develop a holistic online strategy together so that your company can get off to a successful start in the digital space. Just talk to us — we'll take your SEO to the next level!

Have you ever wondered how you can help search engines like Google find all the important pages on your website? Get right here XML sitemaps into the game. An XML sitemap is basically nothing more than a table of contents for your website — but for search engines instead of for people. In the past, websites often had HTML sitemaps that visitors could click through, but these were hardly exciting for human users. The XML sitemap, on the other hand, is a machine-readable format that lists all relevant URLs on your website.

Why is that important? Imagine that Google is visiting your website. Of course, Google can find many pages via internal links. But no website is perfectly linked, and especially with large or new websites, there is a risk that some pages will remain “under the radar.” According to Google, XML sitemaps are the second most important Source to discover new content.

In other words, a good sitemap ensures that Google & Co. does not overlook any important content. It provides search engines with valuable additional information — for example, which pages are particularly important to you and when they were last updated. In short: XML sitemaps speed up and improve the indexing of your content in search engines.

In this guide, you will find out in a practical and understandable way what an XML sitemap is, and when you do it Really needhow to create and optimize them, and what typical mistakes you should avoid. Whether you're an SEO professional, marketing manager, or interested newbie — here you'll find helpful tips from basics to advanced best practices. So let's get started and make your website sitemap-fit 😉.

What is an XML sitemap?

An XML sitemap is a special file (in XML format) that lists all important pages (URLs) on your website so that search engines a complete overview receive about your content. Think of them as a digital table of contents or a phone book for your website. This file contains the URLs of your pages and, optionally, additional information about each URL, such as when the page was last changed or whether there are alternative language versions. However, these extras are optionally. Google itself explains:

“A sitemap tells Google which pages and files on your website you think are particularly important, and it also provides valuable additional information, such as when the page was last updated or about alternative language versions of the page. “

Important: One XML sitemap is primarily intended for search engines, not for visitors to your site. In contrast to the HTML sitemap, which a person could read in the browser, the XML sitemap is structured in such a way that crawlers (such as Googlebot) can process it easily and quickly. While both types of sitemaps share the same goal — to help search engines discover content — they do so in different ways. The HTML version can also be used by users for navigation, while the XML version is a Feed for search engines is.

<urlset>A typical XML sitemap file starts with a declaration of the XML version and an element with the appropriate namespace. For each URL, there is a <url>block that contains at least the address, for example:

<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
   <url>
      <loc>https://www.your-domain.com/example-page/</loc>
      <lastmod>2025-03-01</lastmod>
      <changefreq>weekly</changefreq>
      <priority>0.8</priority>
   </url>
</urlset> 

The tags<lastmod>, <changefreq>and <priority>are optional. You can specify when the page was last modified, approximately how often it changes, and what priority it has relative to your other pages. But watch out: search engines Most often ignore this information to a large extent

Google, for example, prefers to rely on its own findings and treat the priority and changefreq fields as clues rather than commands. So you don't have to worry about entering perfect values everywhere — a valid URL list is all you need.

In summary: The XML sitemap is a simple file (usually called sitemap.xml) that contains all important URLs on your website. It helps search engines that entire scope understand your website and discover new or updated pages more quickly In the next section, we'll look at whether any website requires such a site  or when its use is particularly useful.

Sitemap example for an e-commerce website with category, product, blog and about pages.

When is an XML sitemap necessary?

You might be wondering: Does every website really need a sitemap? — In theory no, but it is almost always practical recommendable. First of all: It never hurts to have a correct XML sitemap (as long as you don't make nonsense with it). Google won't punish you without a sitemap, but you could miss out on opportunities

An XML sitemap is particularly useful in the following cases:

  • Very large websites: If you have hundreds or even thousands of pages (e.g. large online stores, news sites, forums), the sitemap helps the crawler immensely. The bigger and more complex the page, the sooner you need a sitemap. Google can also crawl large pages, but a sitemap directly delivers a List of all pages, which makes crawling more efficient.
  • New websites: A new website often still has a few incoming links from outside. Search engines could only discover them slowly. With a submitted sitemap, you are actively telling Google: “Here are my pages, please have a look!” This can speed up indexing. This can give a noticeable boost, especially in the first few weeks of a website.
  • Websites with few internal links/orphaned pages: Do you have pages that are difficult or impossible to access via the navigation (Orphan Pages)? These can be easily overlooked. A sitemap ensures that even such isolated content is known. Of course, you should also improve your internal linking in the long term, but the sitemap will catch many gaps by then.
  • Pages with lots of pictures/videos or special formats: If your business relies heavily on image galleries, videos, or other media, sitemaps can also help. There is Sitemap extensions for images, videos and news to provide additional information (e.g. image subtitle, video duration, news release date). In this way, search engines not only find the page itself, but also understand embedded media better.
  • Frequently updated websites: Do you regularly publish new content or often change pages (e.g. news sites, blogs)? A sitemap with correct last-modified data can show Google which content is fresh. Although Google crawls a lot of its own accord, the sitemap can be used as signal serve to ensure that new URLs exist or that there are changes.

And when is a sitemap lesser urgent? With small websites With a manageable structure (e.g. the classic one-pager or a company website with 5-10 pages), Google can find everything even without a sitemap. If all pages are well linked together, the crawler will get through easily. But: Since modern CMS often generate a sitemap automatically, there is nothing wrong with having one even for a small page — safe is safe. The XML sitemap acts as an assurance that nothing is overlooked. Google itself says that sitemaps are particularly useful for large, frequently updated websites be recommended.

Our tip: invest a few minutes to set up a sitemap, no matter how small your site is. You don't lose anything — quite the opposite. And as pages grow or become more complex, you'll be happy that you've maintained the sitemap from the start.

Best practices for XML sitemaps

An XML sitemap is only useful if it cleanly created and maintained will. In the worst case scenario, a bad sitemap can send false signals or waste crawler resources. To make your sitemap work optimally, you should follow the following best practices:

  • Include indexable pages only: list only URLs that should be indexed by search engines! Sounds banal, but it is often done wrong. Avoid “Garbage URLs” such as pages with a noindex tag, 404 error pages, redirect URLs, or duplicate content variants. Having URLs like this in the sitemap is counterproductive—you're telling Google that they're important pages even though they're not. Better: Only genuine, 200-OK pages without a noindex and with canonical URL Belong on yourself. A clean XML sitemap contains only indexable pages.
  • Keep up to date: The sitemap should always reflect the current status of your website. When you publish new content, it must appear in the sitemap promptly. In the same way, deleted or moved pages must not remain in the sitemap for months. An outdated sitemap that lists pages that have long been removed will only confuse Google (and may lead to unnecessary crawls into the void). It is ideal, the sitemap automatically to be generated — e.g. via your CMS or a plugin — so that every change is taken into account immediately. Many content management systems such as WordPress, Joomla & Co. have corresponding functions or plugins that update the sitemap with every change
    Use this automation so you don't have to run after you by hand.
  • No duplicate URLs & no unnecessary parameters: Make sure that each URL is only unodc appears in the sitemap. If your site is accessible via several URL variants (with/without www., with different parameters, etc.), you should only use the canonical List variant. Unwanted parameters (session IDs, tracking parameters like? utm_source=... etc.) have no place in the sitemap. They bloat the file and cause Google to waste time with redundant URLs. Make sure that your sitemap generator filters out such parameters.
  • Structured layout for large websites: If your site has an extremely large number of URLs, it's a good idea to split up the sitemap. Google sets an upper limit of 50,000 URLs resp 50 MB file size (uncompressed) firm.
  • For very large projects, you can therefore create several sitemap files (e.g. divided by subject areas or page types) and share them in a higher-level Sitemap index file summarize. A sitemap index is like a sitemap of sitemaps — it lists the individual sitemap files. This makes administration clearer and avoids size limits. Attention: If you use a lot of sitemaps, remember to log every new file back into Google or add it to the index.

Reference the file in the default location and in robots.txt: You usually put the sitemap.xml in root directory from the website (e.g. https://www.deine-domain.de/sitemap.xml). This allows you to quickly find it at the main address. In addition, you should include the sitemap in your robots.txt Enter a file. example:

Makefile
CopyEdit
User agent: *

Disallow: /internal-directory/

Sitemap: https://www.deine-domain.de/sitemap.xml

  • The sitemap: in robots.txt helps all crawlers (not just Google) get directly to the sitemap. Since robots.txt is usually the first thing a crawler fetches, you make sure that the search engine knows where your table of contents is right from the start
  • Submission to Search Console (and other search engines): Even though Google can find your sitemap via robots.txt should, is it recommended that they also manually submit to Google Search Console. This gives you the submission in black and white and can later see in Search Console exactly how many URLs Google has indexed from your sitemap and whether there are any problems (more about this in the next section).
    The same applies to Bing: In Bing Webmaster Tools, you can also store your sitemap to actively point this out to Bing. Yahoo and DuckDuckGo mostly use Bing data, and Yandex and Baidu have their own webmaster tools — depending on whether these search engines are relevant to you, you can also enter your sitemap there
  • No superfluous information and gimmicks: As mentioned above, fields such as <changefreq>and <priority>in the sitemap are optional. Many webmasters meticulously enter values here in the hope that Google, for example, would give preference to pages with priority 1.0. In practice, you can usually save yourself the trouble — Google largely ignores priority information.
    Even the Change Frequency (“frequency of change”) is often misunderstood. If you stubbornly enter “daily” everywhere, even though the page doesn't change daily at all, Google quickly notices this and doesn't attach any importance to the field. Our advice: Leave these fields out unless you can fill them with real data in a meaningful way. One exception is the <lastmod>date: If you can maintain it reliably automatically (e.g. always the publication/update date for a blog), then it doesn't hurt to specify this — lastmod is particularly helpful for news pages. But avoid “manipulating” Lastmod (such as daily up-dating without actual changes) — you won't gain anything.
  • Hreflang in sitemaps only for professionals: <xhtml:link>In XML sitemaps, it is possible to use so-called tags for Hreflang to use (provide alternative language and country versions). This can be useful for international websites. But be careful: If Hreflang is already specified in the HTML code of the pages, there is a risk of conflicting information if the sitemap contains other information. Contradictions are the last thing we want — Google could then both Ignore information. If you want to use Hreflang in the sitemap, make sure that the data 100% match on-page hreflang tags. If in doubt, leave Hreflang out of the sitemap and manage it only on the site. Our recommendation (for most cases): Really in the sitemap only the URLs lists. All other detailed information (language versions, priorities, etc.) is optional and can also be omitted — this makes the file simpler and less prone to errors.

If you follow these best practices, you've already won a lot. Your sitemap becomes a reliable assistant for crawling, rather than a potential source of error. In the next step, we'll figure out how to correctly submit the sitemap to Google and what you need to consider when indexing it.

Indexing & Google Search Console

The best sitemap is of little use if search engines are unaware of its existence. Therefore, the process of Submitting and monitoring in appropriate tools is important. Here is a short guide on how to proceed and what you should pay attention to:

1. Submit to Google

As soon as your XML sitemap is ready and available online, you should use it in the Google Search Console (GSC) sign in. Sign in to the GSC, select your property (website) and navigate in the menu to “Sitemaps”. There you can find the URL of your sitemap (e.g. https://www.deine-domain.de/sitemap.xml) enter and send. The Search Console will attempt to retrieve the file and give you immediate feedback as to whether the retrieval thriving was. Ideally, the status “Successful” appears with an indication of the number of URLs found. If there are problems (e.g. the file not found, the format is incorrect, etc.), the GSC reports an error, which you should then fix.

2. Status check

After you submit the sitemap, you can see some useful information in the Sitemaps report in Search Console. Among other things, there are:

  • URLs detected: How many pages did Google find in the sitemap file (e.g. “URLs detected: 120”).
  • Indexed URLs: How many of the submitted URLs are already in the index (e.g. “Indexed: 110"). This number is particularly interesting — it shows whether Google has actually included the majority of your pages.
  • Errors/Warnings: If certain URLs couldn't be indexed or the sitemap has problems, you'll see warnings here. Common examples: “URL blocked by robots.txt”, “Alternative site with canonical tag” (i.e. Google has identified the page as a duplicate and is indexing the main version) or “Not found (404) pages in sitemap”.

This information helps you immensely. For example, you can quickly see if all important pages indexed were or whether some may not be in the index even though they are in the sitemap. But remember: Even if you submit a URL via a sitemap, there is no guaranteethat Google is also crawling or indexing them. The sitemap is an invitation, not a command. For example, it may happen that the search engine shows “20 URLs discovered, 18 indexed” — Google has not (yet) included the missing 2 pages, possibly due to quality problems or because they are very similar to other pages. That is normal. The only important thing is that no important Page remains permanently unindexed.

Tip: If the GSC consistently shows significantly fewer indexed URLs than submitted URLs, you should check why (content thin? Duplicate content? Crawling issues?). The sitemap has done its job, Google knows about the page — but perhaps Google does not (yet) consider it worthy of indexing. This means researching the causes.

3. Monitoring and troubleshooting

Check the Search Console regularly to keep an eye on the health of your sitemap and indexing. The GSC also reports, for example, if a submitted sitemap is no longer available later or if new errors occur. A common mistake Is, for example, that you forget to update the sitemap after a domain change or relaunch, and then Google crawls an old sitemap for a while that points to the old domain — this is easy to avoid if you always check the sitemap settings after making changes.

Ways to share the sitemap with Google, we've already mentioned: via robots.txt and the Search Console. You can also use the direct route: Once submitted, Google will regularly retrieve your sitemap by itself. If you're in a hurry (e.g. with an important update), you can go to the GSC on “Resubmit” click — this triggers another crawl of the sitemap. As a rule, however, this is not necessary; Google will drop by from time to time anyway.

4. Submitting to other search engines

In addition to Google — depending on your target group — you shouldn't forget Bing. In the Bing Webmaster Tools There is also a sitemaps area that works in a similar way. Bing's market share is significantly smaller (often only ~2—3% in the B2C sector, possibly slightly higher at ~20-30% in B2B), but when a sitemap already exists, it hardly takes time to submit it there as well. Bing also has an interesting new approach called IndexNow, which is discussed briefly in the next section (FAQ). Yandex and Baidu also offer options, but these are only relevant if you are specifically represented in Russia and China.

In summary: Google Search Console is your most important tool for indexation to monitor your sitemap URLs. It shows you whether your sitemap was successfully processed and what the indexing status of your pages is. Remember that although a sitemap improves discovery, no guarantee of indexing offers — Google still decides for itself what goes into the index. Nevertheless, the sitemap is an indispensable means of keeping Google up to date and quickly identifying potential problems.

Technical requirements and file size

When creating an XML sitemap, you need to follow a few technical rules so that search engines easily accept the file. Here are the most important requirements and limits according to the official Sitemaps log and Google's guidelines:

  • Maximum URLs per file: A single sitemap file can have up to 50,000 URLs included. That sounds like a lot — and that's easily enough for most websites. If you still have more pages (large e-commerce sites, news archives, etc.), distribute the URLs to several sitemaps and use a sitemap index (see next point).
  • Maximum file size: The sitemap file may be uncompressed at most 50MB Be big. The same applies here: larger than 50 MB = divide. By the way, Google and Bing allow you to submit compressed sitemaps (gzip). So if you upload your sitemap.xml packaged as sitemap.xml.gz, the packed The file may be smaller, but the size is decisive upon unpacking (it must be less than 50 MB). Previously, this limit was 10 MB, but was increased to 50 MB to suit modern websites. In practice, 50 MB ~ almost exactly 50,000 average-sized URLs — so you usually reach the URL limit first anyway.
  • Format and coding: The file must be in XML format exist, be UTF-8 encoded and comply with the official scheme. This means, among other things, that all < and & etc. in URLs must be correctly validated/escaped. A typical structure has already been outlined above. If you create a sitemap manually or use an unknown tool, it's best to validate the file with a sitemap validator or directly in the Search Console (if it's incorrect, GSC will report this). Syntax errors can prevent Google from reading the entire file — you should definitely avoid that.
  • Valid tags: <url>The following XML tags are mainly allowed in a URL listing: <urlset>as a container, including several entries. Within <url><loc>(the URL itself) is mandatory. You can optionally add <lastmod>(last change date), <changefreq>(change frequency) and <priority>(priority). There are extensions for special sitemaps, e.g. <image:image>for images or <video:video>for videos. Important: Follow the specification carefully — a common mistake is, for example, to enter date values in the wrong format (must be ISO-8601, e.g. 2025-03-14) or to include your own, prohibited fields. That leads to mistakes.
  • No mixed domains: A sitemap file can only ever contain URLs a domain or subdomain included. So in a sitemap, you can't add entries from various Mix websites. For example, if you had separate subdomains for desktop and mobile, you'd need two sitemaps (or an index that lists them both). However, this is usually not an issue, as most websites simply use a domain. The same applies to protocols: Only list https://URLs if your site is running over HTTPS. HTTP versions of the same pages do not belong (they should point to HTTPS via a redirect anyway).
  • Use the sitemap index: If — for whatever reason — you have multiple sitemap files (whether because of size or for thematic separation), you should use a Index sitemap create. <lastmod>This has the format <sitemapindex>and lists the individual sitemap files with <loc>and optionally. In turn, a maximum of 50,000 sitemaps may be listed in an index. You can also nest: an index can reference other indexes, but you rarely need that. In practice, a single index with all sitemaps is usually sufficient. Rich best only The index in the Search Console — Google automatically extracts all linked sitemaps from this. So you don't have to manage tens of individual files separately.
  • Example structure for large pages: Let's say you run a shop with 200,000 products. For example, you could have four sitemaps (sitemap-prod-A.xml,..., sitemap-prod-D.xml), each with ~50k URLs. In addition, maybe a sitemap-blog.xml for blog articles, a sitemap-kategorien.xml, etc. You can list all these individual sitemaps in a sitemap-index.xml. In robots.txt and GSC, you would then only announce sitemap-index.xml. It keeps everything together nicely.
  • Test and monitor: Last but not least — test your sitemap before you “sharpen” it. As a test, load them in a browser or with a tool like Screaming Frog (which also has a sitemap check function) and see if there are all URLs that should be in and none that don't belong in. After you submit, watch Search Console. If Google reports parsing errors, correct them immediately. Fortunately, search engines are quite tolerant of minor problems when it comes to sitemaps — a certain level of fault tolerance is built in
    . But of course, an error-free file is ideal.

In short: Stick to the standards (max. 50k URLs/50MB, correct XML format) and structure the sitemap sensibly for your website size. Then nothing stands in the way of successful processing. Search engines will thank you by being able to crawl your content more efficiently.

Practical examples and common mistakes

There are also pitfalls with XML sitemaps. Although many websites have a sitemap, they don't make full use of its potential — or even cause problems because the sitemap is incorrectly maintained. Let's take a look at some common mistakes About sitemaps and how you can avoid them:

1: “Broken” URLs in the sitemap

One of the classics is that the sitemap contains links that do not (anymore) work at all or should not be indexed. For example, pages that have a 404 error Throw, pages that redirect via 301, or those with noindex. Having such URLs in the sitemap is counterproductive. Solution: Make sure that your sitemap is cleaned regularly. After a relaunch or major updates, the sitemap should be kept up to date so that, for example, old, deleted URLs are removed. Monitor messages in Search Console—it shows you when URLs appear in the sitemap that are causing an error. Tip: Maintain clean forwarding management and fix 404 errors promptly. If 404 pages end up in your sitemap, either correct the sitemap or — better — fix the broken links. (Find out how to avoid and fix 404 errors in general in our 404 errors and SEO guide.)

2: Unimportant or unwanted pages are included

Sometimes plugins or tools generate sitemaps that also include archive pages, pagination pages (page/2/, etc.) or filter parameters. These are usually unimportant or even disruptive for indexing. A good sitemap should only high-quality, relevant pages included. Solution: If necessary, adjust the configuration of your sitemap generator. Many SEO plugins allow you to exclude certain post types or parameters. Search the sitemap manually or with an SEO tool and filter for example for “? “, “=” or obvious patterns that don't belong (session IDs, etc. as mentioned above). Anything that is not intended for the index should remain outside.

3: Sitemap was never submitted or cannot be found

Some website operators rely on Google to find the sitemap by itself. In many cases, however, this does not happen, especially if the sitemap: notice is missing in the robots.txt. Or the sitemap is in an unusual location or is called something else (e.g. /sitemap_index.xml — which is common — or worse: /abc/def/sitemap.xml). If Google doesn't know the file, the best sitemap won't do anything. Solution: Always reference in robots.txt and enter it in the Google Search Console. This gives you double security. In the GSC sitemaps report, you can also see whether Google was able to retrieve the file — this also tells you whether the path is correct.

4: Static sitemap that doesn't keep pace with the website

This point applies in particular to larger, frequently updated websites. Example: Developers export and upload a sitemap once, but don't update it regularly. As a result, new pages only end up in the sitemap much later or not at all and old ones remain there forever. This is particularly annoying if, for example, you have an online shop with a constantly changing range of products — important new products may then remain undetected for longer. Solution: Automate the process. Use the CMS function or a cron job that, for example, regenerates the sitemap every day. There are also tools (such as Screaming Frog) with which you can crawl and build a sitemap, but this is intended more as a one-time solution or backup, as it is error-prone and time-consuming. Ideal is a dynamic sitemap directly from your system.

5: Conflicting hreflang information

This is a bit more specific: If you use Hreflang in the sitemap (see Best Practices), there is a risk that the information does not match the information in the HTML. We've already touched on this — two different hreflang sets are confusing Google

Solution: Decide which path to take. Either you maintain Hreflang consistently in the source code of every page (then omit it from the sitemap), or you do it in the sitemap and Disable on-page hreflang tags. This is how you avoid conflicting signals.

6: Misuse of Priority/Changefreq

Many believe they could influence Google with certain settings in the sitemap — for example, give all pages priority 1.0 in the hope of a better ranking, or set changefreq=daily so that Google crawls daily. In fact, this information is hardly taken into account. Google evaluates the importance of a page primarily based on content and incoming links, not on the basis of a number entry in the sitemap. An artificially inflated priority can even look suspicious. Solution: Only use priority and changefreq if you have a real reason. For example, you could give very important pages a slightly higher priority than less important pages — within your site This can help to signal what is more important. But don't expect miracles. And only enter realistic Changefreq values (or omit the field). In general, the following applies: Content and internal linking are more important than any sitemap setting.

7: Forgetting to remove old sitemaps

Sometimes the file names or paths of sitemaps change (e.g. due to plugin changes or website relaunches). A common mistake is that the old sitemap file stays online and may still be in the robots.txt while the new one is already being used. This can result in Google seeing two different sitemaps with potentially contradictory content. Solution: Delete or prevent access to outdated sitemap files. Update robots.txt to the latest version. In Search Console, you can mark old sitemaps as “remove” so that they disappear from the report. Keep your setup slim: a current sitemap (or index) per property.

8: Not looking at Search Console

Finally: The biggest mistake is to check the matter off after you submit it. Many problems would be noticed if you occasionally looked at the Google Search Console or Bing Webmaster Tools. There, Google will inform you about errors in the sitemap, the indexing rate, any discrepancies, etc. If you ignore this, you are wasting optimisation potential. Solution: Make it a routine to look at the GSC (indexing & sitemaps section) at least once a month. Especially after major changes or when new content has been added, a quick check is worthwhile: Are the new URLs in the sitemap? Have they been indexed yet? Are there any errors in error messages? — This allows you to react quickly instead of realising that something went wrong months later.

As you can see, many of these mistakes can be avoided with common sense and some regular care. One XML sitemap is not a one-time “set and forget” thing, but should be treated like a living document that evolves with your website. If you follow these best practices and keep an eye on the typical sources of error, your sitemap becomes a valuable asset for your SEO — and no reason for headaches.

FAQ — Common questions about XML sitemaps

Finally, we'll answer a few frequently asked questions about XML sitemaps:

What is an XML sitemap?

An XML sitemap is a structured page overview in XML format that helps search engines efficiently crawl the page structure of a website. It lists all relevant individual documents, such as pages, images or videos, together with metadata (e.g. date of change or priority). The file follows the XML Version 1.0 Encoding UTF-8 specification and can contain various sitemap formats, such as for news, images or a video sitemap. A well-maintained sitemap XML improves the presentation in search results and significantly supports search engine optimisation.

Why is a sitemap important for SEO?

Creating a sitemap ensures that all relevant pages — including deeply linked or new content — are quickly found by search engines. This makes indexing easier and improves visibility in SERPs. This is a decisive advantage, especially for complex websites or large sitemaps, for example with lots of rich media content. A clear page structure in the form of a main sitemap or sitemap index file helps to give crawlers a kind of map of your website — which also benefits website visitors in the long term.

Does every website need an XML sitemap?

Not necessarily — but especially for large or dynamic websites, with frequent updates or complex page structures, a sitemap XML is extremely helpful. It ensures that all content can be crawled and indexed correctly — especially if internal links are not optimally set. A sitemap is therefore not a must, but offers clear advantages for search engine optimisation and technical analysis.

How can I create an XML sitemap?

That depends on which system you're using:

  • Content Management Systems (CMS): Many CMS offer a sitemap function out of the box or there are plugins for it. For example, WordPress from version 5.5 automatically creates a sitemap (/wp-sitemap.xml). Popular SEO plugins such as Yoast SEO or RankMath also create sitemaps and offer configuration options (which content types should be included, etc.). There are similar solutions for Shopify, Joomla, Drupal, etc. So first, check whether your CMS already provides a sitemap or a plugin is available — this is usually the case.
  • Manual creation: If you have a static website or a self-build system without a sitemap function, you can use tools such as Screaming Frog or XML-sitemaps.com use. Screaming Frog can crawl your website and export a sitemap.xml from it. Online generators such as xml-sitemaps.com (often free for smaller sites) also create a sitemap file when you enter the URL. You would then have to upload them to your server.
  • My script: Developers can also write their own script, which, for example, reads all URLs from the database and formats them as XML. This is particularly worthwhile for very large pages or special cases. Many shop systems have something like this running in the background.
  • Important: No matter how you create them — validate the result. Check that all important URLs are included and that the XML is correct. A quick check in the browser or with a validator tool is usually sufficient.

In most cases, the automatic solution via CMS/plugin Preferably because it keeps the sit map up to date. With WordPress, for example, all you need to do is activate the corresponding feature and the plugin will do the rest.

How do I submit my XML sitemap to Google?

You can create your sitemap and then submit it directly to Google Search Console under “Sitemaps.” Make sure that it complies with the XML version 1.0 encoding UTF-8 standard. Large websites should use a main sitemap or a sitemap index file to logically group multiple individual documents. Important: Make sure that the sitemap is displayed correctly and that all URLs are accessible — this makes analysis easier and improves crawling behaviour.

Does a sitemap guarantee that all my pages are indexed?

No An XML sitemap only guarantees that Google learns about the URLs not that it indexes them. Google decides whether a page is indexed based on many factors (content quality, duplication, technical accessibility, user signals, etc.). The sitemap helps with discovery from pages, but it is not an “indexing ticket.” Google itself points out that a sitemap is no guarantee of indexing. It may happen that some pages are reported as “Discovered — currently not indexed” in the Search Console even though they are in the sitemap. Reasons can include thin content, crawling budget, or other SEO problems. Here, you then have to work on the content or website quality.

Note: You can't force yourself to index and rank, certainly not just through a sitemap. The sitemap is a helpful tool, but it's not a miracle cure. It ensures that Google is nothing oversees, but then your side must be convincing on its own. After all: When pages not Being indexed, you can become aware of it more quickly and take action thanks to the GSC and sitemap.

Does an XML sitemap affect my ranking on Google?

Not directly. An XML sitemap is not a ranking factor in the strict sense of the word. The presence of a sitemap does not automatically increase your site's ranking. But what can happen indirectly: Through faster and more complete indexing can Your pages appear earlier in search results, which is of course positive. But Google doesn't rank pages without a sitemap worse as long as it finds them elsewhere. Gary Illyes from Google was once asked whether not having a sitemap was a ranking disadvantage — he said no. It is more important that your website is technically clean and has strong content. Conclusion: A sitemap helps with crawling/indexing, but content, backlinks, user experience, etc. count much more for ranking. So you should have both: good content and a sitemap for optimal conditions.

What do I do if my website has more than 50,000 URLs?

As described above, in this case you must use multiple sitemap files work. Break down your URLs — e.g. by categories, annual archives, product groups, or simply in blocks of 50k. Then create a Sitemap index file, which lists all individual sitemaps. You submit this index file to Google (instead of each individual sitemap). Google will retrieve all sitemaps referenced in it one by one. Make sure that each individual file in turn complies with the 50K/50MB limits. In theory, you could manage up to 50,000 sitemaps with 50,000 URLs each (i.e. 2.5 billion URLs) — in practice, we barely get anywhere. If your website thus is big, you probably have your own indexing management team. For most, it's enough to manage 2-10 sitemaps. For example, news sites often create a new sitemap (for new articles) every day and keep older articles in separate sitemaps per month/year. Important: Don't forget to add new sitemaps to the index as well if you change the scheme. And keep the robots.txt up to date if there are sitemap paths entered there.

Can I also use multiple sitemaps?

Yes! Especially with large pages, it makes sense to divide content into several sitemap formats — for example by language, content type or topicality. You can then summarise them centrally using a sitemap index file. This gives you control over the structure of your websites and gives search engines a clear map. This strategy is also recommended for various rich media content — e.g. via a video sitemap.

What is IndexNow and do I need it?

IndexNow is a relatively new approach (initiated by Microsoft/Bing) that allows search engines upfront to provide information about new or changed content. Basically, you're actively pinging the search engine instead of waiting for it to fetch your sitemap. Bing and Yandex already support IndexNow; Google is still experimenting with it (as of 2024), but has not (yet) officially integrated it. For you, this means that if Bing is important to you, IndexNow can be useful. You can set up a special API URL or use tools/plugins that support IndexNow to immediately inform Bing as soon as you change a page. Does IndexNow replace the sitemap? Not really. It is more of an addition to swifter Informing. You should still maintain your XML sitemap because it serves as a complete reference to all URLs — IndexNow only ever reports changes. Kevin Indig once summed it up this way: “If you want things to be really fast: Forget the sitemap, use IndexNow and ping Bing directly.”

For Google, however, you stick with the classic sitemap, as IndexNow currently plays no role there. Conclusion: Nice-to-have for Bing/Yandex, but not a must — the XML sitemap remains the foundation.

conclusion

XML sitemaps are a important SEO tool to make it easier for search engines to navigate your site. They can noticeably improve indexing, especially for large or complexly structured websites. They may not be necessary for smaller projects, but they never hurt — and they grow with your demands. It is important that you use the sitemap Create and maintain correctly: List only relevant, indexable pages, keep entries up to date and regularly check the Search Console for errors. A good sitemap is always Up to date and free from technical errors.

Remember: The sitemap isn't a guarantee of top rankings, but it does ensure that your content gets a fair chance of even getting into the index. She is, so to speak, yours SEO-Background noise — invisible to users, but essential for crawlers. If you follow the best practices outlined in this guide, you're on the right track: All important pages are reliably indexed by Google without you constantly having to manually help. This allows you to focus on the essentials — great content and a strong website structure — while the sitemap quietly and diligently does its job.

Finally, a tip: Don't get bogged down in perfectionism. It is almost impossible to always display 100% perfect of a huge website in the sitemap.

And you don't even have to. Google forgives minor discrepancies. What is more important is that the big lines Voices: all important pages in, gross nonsense outside. With this in mind: Make your website sitemap-fit and offer Google & Co. an optimal overview!

Do you need assistance? If this all seems too technical for you or you want to make sure that your website is SEO-optimised all around, we can help from panpan.digital I'm happy to continue with you. With our professional SEO Consulting We help you optimise your sitemap and the entire website for search engines. And as part of our digital strategy consulting Let's develop a holistic online strategy together so that your company can get off to a successful start in the digital space. Just talk to us — we'll take your SEO to the next level!

Latest posts

Knowledge Base

Interviews, tips, guides, news and best practices on the topics of search engine optimisation, SEA and digital strategy.
5 Minutes

Hreflang: A Comprehensive Guide for International SEO Success

Hreflang is essential for international SEO, helping search engines serve the right content to the right audience. Learn how to implement hreflang tags correctly, avoid common mistakes, and keep your multilingual pages performing well.
To the article
SEO
10 Minutes

Google Index Management Basics for E‑Commerce Websites

Understand Google index management basics to boost your e-commerce site's SEO performance. Learn best practices on canonicalising product variants, streamlining site structure, and optimising XML sitemaps to drive traffic and revenue.
To the article
SEO
4 Minuten

Redirects in SEO: How to Handle Them the Right Way

Redirects are key to preserving SEO value during URL changes, relaunches, and product updates. This post explains when to use 301 vs. 302 redirects, common mistakes, and how to use smart redirect management for user and search engines.
To the article

Get in touch

Wir melden uns innerhalb von 24 Stunden mit einer Antwort bei dir – versprochen.

By entering your data, you agree to pan pan's privacy policy.

Great, we have received your message!

We are looking forward to working with you to achieve your success!
Oops, something went wrong!

Please try again or contact us directly via email.