The sitemap protocol describes several sitemap formats. Probably the most natural and the easiest to submit is the RSS sitemap format. Your site most likely already publishes its content as an RSS feed that users can subscribe to, so reusing that same feed for the search engines takes very little additional effort.
If your site publishes several RSS feeds, for example one for each category it covers, you could of course submit all of them to Google. In my case I have over 3000 different RSS URLs, so I decided to first give it a try with a single feed - the one that publishes all my new articles regardless of their category.
Before submitting the RSS sitemap to a search engine, I recommend that you first check the validity of the feed with the W3C RSS validator. This could save you from wondering why Google rejects your sitemap. For example, one thing that lots of CMSs mess up when generating a feed is the pubDate format. The W3C validator kept insisting that my BST (British Summer Time) pubDate was implausible, i.e. that it was in the future. I managed to get around this by changing the format from a time zone abbreviation to a time difference from GMT. Namely, I switched from
to
Another peculiarity is that the W3C validator recommends that you should insert a
To finish this post: don't expect that googlebot will start crawling and indexing your pages right after it reads your sitemap(s). This will likely take some time, so don't push it.
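As a footnote to the pubDate problem above, here is a minimal Python sketch of how a feed generator could emit the date with a numeric offset from GMT instead of a time zone abbreviation. It is not taken from my CMS: the function names `rss_pub_date` and `is_in_future`, the fixed one-hour BST offset and the sample timestamp are all assumptions made up for illustration.

```python
from datetime import datetime, timedelta, timezone
from email.utils import format_datetime, parsedate_to_datetime

# Assumption: BST is modelled as a fixed offset of one hour ahead of GMT.
BST = timezone(timedelta(hours=1))


def rss_pub_date(moment: datetime) -> str:
    """Format a timezone-aware datetime as an RFC 822 style date with a numeric offset."""
    if moment.tzinfo is None:
        raise ValueError("pubDate needs a timezone-aware datetime")
    # For aware datetimes, format_datetime writes the offset as +HHMM / -HHMM.
    return format_datetime(moment)


def is_in_future(pub_date: str, now: datetime) -> bool:
    """Return True if the parsed pubDate lies after the given (aware) moment."""
    return parsedate_to_datetime(pub_date) > now


# Hypothetical publication time: 10:30 BST, which is 09:30 GMT.
published = datetime(2009, 6, 14, 10, 30, 0, tzinfo=BST)
pub_date = rss_pub_date(published)
print(pub_date)  # Sun, 14 Jun 2009 10:30:00 +0100

# A check made at 09:45 GMT does not see this date as lying in the future.
check_time = datetime(2009, 6, 14, 9, 45, 0, tzinfo=timezone.utc)
print(is_in_future(pub_date, check_time))  # False
```

With the numeric offset there is no ambiguity about which instant the date refers to, so a consumer of the feed does not need to know what a particular abbreviation means.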