CreeperSiteMap

Mar 30 2011 Published by under MediaCreeper

Somewhere along all the updates I forgot to update the CreeperSiteMapEngine, perhaps one of the most important pieces for search engine robots to index. It generates an XML file with links, priority, and update frequency for almost all pages on the site. Again it was a silly error: encoding of URLs. As we accept IDN domains, the small piece of code that handled the encoding wasn’t very well written, which made the script emit errors and halt less than halfway through.
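To illustrate the kind of encoding involved, here is a minimal sketch in Python. It is not the CreeperSiteMapEngine code; the function names, the default priority and the "weekly" frequency are assumptions made up for the example. It converts an IDN host to its ASCII form, percent-encodes the path, and skips a URL that cannot be encoded instead of halting the whole run.

```python
from urllib.parse import quote, urlsplit, urlunsplit
from xml.sax.saxutils import escape


def encode_url(url):
    """Return an ASCII-only form of url (hypothetical helper)."""
    parts = urlsplit(url)
    host = parts.hostname or ""
    # The built-in "idna" codec turns e.g. "bücher.example" into its
    # "xn--" form. It raises UnicodeError for labels it cannot encode.
    ascii_host = host.encode("idna").decode("ascii")
    # Note: any user:password@ part of the URL is dropped in this sketch.
    netloc = ascii_host if parts.port is None else f"{ascii_host}:{parts.port}"
    path = quote(parts.path, safe="/%")
    query = quote(parts.query, safe="=&%")
    return urlunsplit((parts.scheme, netloc, path, query, ""))


def sitemap_entry(url, priority=0.5, changefreq="weekly"):
    """Build one <url> element; the defaults are assumed values."""
    loc = escape(encode_url(url))  # escape & < > for XML
    return (
        f"  <url><loc>{loc}</loc>"
        f"<changefreq>{changefreq}</changefreq>"
        f"<priority>{priority:.1f}</priority></url>"
    )


def build_sitemap(urls):
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    for url in urls:
        try:
            lines.append(sitemap_entry(url))
        except UnicodeError:
            # Skip the bad URL rather than stopping halfway through.
            continue
    lines.append("</urlset>")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_sitemap(["http://bücher.example/sök?q=1"]))
```

Catching the error per URL is the design choice that matters here: one badly encoded domain then costs a single entry rather than half the sitemap.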

That might explain why I haven’t seen as much traffic to certain pages as expected. Hopefully the sitemap.xml will be re-indexed in a while and the traffic returns.

There is also a humans.txt file available, with credits to the helpful people around me.


A few updates

Jan 17 2011 Published by under MediaCreeper

I just rolled out a few updates, most of them dealing with IDN domains. I noticed that it broke in a few places: encoding, encoding, encoding… Well, I’ll fix it tomorrow, as it isn’t a show-stopper.

The sorting of the SiteCloud on the /latest page has been fixed. IDN domains were always at the “bottom” because the list was sorted on the domain name and the IDN prefix, “xn--”, pushed them there. Now the domains are unIDNed (a word?), so they appear as normal.
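The sorting fix can be sketched roughly like this in Python. It is an illustration only, not the site's code, and the function names are made up for the example: each domain is decoded from its "xn--" form before it is used as the sort key, so an IDN domain lands where its readable name belongs.

```python
def display_name(domain):
    """Decode an "xn--" domain to Unicode (hypothetical helper)."""
    try:
        return domain.encode("ascii").decode("idna")
    except UnicodeError:
        # Already Unicode, or not valid IDNA: leave it untouched.
        return domain


def sort_domains(domains):
    """Sort by the decoded name rather than the raw "xn--" string."""
    return sorted(domains, key=lambda d: display_name(d).casefold())


if __name__ == "__main__":
    names = ["zebra.example", "xn--bcher-kva.example", "apple.example"]
    for name in sort_domains(names):
        print(display_name(name))
```

This orders by code point, which is enough to get the domains out of the "x" block; proper alphabetical order for every language would need locale-aware collation.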

A few cleaners have been applied: printfriendly.com and webcache.googleusercontent.com are stripped from links and the original links are restored.
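A cleaner of this kind might look like the Python sketch below. The URL shapes are assumptions, not taken from the site's code: it assumes printfriendly.com carries the original address in a "url" query parameter and that Google's cache uses "q=cache:" followed by an optional id and the original address. The function name is hypothetical as well.

```python
from urllib.parse import parse_qs, urlsplit


def clean_link(url):
    """Return the original link behind a known wrapper (hypothetical)."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    query = parse_qs(parts.query)

    # Assumed shape: http://www.printfriendly.com/print?url=<original>
    if host == "printfriendly.com" or host.endswith(".printfriendly.com"):
        return query.get("url", [url])[0]

    # Assumed shape: .../search?q=cache:<optional id>:<original>+<words>
    if host == "webcache.googleusercontent.com":
        q = query.get("q", [""])[0]
        if q.startswith("cache:"):
            # parse_qs turns "+" into spaces; search words follow the link.
            rest = q[len("cache:"):].split(" ", 1)[0]
            head, sep, tail = rest.partition(":")
            # Drop a leading cache id (no dot, not a URL scheme).
            if sep and head not in ("http", "https") and "." not in head:
                rest = tail
            if "://" not in rest:
                rest = "http://" + rest  # the scheme is a guess
            return rest

    return url  # not a known wrapper: leave it as it is


if __name__ == "__main__":
    print(clean_link("http://www.printfriendly.com/print?url=http%3A%2F%2Fexample.com%2Fa"))
```

Links that match neither pattern are returned unchanged, so the cleaner can be run over every incoming link.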
