Looks like a bot has been scouring my website without properly identifying itself. I noticed that my older posts were getting a lot of unexplained hits. I checked the logs, looked up the IPs, and discovered the visitors were bots from the rankcrawler.com domain. The bots don’t identify themselves in their user agent field, as well-behaved bots should.
Some of the bots came from these IPs (though there may be others):
87.98.249.75
87.98.133.249
91.121.26.45
94.23.152.34
94.23.153.8
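For anyone who wants to repeat the IP lookup, here is a minimal Python sketch of a reverse DNS check. The `reverse_lookup` helper and the `SUSPECT_IPS` name are my own for illustration, and whatever these addresses resolve to today may well differ from what the post describes.

```python
import socket

# IPs listed in the post. What they resolve to now may differ from 2009.
SUSPECT_IPS = [
    "87.98.249.75",
    "87.98.133.249",
    "91.121.26.45",
    "94.23.152.34",
    "94.23.153.8",
]


def reverse_lookup(ip, resolver=socket.gethostbyaddr):
    """Return the reverse DNS (PTR) hostname for ip, or None if lookup fails.

    The resolver argument is a hypothetical hook so the function can be
    exercised without network access; by default it uses the standard library.
    """
    try:
        hostname, _aliases, _addresses = resolver(ip)
        return hostname
    except (socket.herror, socket.gaierror):
        return None


if __name__ == "__main__":
    for ip in SUSPECT_IPS:
        print(ip, reverse_lookup(ip) or "(no reverse DNS record)")
```

A reverse lookup only tells you what the owner of the address block chose to publish, so treat the hostname as a hint rather than proof of who is behind the traffic.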
As the log entries below show, Rankcrawler disguises itself as a regular browser (Firefox 3.0.6 or MSIE 6.0) rather than announcing itself as a crawler. This is a no-no.
87.98.249.75 - - [29/May/2009:23:56:09 -0400] "GET /page/2/ HTTP/1.0" 200 34160 "-" "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.6) Gecko/2009011913 Firefox/3.0.6"
87.98.249.75 - - [30/May/2009:00:11:16 -0400] "GET /2006/07/ HTTP/1.0" 200 41171 "-" "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.6) Gecko/2009011913 Firefox/3.0.6"
91.121.26.45 - - [29/May/2009:20:47:22 -0400] "GET / HTTP/1.0" 200 34467 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
91.121.26.45 - - [30/May/2009:00:01:23 -0400] "GET /2008/05/ HTTP/1.0" 200 27858 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
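Entries like these can be summarized per IP with a short script. The sketch below is one way to do it in Python, assuming the raw log is in Apache's combined format with plain ASCII quotes and hyphens. The names `LOG_PATTERN`, `SAMPLE_LOG`, and `summarize` are mine, not part of any tool.

```python
import re
from collections import defaultdict

# Apache "combined" format:
# ip identity user [time] "request" status bytes "referrer" "user agent"
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"'
)

# The four entries quoted in the post, written with ASCII quotes and hyphens.
SAMPLE_LOG = '''
87.98.249.75 - - [29/May/2009:23:56:09 -0400] "GET /page/2/ HTTP/1.0" 200 34160 "-" "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.6) Gecko/2009011913 Firefox/3.0.6"
87.98.249.75 - - [30/May/2009:00:11:16 -0400] "GET /2006/07/ HTTP/1.0" 200 41171 "-" "Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.9.0.6) Gecko/2009011913 Firefox/3.0.6"
91.121.26.45 - - [29/May/2009:20:47:22 -0400] "GET / HTTP/1.0" 200 34467 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
91.121.26.45 - - [30/May/2009:00:01:23 -0400] "GET /2008/05/ HTTP/1.0" 200 27858 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
'''


def summarize(log_text):
    """Group log entries by client IP.

    Returns {ip: {"hits": int, "agents": set of str, "no_referrer": int}}.
    Lines that do not match the combined format are skipped.
    """
    summary = defaultdict(lambda: {"hits": 0, "agents": set(), "no_referrer": 0})
    for line in log_text.splitlines():
        match = LOG_PATTERN.match(line.strip())
        if match is None:
            continue
        entry = summary[match.group("ip")]
        entry["hits"] += 1
        entry["agents"].add(match.group("agent"))
        if match.group("referrer") in ("-", ""):
            entry["no_referrer"] += 1
    return dict(summary)


if __name__ == "__main__":
    for ip, info in summarize(SAMPLE_LOG).items():
        print(ip, info["hits"], "hits,", info["no_referrer"], "without referrer")
        for agent in sorted(info["agents"]):
            print("   ", agent)
```

A browser-style user agent combined with HTTP/1.0 requests and no referrer on every hit is a pattern worth a second look, though none of these signs is conclusive on its own.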