
It seems everyone and their dog has a web bot these days. They are a major source of bandwidth use (which could cost you money), but, worse, each time one scrapes your site it registers in your web stats as a visit. Identifying bots is not easy. Some stats programs, like Firestats, the one I use on my blogs, have a list of known bots and ignores any hits from them. This is a good thing but there are unscrupulous bot overlords out there who try and disguise their activity by faking their browser/OS identification packet headers and spreading their hits out several minutes apart. Here’s a good example of one such bot out of the Ukraine* - probably a SPAM bot as the former SSR states are the major source of SPAM bots (it’s not just SPAM bots that are being unethical - there’s one bot out of Sweden that pulls the same trick and when one site owner confronted the bot owner he claimed to be developing a Nordic search bot.)
Note: those stats are not referrers, they are actual hits on the site from the same IP. 92.112.201.0/24 (that a Class C or 0-255 in the IP range) just hit my firewall deny rules
*inetnum: 92.112.109.0 - 92.113.255.255
netname: UKRTELNET
descr: Ukrtelecom IP access network
descr: NCC#
country: ua
remarks: E-mail for SPAM and abuse postmaster@ukrtel.net
admin-c: ARM42-RIPE
tech-c: ARM42-RIPE
status: ASSIGNED PA “status:” definitions
mnt-by: AS6849-MNT
source: RIPE # Filtered
person: Remiga Alexander
address: JSC UKRTELECOM
address: 18, Shevchenko blvd
address: Ukraine, Kiev
phone: +380 (44) 230-9024
nic-hdl: ARM42-RIPE
mnt-by: AS6849-MNT
source: RIPE # Filtered
Technorati Tags: web bot, bandwidth, web stats, Identifying bots, Firestats, SPAM
No comments yet.