Knowledge Archives
http://www.co-internet.net/net/articles/infoseekdebunk.txt.html

Infoseek-GO Spider Codes Debunked
------------------------------------------------------
by
Dirk Brockhausen
------------------------------------------------------

Many Infoseek spiders originate from hosts
that seem to bear a cryptic string of characters in
their domain names, e.g. cca26101.infoseek.com,
cca26111.infoseek.com, cca261e7.infoseek.com, etc.

While this may seem somewhat weird at first glance,
there is an underlying logic to it which can be
discerned when viewing the third level domain in
context with their original IP.

Let's review this using the following entry pair as a
practical example:

cca26101.infoseek.com - 204.162.97.1

The string "cca26101" is made up of 8 (i.e. 4 x 2)
characters:

cc|a2|61|01.

These twin characters can be converted to integers.
Note that to do this we will not be working the decimal
system here but, rather, the hexadecimal system which
is based on 16.
Within this system, the following assignments hold
true:

0 : 0
1 : 1
2 : 2
3 : 3
4 : 4
5 : 5
6 : 6
7 : 7
8 : 8
9 : 9
a : 10
b : 11
c : 12
d : 13
e : 14
f : 15

Based on this table, we can convert the twin
characters easily as follows:

cc : 12 x 16 + 12 = 192 + 12 = 204
a2 : 10 x 16 + 2 = 160 + 2 = 162
61 : 6 x 16 + 1 = 96 + 1 = 97
01 : 0 x 16 + 1 = 0 + 1 = 1

The base being 16, it is with 16 that the values
have to be multiplied.

Merging these results, we obtain:

204.162.97.1

This is identical with the IP of
cca26101.infoseek.com.

Let's take two more examples to illustrate that we
are not dealing with mere numerological coincidence
here.

cca261e7.infoseek.com - 204.162.97.231

cc : 12 x 16 + 12 = 192 + 12 = 204
a2 : 10 x 16 + 2 = 160 + 2 = 162
61 : 6 x 16 + 1 = 96 + 1 = 97
e7 : 14 x 16 + 7 = 224 + 7 = 231

Result: 204.162.97.231.

cde2ccee.infoseek.com 205.226.204.238

cd : 12 x 16 + 13 = 192 + 13 = 205
e2 : 14 x 16 + 2 = 224 + 2 = 226
cc : 12 x 16 + 12 = 192 + 12 = 204
ee : 14 x 16 + 14 = 224 + 14 = 238

Result: 205.226.204.238

The lowest value would be 00000000.infoseek.com,
the highest ffffffff.infoseek.com.

This equates to IPs 0.0.0.0 and 255.255.255.255
respectively - which are, of course, the limits
of any IP range.

Thus, what may seem like utter nonsense to the
uneducated eye, is shown to be a sequence of 4
integer blocks representing the spider's IP address.



This text may freely be republished or distributed in unmodified form provided the following resource box is included intact either at the beginning or the end of the article and a complimentary copy or notice (link) is sent to the author at the address specified below:

Ralph Tegtmeier is the co-founder and principal of fantomaster.com GmbH (Belgium), < http://fantomaster.com/ >, a company specializing in webmasters software development, industrial-strength cloaking and search engine positioning services.

He has been a web marketer since 1994 and is editor-in-chief of fantomNews, a free newsletter focusing on search engine optimization, available at: < http://fantomaster.com/fantomnews-sub.html > You can contact him at mailto:fneditor@fantomaster.com
(c) copyright 2002 by fantomaster.com
All rights reserved.
Downloaded at: < http://fantomaster.com/ >