<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>marketingfan.com &#187; index labels</title>
	<atom:link href="http://www.marketingfan.com/tags/index-labels/feed" rel="self" type="application/rss+xml" />
	<link>http://www.marketingfan.com</link>
	<description></description>
	<lastBuildDate>Tue, 29 Mar 2011 12:38:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Google Bowling via Proxy</title>
		<link>http://www.marketingfan.com/google-proxy-bowling</link>
		<comments>http://www.marketingfan.com/google-proxy-bowling#comments</comments>
		<pubDate>Thu, 16 Aug 2007 19:00:22 +0000</pubDate>
		<dc:creator>Christoph C. Cemper</dc:creator>
				<category><![CDATA[bowling]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google bowling]]></category>
		<category><![CDATA[google proxy bowling]]></category>
		<category><![CDATA[google proxy hacking]]></category>
		<category><![CDATA[google search]]></category>
		<category><![CDATA[hacking]]></category>
		<category><![CDATA[incredibill]]></category>
		<category><![CDATA[index labels]]></category>
		<category><![CDATA[matt twine]]></category>
		<category><![CDATA[penalties]]></category>
		<category><![CDATA[proxies]]></category>
		<category><![CDATA[proxy sites]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[supplemental index]]></category>
		<category><![CDATA[www google]]></category>
		<category><![CDATA[dan thies]]></category>

		<guid isPermaLink="false"></guid>
		<description><![CDATA[<p><a href="http://www.marketingfan.com/google-proxy-bowling">Google Bowling via Proxy</a><br/><br/>By <a href="http://www.marketingfan.com">Marketingfan.com Internet Markting Blog</a></p>
No matter what Google says, other evil competitors can "bowl" your site out of the Google search results by utilizing proxy sites.... Dan Thies has a great post up - and to illustrate that I thought I can add to this great post with a concrete examples, screen shots, sites and urls - no need to keep silent on this anymore... ]]></description>
			<content:encoded><![CDATA[	<p><p><a href="http://www.marketingfan.com/google-proxy-bowling">Google Bowling via Proxy</a><br/><br/>By <a href="http://www.marketingfan.com">Marketingfan.com Internet Markting Blog</a></p><br />
<div class="tweetmeme_button" style="float: left; margin-right: 10px;"><br />
<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fwww.marketingfan.com%2Fgoogle-proxy-bowling"><br />
<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fwww.marketingfan.com%2Fgoogle-proxy-bowling&style=normal&b=2" height="61" width="50" /><br />
</a><br />
</div>So with <span class="caps">SES </span>San Jose just around the corner, Dan Thies put up a great post detailling <a href="http://www.seofaststart.com/blog/google-proxy-hacking" title="">all the headaches</a> that a website owner could get when looking at his serps a bit closer or with <a href="http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business" title="">the right tools to do so</a> ...</p>

	<p>Dan calls this &#8220;Google Proxy Hacking&#8221;, but frankly, we are not hacking any of Google&#8217;s proxies &#8211; so I&#8217;m talking about <b>Google Bowling via Proxy Sites</b> &#8211; related to the older black hat term &#8220;Google Bowling&#8221; for buying too many / bad links for competitor sites to knock them off the serps. Yes, it IS possible to knock a competitor site off the <span class="caps">SER</span>Ps, altought <a href="http://www.google.com/support/webmasters/bin/answer.py?answer=34449&#038;query=harm&#038;topic=&#038;type" title="">Google says</a>= there is <s>nothing</s> <em>almost nothing</em> a competitor can do to harm you (yeah, right &#8211; the Google folks weakened this message some months ago, because the &#8220;nothing&#8221; was plain wrong &#8211; and they knew it).</p>

	<p>If you read thru <a href="http://www.seofaststart.com/blog/google-proxy-hacking" title="">Dan&#8217;s post</a> you might get <a href="http://www.seofaststart.com/blog/google-proxy-hacking#comment-688" title="">headaches just like this guy</a>  from all those details and the partly <b>wrong promises</b> for a cure for it with two solutions that <span class="caps">BOTH</span> address only the outdated part of the problem.</p>

	<p>So I though I have to illustrate to you what&#8217;s going on and <b>how Google Bowling via Proxies</b> actually looks like</p>

	<p><img src="/files/u2/proxy-dust-rules1-070816.png" width="716" height="391" alt="proxy-dust-rules1-070816.png" /></p>

	<p>The above results are returned if you search for the unique phrase</p>

	<p><b><br />
<a href="http://www.google.com/search?client=opera&#038;rls=en&#038;q=%22related+details+is+the+CEMPER.COM+expertise+that+you+can+order%22&#038;sourceid=opera&#038;num=10&#038;ie=utf-8&#038;oe=utf-8" title="">related details is the <span class="caps">CEMPER</span>.COM expertise that you can order</a>   </b></p>

	<p>which <s>is</s> was only found on my company site <a href="http://www.cemper.com" title="">cemper.com</a> ... (ok &#8211; now it&#8217;s also found on this marketingfan.com<br />
blog and on <a href="http://www.marketingfan.at" title="">marketingfan.at</a> as soon as we translate it)</p>

	<p><h3><b>But <span class="caps">WTH</span> is Proxy Dust ???</b></h3></p>

	<p>As you can see this unique phrase which <a href="http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business" title="">should id if my page is healthy</a> does not show my <a href="http://www.cemper.com" title="">own site</a> but &#8220;one of those <span class="caps">PITA</span> sites&#8221;: run by a guy called Matt Twine from the <span class="caps">UK </span>(if that IS his real name&#8230;)</p>

	<p>and as you can image the url <a href="http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t" title="">http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t</a> has an <span class="caps">EXACT</span> copy of my company site&#8217;s home page there&#8230;</p>

	<p>Did I hear Spam Report? yadda yadda &#8211; don&#8217;t bother &#8211; the Googlers don&#8217;t seem to care, because I submitted that 2 weeks ago&#8230;</p>

	<p><h3><b>But it get&#8217;s worse</b></h3></p>

	<p>Now clicking that &#8220;filter=0&#8221; to reveal all search results we see this <span class="caps">HUGE</span> list of pages &#8211; cemper.com coming second&#8230;. as a filtered result right after that proxy site used for google bowling&#8230;</p>

	<p><img src="/files/u2/proxy-dust-no1-cemper-3more-proxysites-part1.png" width="695" height="485" alt="proxy-dust-no1-cemper-3more-proxysites-part1.png" /></p>

	<p>[... pages cut out here &#8230; ]</p>

	<p><img src="/files/u2/proxy-dust-no1-cemper-3more-proxysites-part2.png" width="695" height="398" alt="proxy-dust-no1-cemper-3more-proxysites-part2.png" /></p>


	<p>But also we have a <a href="http://www.unblockfilters.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t" title="">couple more</a> <a href="http://www.glik.us/scgi-bin/nph-noxy.cgi/000110A/http/www.cemper.com" title="">scumbags</a> <a href="http://69.41.173.145/ru/www.cemper.com/" title="">stealing my content</a> and trying to hijack my site&#8230;</p>

	<p>In fact only the <a href="http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t" title="">ProxyDust copy wins</a> big time over <span class="caps">CEMPER</span>.COM because &#8230; believe it or not&#8230;</p>

	<p><b>that fricking domain registered in January 2007 got a Wikipedia backlink</b></p>

	<p>And <a href="http://www.cemper.com" title="">my site</a> does not.</p>

	<p>I currently think that&#8217;s the main reason why Google chose them over my own site &#8211; which is from 2000, not heavily SEOed, but I bet a handful more trusted than this Mark &#8220;Thief&#8221; Twine&#8217;s site.</p>

	<p>Well, it might well be that Mark has <span class="caps">NO CLUE</span> about what he does, but all those ads plastered around my site indicate different.</p>

	<p>In fact it appears the whole strategy of running those proxy sites is to earn money from the ads placed on other&#8217;s content and cashing in on their work&#8230;</p>


	<p><h3><b>What we (legit webmasters) can do&#8230; </b></h3></p>

	<p>Frankly, I love <a href="http://www.seofaststart.com/blog/google-proxy-hacking" title="">Dan&#8217;s general post</a> as an introduction to this post, because I would have hated to explain it in all length as he did.   <s>But what he points out as &#8220;solutions&#8221; are somewhat <b>old school methods</b> to identify bots that pretend to be Google, Yahoo or MSNbot&#8230;.  </s></p>

	<p>Dan&#8217;s post <span class="caps">ALSO</span> contains the 2nd method for sending <span class="caps">ALL</span> visitors a &#8220;noindex, nofollow&#8221; that do <span class="caps">NOT</span></p>

	<p>1) Identify as spiders<br />
2) Pass a &#8220;valid IP address&#8221; test</p>

	<p>Pretty cool &#8211; I think that might work &#8211; and will test this <span class="caps">ASAP</span>, in addition to my own method of blocking those scumbags.</p>


	<p>Further readings:</p>

	<p>I discussed this with <a href="http://incredibill.blogspot.com/2007/07/google-proxy-hijacking-myths-urban.html" title="">IncrediBill last week</a> who has a great post up on identifying fake bots &#8211; but his comment is also just</p>

	<p><blockquote><br />
<span class="caps">PROXYDUST</span> appears to just pass thru the user agent as-is, hard to say without seeing an actual hijacking if they do something special with Googlebot.</p>

	<p>Anyway, they operate out of uk2net and the easiest way to make sure you&#8217;ve got all their IPs is to just block the entire data center.</p>

	<p>inetnum: 83.170.96.0 &#8211; 83.170.111.255<br />
netname: <span class="caps">UK2</span>-NET<br />
route: 83.170.96.0/20<br />
</blockquote></p>

	<p>and then</p>

	<p><blockquote><br />
Automating it is sometimes proxy and behavior specific, nothing I could tell you how to do in a quick post.</p>

	<p>Some of them actually slip through the cracks for a while until they reveal themselves so it&#8217;s not 100% bulletproof.</p>

	<p>The only way to get most of them is to simply block all hosting centers.<br />
</blockquote></p>

	<p>I actually blocked a <span class="caps">TON</span> of IP ranges,including those of a rogue bot called Twiceler in the last 2 weeks&#8230;</p>

	<p>but the &#8220;noindex&#8221; hack mentioned above is the next countermeasure&#8230;</p>

	<p><b><span class="caps">I REALLY</span> hope I can generalize this to protect <span class="caps">ALL</span> my sites without having to change all of them&#8230;</b></p>



	<p>And then we got some more cool posts on</p>

	<p><a href="http://hamletbatista.com/2007/07/16/you've-won-the-battle-but-not-the-war-10-ways-to-protect-your-site-from-negative-seo/  ">10 Ways to protect your site from negative <span class="caps">SEO</span>&#8221;</a> where hamlet refers to &#8220;negative <span class="caps">SEO</span>&#8221; for all kinds of actions a competitor could take against you &#8230; frightening &#8230;. and <a href="http://hamletbatista.com/2007/07/03/the-never-ending-serps-hijacking-problem-is-there-a-definite-solution/" title="">Never Ending <span class="caps">SERP </span>Hijacking</a>  where he correctly states that the <span class="caps">REAL</span> problem are those sites like proxydust that <span class="caps">DO NOT</span> pretend to be Google&#8230;.</p>


	<p><h3>What about you?</h3></p>

	<p>Has <span class="caps">YOUR</span> site been hijacked? Do you know?</p>

	<p>How you could know? Just follow <a href="http://www.jimboykin.com/google-supplemental-results/" title="">Jim&#8217;s post</a>  to find if a page is in supplemental &#8230; but actually make sure you look at the results closely&#8230; because what you might find is that somebody is stealing your content&#8230;.</p>

	<p>You should do that for <span class="caps">EVERY PAGE</span> of your site &#8211; best case &#8211; if you <a href="http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business" title="">got the right tools</a> for it&#8230;. but it costs a lot of resources either way &#8211; by hand or by machine tool.</p>


	<p><b><br />
Let me know about <span class="caps">YOUR</span> hijack experiences !<br />
</b><br />
(and I&#8217;m sure people <em>should</em> talk about this at the <span class="caps">SES</span> in San Jose , however I fear they won&#8217;t too much&#8230;)</p>




	<p>Update: You could of course get around the initial problem of having too less trust in Google by <a href="http://www.marketingfan.com/tools/seo-tools/common-forward-links-tool-super-authority-links" title="">getting real juicy authority links</a> <a href="http://www.marketingfan.com/a/search-engines/3-great-uses-for-the-msn-linkfromdomain-command.php" title="">using <span class="caps">MSN</span>&#8217;s linkfromdomain command</a> by effectively even letting your competitor <a href="http://www.marketingfan.com/a/search-engines/research/indirect-linking-truncated-page-rank-and-getting-rid-of-link-buying-penalties.php" title="">link indirect to you</a>  ... obviously you still want to make sure you get only the <a href="http://www.marketingfan.com/search-engines/seo/link-building/strongest-subpages-suck-where-you-should-really-get-links" title="">juicy pages</a> and not spend your time with dead meat.</p>


 <div class="tweetthis" style="text-align:left;"><p> <a class="tt" href="http://twitter.com/intent/tweet?text=Google+Bowling+via+Proxy+http%3A%2F%2Fmarketingfan.com%2F%3Fp%3D1273" title="Post to Twitter"><img class="nothumb" src="http://www.marketingfan.com/wp-content/plugins/tweet-this/icons/en/twitter/tt-twitter-big4.png" alt="Post to Twitter" /></a> <a class="tt" href="http://delicious.com/post?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to Delicious"><img class="nothumb" src="http://www.marketingfan.com/wp-content/plugins/tweet-this/icons/en/delicious/tt-delicious-big4.png" alt="Post to Delicious" /></a> <a class="tt" href="http://delicious.com/post?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to Delicious">Post to Delicious</a> <a class="tt" href="http://reddit.com/submit?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to Reddit"><img class="nothumb" src="http://www.marketingfan.com/wp-content/plugins/tweet-this/icons/en/reddit/tt-reddit-big4.png" alt="Post to Reddit" /></a> <a class="tt" href="http://reddit.com/submit?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to Reddit">Post to Reddit</a> <a class="tt" href="http://stumbleupon.com/submit?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to StumbleUpon"><img class="nothumb" src="http://www.marketingfan.com/wp-content/plugins/tweet-this/icons/en/su/tt-su-big4.png" alt="Post to StumbleUpon" /></a> <a class="tt" href="http://stumbleupon.com/submit?url=http://www.marketingfan.com/google-proxy-bowling&amp;title=Google+Bowling+via+Proxy" title="Post to StumbleUpon">Stumble This Post</a></p></div>]]></content:encoded>
			<wfw:commentRss>http://www.marketingfan.com/google-proxy-bowling/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

