<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xml:base="http://www.marketingfan.com" xmlns:dc="http://purl.org/dc/elements/1.1/">
<channel>
 <title>supplemental index</title>
 <link>http://www.marketingfan.com/seo-glossary/supplemental-index</link>
 <description>The taxonomy view with a depth of 0.</description>
 <language>en</language>
<item>
 <title>Google Bowling via Proxy</title>
 <link>http://www.marketingfan.com/search-engines/google-proxy-bowling</link>
 <description> &lt;p&gt;
&lt;p&gt;So with &lt;span class=&quot;caps&quot;&gt;SES&lt;/span&gt; San Jose just around the corner, Dan Thies put up a great post detailling &lt;a href=&quot;http://www.seofaststart.com/blog/google-proxy-hacking&quot;&gt;all the headaches&lt;/a&gt; that a website owner could get when looking at his serps a bit closer or with &lt;a href=&quot;http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business&quot;&gt;the right tools to do so&lt;/a&gt; &amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Dan calls this &amp;#8220;Google Proxy Hacking&amp;#8221;, but frankly, we are not hacking any of Google&amp;#8217;s proxies &amp;#8211; so I&amp;#8217;m talking about &lt;b&gt;Google Bowling via Proxy Sites&lt;/b&gt; &amp;#8211; related to the older black hat term &amp;#8220;Google Bowling&amp;#8221; for buying too many / bad links for competitor sites to knock them off the serps. Yes, it IS possible to knock a competitor site off the SERPs, altought &lt;a href=&quot;http://www.google.com/support/webmasters/bin/answer.py?answer=34449&amp;#38;query=harm&amp;#38;topic=&amp;#38;type&quot;&gt;Google says&lt;/a&gt;= there is &lt;s&gt;nothing&lt;/s&gt; &lt;em&gt;almost nothing&lt;/em&gt; a competitor can do to harm you (yeah, right &amp;#8211; the Google folks weakened this message some months ago, because the &amp;#8220;nothing&amp;#8221; was plain wrong &amp;#8211; and they knew it).&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;If you read thru &lt;a href=&quot;http://www.seofaststart.com/blog/google-proxy-hacking&quot;&gt;Dan&amp;#8217;s post&lt;/a&gt; you might get &lt;a href=&quot;http://www.seofaststart.com/blog/google-proxy-hacking#comment-688&quot;&gt;headaches just like this guy&lt;/a&gt;  from all those details and the partly &lt;b&gt;wrong promises&lt;/b&gt; for a cure for it with two solutions that &lt;span class=&quot;caps&quot;&gt;BOTH&lt;/span&gt; address only the outdated part of the problem.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;So I though I have to illustrate to you what&amp;#8217;s going on and &lt;b&gt;how Google Bowling via Proxies&lt;/b&gt; actually looks like&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;img src=&quot;/files/u2/proxy-dust-rules1-070816.png&quot; width=&quot;716&quot; height=&quot;391&quot; alt=&quot;proxy-dust-rules1-070816.png&quot; /&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;The above results are returned if you search for the unique phrase&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt; &lt;b&gt;&lt;br /&gt;
&lt;a href=&quot;http://www.google.com/search?client=opera&amp;#38;rls=en&amp;#38;q=%22related+details+is+the+CEMPER.COM+expertise+that+you+can+order%22&amp;#38;sourceid=opera&amp;#38;num=10&amp;#38;ie=utf-8&amp;#38;oe=utf-8&quot;&gt;related details is the &lt;span class=&quot;caps&quot;&gt;CEMPER&lt;/span&gt;.&lt;span class=&quot;caps&quot;&gt;COM&lt;/span&gt; expertise that you can order&lt;/a&gt;   &lt;/b&gt;&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;which &lt;s&gt;is&lt;/s&gt; was only found on my company site &lt;a href=&quot;http://www.cemper.com&quot;&gt;cemper.com&lt;/a&gt; &amp;#8230; (ok &amp;#8211; now it&amp;#8217;s also found on this marketingfan.com &lt;/p&gt;
&lt;p&gt;blog and on &lt;a href=&quot;http://www.marketingfan.at&quot;&gt;marketingfan.at&lt;/a&gt; as soon as we translate it)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;h2&gt;&lt;b&gt;But &lt;span class=&quot;caps&quot;&gt;WTH&lt;/span&gt; is Proxy Dust ???&lt;/b&gt;&lt;/h2&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;As you can see this unique phrase which &lt;a href=&quot;http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business&quot;&gt;should id if my page is healthy&lt;/a&gt; does not show my &lt;a href=&quot;http://www.cemper.com&quot;&gt;own site&lt;/a&gt; but &amp;#8220;one of those &lt;span class=&quot;caps&quot;&gt;PITA&lt;/span&gt; sites&amp;#8221;: run by a guy called Matt Twine from the UK (if that IS his real name&amp;#8230;)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;and as you can image the url &lt;a href=&quot;http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t&quot;&gt;&lt;a href=&quot;http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t&quot;&gt;http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t&lt;/a&gt;&lt;/a&gt; has an &lt;span class=&quot;caps&quot;&gt;EXACT&lt;/span&gt; copy of my company site&amp;#8217;s home page there&amp;#8230; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Did I hear Spam Report? yadda yadda &amp;#8211; don&amp;#8217;t bother &amp;#8211; the Googlers don&amp;#8217;t seem to care, because I submitted that 2 weeks ago&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;h2&gt;&lt;b&gt;But it get&amp;#8217;s worse&lt;/b&gt;&lt;/h2&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Now clicking that &amp;#8220;filter=0&amp;#8221; to reveal all search results we see this &lt;span class=&quot;caps&quot;&gt;HUGE&lt;/span&gt; list of pages &amp;#8211; cemper.com coming second&amp;#8230;. as a filtered result right after that proxy site used for google bowling&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;img src=&quot;/files/u2/proxy-dust-no1-cemper-3more-proxysites-part1.png&quot; width=&quot;695&quot; height=&quot;485&quot; alt=&quot;proxy-dust-no1-cemper-3more-proxysites-part1.png&quot; /&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;[... pages cut out here &amp;#8230; ]&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;img src=&quot;/files/u2/proxy-dust-no1-cemper-3more-proxysites-part2.png&quot; width=&quot;695&quot; height=&quot;398&quot; alt=&quot;proxy-dust-no1-cemper-3more-proxysites-part2.png&quot; /&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;But also we have a &lt;a href=&quot;http://www.unblockfilters.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t&quot;&gt;couple more&lt;/a&gt; &lt;a href=&quot;http://www.glik.us/scgi-bin/nph-noxy.cgi/000110A/http/www.cemper.com&quot;&gt;scumbags&lt;/a&gt; &lt;a href=&quot;http://69.41.173.145/ru/www.cemper.com/&quot;&gt;stealing my content&lt;/a&gt; and trying to hijack my site&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;In fact only the &lt;a href=&quot;http://www.proxydust.com/index.php?q=aHR0cDovL3d3dy5jZW1wZXIuY29t&quot;&gt;ProxyDust copy wins&lt;/a&gt; big time over &lt;span class=&quot;caps&quot;&gt;CEMPER&lt;/span&gt;.&lt;span class=&quot;caps&quot;&gt;COM&lt;/span&gt; because &amp;#8230; believe it or not&amp;#8230; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;b&gt;that fricking domain registered in January 2007 got a Wikipedia backlink&lt;/b&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;And &lt;a href=&quot;http://www.cemper.com&quot;&gt;my site&lt;/a&gt; does not.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;I currently think that&amp;#8217;s the main reason why Google chose them over my own site &amp;#8211; which is from 2000, not heavily SEOed, but I bet a handful more trusted than this Mark &amp;#8220;Thief&amp;#8221; Twine&amp;#8217;s site.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Well, it might well be that Mark has NO &lt;span class=&quot;caps&quot;&gt;CLUE&lt;/span&gt; about what he does, but all those ads plastered around my site indicate different. &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;In fact it appears the whole strategy of running those proxy sites is to earn money from the ads placed on other&amp;#8217;s content and cashing in on their work&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;h2&gt;&lt;b&gt;What we (legit webmasters) can do&amp;#8230; &lt;/b&gt;&lt;/h2&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Frankly, I love &lt;a href=&quot;http://www.seofaststart.com/blog/google-proxy-hacking&quot;&gt;Dan&amp;#8217;s general post&lt;/a&gt; as an introduction to this post, because I would have hated to explain it in all length as he did.   &lt;s&gt;But what he points out as &amp;#8220;solutions&amp;#8221; are somewhat &lt;b&gt;old school methods&lt;/b&gt; to identify bots that pretend to be Google, Yahoo or MSNbot&amp;#8230;.  &lt;/s&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Dan&amp;#8217;s post &lt;span class=&quot;caps&quot;&gt;ALSO&lt;/span&gt; contains the 2nd method for sending &lt;span class=&quot;caps&quot;&gt;ALL&lt;/span&gt; visitors a &amp;#8220;noindex, nofollow&amp;#8221; that do &lt;span class=&quot;caps&quot;&gt;NOT&lt;/span&gt; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;1) Identify as spiders&lt;/p&gt;
&lt;p&gt;2) Pass a &amp;#8220;valid IP address&amp;#8221; test&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Pretty cool &amp;#8211; I think that might work &amp;#8211; and will test this &lt;span class=&quot;caps&quot;&gt;ASAP&lt;/span&gt;, in addition to my own method of blocking those scumbags.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Further readings:&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;I discussed this with &lt;a href=&quot;http://incredibill.blogspot.com/2007/07/google-proxy-hijacking-myths-urban.html&quot;&gt;IncrediBill last week&lt;/a&gt; who has a great post up on identifying fake bots &amp;#8211; but his comment is also just&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;div class=&quot;bb-quote&quot;&gt;&lt;b&gt;Incredibill wrote:&lt;/b&gt;&lt;br /&gt;
&lt;blockquote class=&quot;bb-quote-body&quot;&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;PROXYDUST&lt;/span&gt; appears to just pass thru the user agent as-is, hard to say without seeing an actual hijacking if they do something special with Googlebot.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Anyway, they operate out of uk2net and the easiest way to make sure you&amp;#8217;ve got all their IPs is to just block the entire data center.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;inetnum: 83.170.96.0 &amp;#8211; 83.170.111.255&lt;/p&gt;
&lt;p&gt;netname: UK2-&lt;span class=&quot;caps&quot;&gt;NET&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;route: 83.170.96.0/20&lt;/p&gt;&lt;/blockquote&gt;
&lt;/div&gt;
&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;and then&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;div class=&quot;bb-quote&quot;&gt;&lt;b&gt;Incredibill wrote:&lt;/b&gt;&lt;br /&gt;
&lt;blockquote class=&quot;bb-quote-body&quot;&gt;
&lt;p&gt;Automating it is sometimes proxy and behavior specific, nothing I could tell you how to do in a quick post.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Some of them actually slip through the cracks for a while until they reveal themselves so it&amp;#8217;s not 100% bulletproof.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;The only way to get most of them is to simply block all hosting centers.&lt;/p&gt;&lt;/blockquote&gt;
&lt;/div&gt;
&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;I actually blocked a &lt;span class=&quot;caps&quot;&gt;TON&lt;/span&gt; of IP ranges,including those of a rogue bot called Twiceler in the last 2 weeks&amp;#8230; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;but the &amp;#8220;noindex&amp;#8221; hack mentioned above is the next countermeasure&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;b&gt;I &lt;span class=&quot;caps&quot;&gt;REALLY&lt;/span&gt; hope I can generalize this to protect &lt;span class=&quot;caps&quot;&gt;ALL&lt;/span&gt; my sites without having to change all of them&amp;#8230;&lt;/b&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;And then we got some more cool posts on &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;a href=&quot;http://hamletbatista.com/2007/07/16/you’ve-won-the-battle-but-not-the-war-10-ways-to-protect-your-site-from-negative-seo/  &quot;&gt;10 Ways to protect your site from negative SEO&amp;#8221;&lt;/a&gt; where hamlet refers to &amp;#8220;negative SEO&amp;#8221; for all kinds of actions a competitor could take against you &amp;#8230; frightening &amp;#8230;. and &lt;a href=&quot;http://hamletbatista.com/2007/07/03/the-never-ending-serps-hijacking-problem-is-there-a-definite-solution/&quot;&gt;Never Ending &lt;span class=&quot;caps&quot;&gt;SERP&lt;/span&gt; Hijacking&lt;/a&gt;  where he correctly states that the &lt;span class=&quot;caps&quot;&gt;REAL&lt;/span&gt; problem are those sites like proxydust that DO &lt;span class=&quot;caps&quot;&gt;NOT&lt;/span&gt; pretend to be Google&amp;#8230;.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;h2&gt;What about you?&lt;/h2&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Has &lt;span class=&quot;caps&quot;&gt;YOUR&lt;/span&gt; site been hijacked? Do you know? &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;How you could know? Just follow &lt;a href=&quot;http://www.jimboykin.com/google-supplemental-results/&quot;&gt;Jim&amp;#8217;s post&lt;/a&gt;  to find if a page is in supplemental &amp;#8230; but actually make sure you look at the results closely&amp;#8230; because what you might find is that somebody is stealing your content&amp;#8230;.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;You should do that for &lt;span class=&quot;caps&quot;&gt;EVERY&lt;/span&gt; &lt;span class=&quot;caps&quot;&gt;PAGE&lt;/span&gt; of your site &amp;#8211; best case &amp;#8211; if you &lt;a href=&quot;http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business&quot;&gt;got the right tools&lt;/a&gt; for it&amp;#8230;. but it costs a lot of resources either way &amp;#8211; by hand or by machine tool.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;b&gt;&lt;/p&gt;
&lt;p&gt;Let me know about &lt;span class=&quot;caps&quot;&gt;YOUR&lt;/span&gt; hijack experiences !&lt;/p&gt;
&lt;p&gt;&lt;/b&gt;&lt;/p&gt;
&lt;p&gt;(and I&amp;#8217;m sure people &lt;em&gt;should&lt;/em&gt; talk about this at the &lt;span class=&quot;caps&quot;&gt;SES&lt;/span&gt; in San Jose , however I fear they won&amp;#8217;t too much&amp;#8230;)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Update: You could of course get around the initial problem of having too less trust in Google by &lt;a href=&quot;http://www.marketingfan.com/tools/seo-tools/common-forward-links-tool-super-authority-links&quot;&gt;getting real juicy authority links&lt;/a&gt; &lt;a href=&quot;http://www.marketingfan.com/a/search-engines/3-great-uses-for-the-msn-linkfromdomain-command.php&quot;&gt;using MSN&amp;#8217;s linkfromdomain command&lt;/a&gt; by effectively even letting your competitor &lt;a href=&quot;http://www.marketingfan.com/a/search-engines/research/indirect-linking-truncated-page-rank-and-getting-rid-of-link-buying-penalties.php&quot;&gt;link indirect to you&lt;/a&gt;  ... obviously you still want to make sure you get only the &lt;a href=&quot;http://www.marketingfan.com/search-engines/seo/link-building/strongest-subpages-suck-where-you-should-really-get-links&quot;&gt;juicy pages&lt;/a&gt; and not spend your time with dead meat.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;&lt;/p&gt;
 </description>
 <comments>http://www.marketingfan.com/search-engines/google-proxy-bowling#comments</comments>
 <category domain="http://www.marketingfan.com/a/search-engines/seo/penalties/index.php">penalties</category>
 <category domain="http://www.marketingfan.com/a/search-engines/index.php">search engines</category>
 <category domain="http://www.marketingfan.com/seo-glossary/google-bowling">google bowling</category>
 <category domain="http://www.marketingfan.com/seo-glossary/google-proxy-bowling">google proxy bowling</category>
 <category domain="http://www.marketingfan.com/seo-glossary/google-proxy-hacking">google proxy hacking</category>
 <category domain="http://www.marketingfan.com/seo-glossary/supplemental-index">supplemental index</category>
 <category domain="http://www.marketingfan.com/products-people-companies/dan-thies">dan thies</category>
 <category domain="http://www.marketingfan.com/products-people-companies/google">google</category>
 <category domain="http://www.marketingfan.com/products-people-companies/incredibill">incredibill</category>
 <category domain="http://www.marketingfan.com/products-people-companies/matt-twine">matt twine</category>
 <category domain="http://www.marketingfan.com/tags/bowling">bowling</category>
 <category domain="http://www.marketingfan.com/tags/google-search">google search</category>
 <category domain="http://www.marketingfan.com/tags/hacking">hacking</category>
 <category domain="http://www.marketingfan.com/tags/index-labels">index labels</category>
 <category domain="http://www.marketingfan.com/tags/proxies">proxies</category>
 <category domain="http://www.marketingfan.com/tags/proxy-sites">proxy sites</category>
 <category domain="http://www.marketingfan.com/tags/www-google">www google</category>
 <pubDate>Thu, 16 Aug 2007 20:00:22 +0200</pubDate>
 <dc:creator>Marketing Fan</dc:creator>
 <guid isPermaLink="false">1273 at http://www.marketingfan.com</guid>
</item>
<item>
 <title>Why Removed Supplemental Index Labels are good for my business</title>
 <link>http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business</link>
 <description> &lt;p&gt;
&lt;p&gt;Earlier this week Google has removed the &amp;#8220;supplemental index&amp;#8221; labels from the SERPs, and as with every major poops from Google the whole &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; scene freaked out on this! &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Me too &amp;#8211; because I have to thank Google for giving me &amp;#8211; and my clients a new competitive advantage.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;b&gt;Why do I thank Google for removing interesting signals?&lt;/b&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Well, until last week every wannabeo-seo and his mother could see (in the SERPs, see grandfathered sample below)&lt;/p&gt;
&lt;p&gt;if a page had a problem with ranking&amp;#8230; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;img src=http://farm2.static.flickr.com/1282/968031873_7e98839066_o.jpg&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;There have been huge posts by &lt;a href=&quot;http://www.jimboykin.com/damned-to-google-hell-supplemental-results/&quot;&gt;Jim&lt;/a&gt; , &lt;a href=&quot;http://www.seo4fun.com/notes/supplementals.html&quot;&gt;Halfdeck&lt;/a&gt; and &lt;a href=&quot;http://www.seo4fun.com/blog/2007/02/19/why-duplicate-content-causes-supplimental-results.html&quot;&gt;Halfdeck again&lt;/a&gt; and a lot more on what/why/where supplementals are. Even I posted about &lt;a href=&quot;http://www.marketingfan.com/a/45-of-zero-pages-listed-welcome-to-supplemental-hell.php&quot;&gt;Supplemental Hell&lt;/a&gt; and one &lt;a href=&quot;http://www.marketingfan.com/a/buy-blog-posts-get-supplemental.php&quot;&gt;PayPerPost link buying penalty&lt;/a&gt; bringing pages into the to supplemental index.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;b&gt;Today however&amp;#8230;&lt;/b&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;People need to put more effort into detecting if a page has problem due to being in the supplemental index.&lt;/p&gt;
&lt;p&gt;That means a certain (large) amount of SEOs just won&amp;#8217;t be able to do this in their everyday job.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Halfdeck has his own &lt;a href=&quot;http://www.seo4fun.com/php/pagerankbot.php&quot;&gt;Supplemenal Detector&lt;/a&gt; which is a fancy &lt;span class=&quot;caps&quot;&gt;JAVA&lt;/span&gt; application that is in fact a &amp;#8220;pagerank emulator&amp;#8221; &amp;#8211; &lt;s&gt;and all pages below a certain threshold are marked as supplemental.  I encourage you to download this data scraper, and I&amp;#8217;m sure it works nicely &amp;#8211; but haven&amp;#8217;t tried it. &lt;/s&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;After playing with Halfdeck&amp;#8217;s Pagerank emulator I must say that it&amp;#8217;s a great way to simulate how the &amp;#8220;link juice&amp;#8221; flows thru your site and where you are actually wasting precious link juice (i.e. on useless stats pages). Halfdeck even implemented a &amp;#8220;backlink emulator&amp;#8221; where you can judge on the effects of an additional PRx link to any page you like&amp;#8230; pretty cool tool &amp;#8211; it just lacks &lt;span class=&quot;caps&quot;&gt;TBPR&lt;/span&gt; live queries, but I hope he can add that in the next version.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;But, in fact I never cared much about the &lt;span class=&quot;caps&quot;&gt;TOTAL&lt;/span&gt; number of supplementals, but always if &lt;strong&gt;a single&lt;/strong&gt; page is in supplemental. Why that? &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Well, I guess Link Ninja Master Jim Boykin knows why &amp;#8211; it&amp;#8217;s because you don&amp;#8217;t want to get links on pages in the supplemental index because they won&amp;#8217;t get crawled as often.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Jim&amp;#8217;s recent explanation on finding &lt;a href=&quot;http://www.jimboykin.com/google-supplemental-results/&quot;&gt;if a page is in supplemental&lt;/a&gt; pretty well details how to detect if a page is &amp;#8220;healthy&amp;#8221; at all &amp;#8211; i.e. ranks for obscure terms.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;If a page does not even rank for an obscure terms on it, you don&amp;#8217;t need a link there.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;So you ask again, &lt;b&gt;why is this cool for your business?&lt;/b&gt;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Because the way to check if a page is worthy to spend time to get a link on it has just become a bit harder. You will need more work, time, effort, unless &lt;b&gt;you automate it&lt;/b&gt;. Just as we do here.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;And this is the perfect situation to use an &amp;#8220;internal tool&amp;#8221; (as many SEOs have) as a competitie advantage to get more and better links in a shorter time&amp;#8230; heck &amp;#8211; some link builders might spend another couple clicks on each page to find out if it qualifies for hunting for link. &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;We Don&amp;#8217;t :-)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;In fact we already see the &amp;#8220;Supplemental Index&amp;#8221; on top of our brower toolbar bar when we visit a page.&lt;/p&gt;
&lt;p&gt;In fact we already see the &amp;#8220;Supplemental Index&amp;#8221; label as it used to be printed for &lt;span class=&quot;caps&quot;&gt;ALL&lt;/span&gt; google users in the past, nicely embedded in the SERPs.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;And in fact Google has bought itself now 10-20 more Google queries per &lt;span class=&quot;caps&quot;&gt;SERP&lt;/span&gt; page my Link Arbeiter team screens when looking for links &amp;#8211; plus we are inflating the pageloads of all those sites we screen by one&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Do you think that hurts Google? Nah &amp;#8211; enough resources.&lt;/p&gt;
&lt;p&gt;Do you think it hurts me? Nah &amp;#8211; just got a bunch more proxy IPs to make up for the bigger Google scraping load. &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Do you think it will hurt the &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; scene building links? Well &amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;I would assume a huge bunch of people won&amp;#8217;t even notice a difference &amp;#8211; after all even SEOmoz de-classified &lt;a href=&quot;http://www.seomoz.org/blog/answer-these-ten-questions-before-you-charge-for-seo-services&quot; title=&quot;even large scale&quot;&gt;70% of&lt;/a&gt; &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; companies for not knowing the &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; basic&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Furthermore a couple of smaller equipped companies will struggle as they will need to do more a lot more work (as per Jim&amp;#8217;s description) to get the same results&amp;#8230;. and I mean &amp;#8211; A &lt;span class=&quot;caps&quot;&gt;LOT&lt;/span&gt; &lt;span class=&quot;caps&quot;&gt;MORE&lt;/span&gt;.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Now this moves the benefits to larger scale companies (as Google is) that DO have an intact &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; infrastructure for their daily &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; work.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;But the small scale link builders will first have to build that infrastructure, browser plugins and knowledge to make visible what Google has just taken from the public.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;h1&gt;&lt;b&gt;Thanks Google !&lt;/b&gt;&lt;/h1&gt;
&lt;/p&gt;
 </description>
 <comments>http://www.marketingfan.com/search-engines/why-removed-supplemental-index-labels-are-good-my-business#comments</comments>
 <category domain="http://www.marketingfan.com/a/search-engines/seo/link-building/index.php">link building</category>
 <category domain="http://www.marketingfan.com/a/search-engines/index.php">search engines</category>
 <category domain="http://www.marketingfan.com/a/tools/index.php">tools</category>
 <category domain="http://www.marketingfan.com/seo-glossary/supplemental-index">supplemental index</category>
 <category domain="http://www.marketingfan.com/products-people-companies/google">google</category>
 <category domain="http://www.marketingfan.com/products-people-companies/payperpost">payperpost</category>
 <category domain="http://www.marketingfan.com/tags/hell">hell</category>
 <category domain="http://www.marketingfan.com/tags/php">php</category>
 <pubDate>Sat, 04 Aug 2007 12:50:18 +0200</pubDate>
 <dc:creator>Marketing Fan</dc:creator>
 <guid isPermaLink="false">1270 at http://www.marketingfan.com</guid>
</item>
<item>
 <title>Google Infrastructure update - pagerank, supplementals, indexing</title>
 <link>http://www.marketingfan.com/a/google-infrastructure-update-pagerank-supplementals-indexing.php</link>
 <description> &lt;p&gt;
&lt;p&gt;Today I found that Matt Cutts, mr. GoogleGuy himself had two really super posts on his blog&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;a href=&quot;http://www.mattcutts.com/blog/infrastructure-status-january-2007/&quot;&gt;First Pagerank update 2007&lt;/a&gt; &amp;#8211; for those obsessed, yes, they are updating it once again&amp;#8230; some new values here and there&amp;#8230; who cares at all?&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;All remaining words on the green bar are here at &lt;a href=&quot;http://www.jimboykin.com/pagerank-4/&quot;&gt;Jim&amp;#8217;s blog&lt;/a&gt; who is bitching again about people caring too much (or anything at all) about page rank (but he&amp;#8217;s also got a nice post today with his internal &lt;a href=&quot;http://www.jimboykin.com/putting-a-price-on-a-link-jims-value-indicators/&quot;&gt;link valuation tool&lt;/a&gt; )&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Matt Cutts also gives another official explanation on what supplemental pages are, how they work and that they are improving their crawl frequency&amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;so supplementals are not older than 4 &lt;span class=&quot;caps&quot;&gt;MONTHS&lt;/span&gt; now anymore&amp;#8230;  :-)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Another funny thing is Matt &lt;a href=&quot;http://www.mattcutts.com/blog/why-isnt-email-authenticated/&quot;&gt;bitching about email authentication&lt;/a&gt; including pointing to concepts like domainkeys that I already &lt;a href=&quot;http://weblog.cemper.com/a/200611/29-domainkeys-experimental-implementation-worth-the-hassle.php&quot;&gt;bitched about&lt;/a&gt; some weeks before &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Looks like Mr. Cutts was hit by the same amount of &lt;a href=&quot;http://weblog.cemper.com/a/200701/10-how-to-get-rid-of-the-re-my-somecrap-spam.php&quot;&gt;RE: my crap spam&lt;/a&gt; that was hitting my own (G)mail accounts &amp;#8211; undetected :-)&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;&lt;span class=&quot;caps&quot;&gt;LOL&lt;/span&gt;&lt;/p&gt;
&lt;/p&gt;
 </description>
 <comments>http://www.marketingfan.com/a/google-infrastructure-update-pagerank-supplementals-indexing.php#comments</comments>
 <category domain="http://www.marketingfan.com/seo-glossary/pagerank-update">pagerank update</category>
 <category domain="http://www.marketingfan.com/seo-glossary/supplemental-index">supplemental index</category>
 <pubDate>Thu, 11 Jan 2007 17:50:57 +0100</pubDate>
 <dc:creator>Marketing Fan</dc:creator>
 <guid isPermaLink="false">1220 at http://www.marketingfan.com</guid>
</item>
<item>
 <title>Death of the SEO Copywriters - Spam Detection with Phrase Based Information Retrieval</title>
 <link>http://www.marketingfan.com/a/search-engines/research/death-of-the-seo-copywriters-spam-detection-with-phrase-based-information-retrieval.php</link>
 <description> &lt;p&gt;
&lt;p&gt;Bill Slawski of SEObytheSea has a &lt;a href=&quot;http://www.seobythesea.com/?p=413&quot;&gt;great post up&lt;/a&gt; explaining a concept of how search engines (Google, man!) do a phrase based analysis &amp;#8211; of your content to assign quality measures to it and possibly put it into the wastebasket or at least supplemental index.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;The idea is that quality documents have a different co-occurrence of certain phrases (&amp;#8220;money-words&amp;#8221;) than spammy or low quality articles you bought for two dollars each from that low-quality writer in India recently who wasn&amp;#8217;t even aware of how to use Word properly, not to speak about creating quality content&amp;#8230; &lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Certainly a &amp;#8220;SEOed&amp;#8221; article around a phrase, let&amp;#8217;s say &amp;#8220;President of the united states&amp;#8221; would use that term in all variations, word order and such.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;A quality article really talking about the President of the united states would probably mention other &amp;#8220;unimportant&amp;#8221; things like names of past presidents, non-important things like amorous adventures, hollywood careers or other generally bad habits of those big guys that nobody would place an Adwords bid on for example.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;The search engines just create a &lt;b&gt;co-occurance matrix&lt;/b&gt; for all phrases in the document and match those statistics against other quality documents.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;div class=&quot;bb-quote&quot;&gt;&lt;b&gt;patent,bill wrote:&lt;/b&gt;&lt;br /&gt;
&lt;blockquote class=&quot;bb-quote-body&quot;&gt;
&lt;p&gt;From the foregoing, the number of the related phrases present in a given document will be known. A normal, non-spam document will generally have a relatively limited number of related phrases, typically on the order of between 8 and 20, depending on the document collection. By contrast, a &lt;b&gt;spam document&lt;/b&gt; will have an excessive number of related phrases, for example on the order of between &lt;b&gt;100 and 1000 related phrases&lt;/b&gt;. Thus, the present invention takes advantage of this discovery by identifying as spam documents those documents that have a statistically significant deviation in the number of related phrases relative to an expected number of related phrases for documents in the document collection.&lt;/p&gt;&lt;/blockquote&gt;
&lt;/div&gt;
&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;So short &amp;#8211; that patent and the wonderful clear exlpanation by Bill outlines pretty well, that Google &amp;amp; co DO have the means and technology to judge on content quality &amp;#8230;&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;and that is the death for all &lt;span class=&quot;caps&quot;&gt;SEO&lt;/span&gt; &amp;#8220;copywriters&amp;#8221; just focussing on keyword density, repetition and keyword stuffing.&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;What does that mean for you if you &lt;span class=&quot;caps&quot;&gt;HIRE&lt;/span&gt; a writer for creating content?&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;DO &lt;span class=&quot;caps&quot;&gt;NOT&lt;/span&gt; overdo your specifications concerning keyword phrases to use!&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;Especially in the last months I have seen content rank &lt;span class=&quot;caps&quot;&gt;GREAT&lt;/span&gt; on Google (if on the right domains) for &lt;b&gt;related phrases&lt;/b&gt; versus phrases that were really used in the content&amp;#8230; you don&amp;#8217;t need to have an exact mention of a keyword phrase for it to be found on Google anymore!&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;
&lt;p&gt;NO, it could even harm you nowadays &amp;#8211; that&amp;#8217;s the next phase of overoptimization penalties &amp;#8211; create good, natural content and RANK!&lt;/p&gt;
&lt;/p&gt;
&lt;p&gt;&lt;/p&gt;
 </description>
 <comments>http://www.marketingfan.com/a/search-engines/research/death-of-the-seo-copywriters-spam-detection-with-phrase-based-information-retrieval.php#comments</comments>
 <category domain="http://www.marketingfan.com/a/search-engines/research/index.php">research</category>
 <category domain="http://www.marketingfan.com/seo-glossary/keyword-stuffing">keyword stuffing</category>
 <category domain="http://www.marketingfan.com/seo-glossary/supplemental-index">supplemental index</category>
 <category domain="http://www.marketingfan.com/products-people-companies/bill-slawski">bill slawski</category>
 <pubDate>Fri, 29 Dec 2006 14:03:12 +0100</pubDate>
 <dc:creator>Marketing Fan</dc:creator>
 <guid isPermaLink="false">1217 at http://www.marketingfan.com</guid>
</item>
</channel>
</rss>
