Santry Technology Solutions, Content Management, DotNetNuke, SharePoint Consulting
Register | Login
Tuesday, February 09, 2010

Sections
  
About Us
  
Partners
Downloads
  
 Developer News

Cracking the Google Code… Under the GoogleScope
5/8/2005 9:51:18 PM

Google's US Patent confirms information retrieval is based on historical data.

Author Name: Lawrence Deon

An Introduction: …if you thought you cracked the Google Code and had Google all figured out … guess again.

Google’s sweeping changes confirm the search giant has launched a full out assault against artificial link inflation & declared war against search engine spam in a continuing effort to provide the best search service in the world… and if you thought you cracked the Google Code and had Google all figured out … guess again.

Google has raised the bar against search engine spam and artificial link inflation to unrivaled heights with the filing of a United States Patent Application 20050071741 on March 31, 2005.

The filing unquestionable provides SEO’s with valuable insight into Google’s tightly guarded search intelligence and confirms that Google’s information retrieval is based on historical data.

What exactly do these changes mean to you?
Your credibility and reputation on-line are going under the Googlescope! Google has defined their patent abstract as follows:

”A system identifies a document and obtains one or more types of history data associated with the document. The system may generate a score for the document based, at least in part, on the one or more types of history data.”

Google’s patent specification reveals a significant amount of information both old and new about the possible ways Google can (and likely does) use your web page updates to determine the ranking of your site in the SERPs.

Unfortunately, the patent filing does not prioritize or conclusively confirm any specific method one way or the other.

Here’s how Google scores your web pages.

In addition to evaluating and scoring web page content, the ranking of web pages are admittedly still influenced by the frequency of page or site updates.
What’s new and interesting is what Google takes into account in determining the freshness of a web page.

For example, if a stale page continues to procure incoming links, it will still be considered fresh, even if the page header (Last-Modified: tells when the file was most recently modified) hasn’t changed and the content is not updated or ‘stale’.

According to their patent filing Google records and scores the following web page changes to determine freshness.
• The frequency of all web page changes
• The actual amount of the change itself… whether it is a substantial change redundant or superfluous
• Changes in keyword distribution or density
• The actual number of new web pages that link to a web page
• The change or update of anchor text (the text that is used to link to a web page)
• The numbers of new links to low trust web sites (for example, a domain may be considered low trust for having too many affiliate links on one web page).
Although there is no specific number of links indicated in the patent it might be advisable to limit affiliate links on new web pages. Caution should also be used in linking to pages with multiple affiliate links.

Developing your web page augments for page freshness.

Now I’m not suggesting that it’s always beneficial or advisable to change the content of your web pages regularly, but it is very important to keep your pages fresh regularly and that may not necessarily mean a content change.

Google states that decayed or stale results might be desirable for information that doesn't necessarily need updating, while fresh content is good for results that require it.

How do you unravel that statement and differentiate between the two types of content?

An excellent example of this methodology is the roller coaster ride seasonal results might experience in Google’s SERPs based on the actual season of the year.

A page related to winter clothing may rank higher in the winter than the summer... and the geographical area the end user is searching from will now likely be considered and factored into the search results.

Likewise, specific vacation destinations might rank higher in the SERPs in certain geographic regions during specific seasons of the year. Google can monitor and score pages by recording click through rate changes by season.

Google is no stranger to fighting Spam and is taking serious new measures to crack down on offenders like never before.

Section 0128 of Google’s patent filing claims that you shouldn't change the focus of multiple pages at once.

Here’s a quote from their rationale:

"A significant change over time in the set of topics associated with a document may indicate that the document has changed owners and previous document indicators, such as score, anchor text, etc., are no longer reliable.

Similarly, a spike in the number of topics could indicate spam. For example, if a particular document is associated with a set of one or more topics over what may be considered a 'stable' period of time and then a (sudden) spike occurs in the number of topics associated with the document, this may be an indication that the document has been taken over as a 'doorway' document.

Another indication may include the sudden disappearance of the original topics associated with the document. If one or more of these situations are detected, then [Google] may reduce the relative score of such documents and/or the links, anchor text, or other data associated the document."

Unfortunately, this means that Google’s sandbox phenomenon and/or the aging delay may apply to your web site if you change too many of your web pages at once.

From the case studies I’ve conducted it’s more likely the rule and not the exception.

What does all this mean to you?

Keep your pages themed, relevant and most importantly consistent. You have to establish reliability! The days of spamming Google are drawing to an end.

If you require multi page content changes implement the changes in segments over time. Continue to use your original keywords on each page you change to maintain theme consistency.

You can easily make significant content changes by implementing lateral keywords to support and reinforce your vertical keyword(s) and phrases. This will also help eliminate keyword stuffing.

Make sure you determine if the keywords you’re using require static or fresh search results and update your web site content accordingly. On this point RSS feeds may play a more valuable and strategic role than ever before in keeping pages fresh and at the top of the SERPs.

The bottom line here is webmasters must look ahead, plan and mange their domains more tightly than ever before or risk plummeting in the SERPs.

Does Google use your domain name to determine the ranking of your site?

Google’s patent references specific types of ‘information relating to how a document is hosted within a computer network’ that can directly influence the ranking of a specific web site.
This is Google's way of determining the legitimacy of your domain name.

Therefore, the credibility of your host has never been more important to ranking well in Google’s SERP’s.

Google states they may check the information of a name server in multiple ways.

Bad name servers might host known spam sites, adult and/or doorway domains. If you’re hosted on a known bad name server your rankings will undoubtedly suffer… if you’re not blacklisted entirely.

What I found particularly interesting is the criteria that Google may consider in determining the value of a domain or identifying it as a spam domain; According to their patent, Google may now record the following information:
• The length of the domain registration… is it greater than one year or less than one year?
• The address of the web site owner. Possibly for returning higher relevancy local search results and attaching accountability to the domain.
• The admin and the technical contact info. This info is often changed several times or completely falsified on spam domains; again this check is for consistency!
• The stability of your host and their IP range… is your IP range associated with spam?
• Securing a reputable host can and will go a long way in promoting your web site to Google.
Google’s rationale for domain registration is based on the premise that valuable domains are often secured many years in advance while domains used for spam are rarely secured for more than a year.

If in doubt about a host’s integrity I recommend checking their mail server at http://www.dnsstuff.com/ to see if they’re in the spam database. Watch for red flags!

If your mail server is listed you may have a problem ranking well in Google!
The simplest strategy may be registering your domain several years in advance with a reputable provider thereby demonstrating longevity and accountability to Google. Google wants to see that you’re serious about your site and not a flash in the pan spam shop.

http://www.tkqlhce.com/click-1604302-10294265

Google's Aging Delay has teeth… and they’re taking a bite out of spam!

It’s no big secret that Google relies heavily on links when it comes to ranking web sites.

According to their patent filing, Google may record the discovery date of a link and link changes over time.

In addition to volume, quality & the anchor text of links, Google’s patent illustrates possible ways how Google might use historical information to further determine the value of links.

For example, the life span of a link and the speed at which a new web site gets links.

“Burst link growth MAY be a strong indicator of search engine spam”.

This is the first concrete evidence that Google may penalize sites for rapid link acquisition. Whether the “burst growth” rule applies to high trust/authorative sites and directory listings remains unknown. I personally haven’t experienced this phenomenon. What’s clear for certain though is the inevitable end to results orientated link farming.

I would point out here that regardless of whether burst link growth will be tolerated for authorative sites or authorative link acquisition, webmasters will have to get smarter and work harder to secure authorative links as their counterparts become reluctant to exchange links with low trust sites. Now Page Rank really has value!

Relevant content swaps may be a nice alternative to the standard link exchange and allow you some control of the link page elements.

So what else does Google consider in determining the aging delay?
• The anchor text and the discovery date of links are recorded, thus establishing the countdown period of the aging delay.
• Links with a long-term life span may be more valuable than links with a short life span.
• The appearance and disappearance of a links over time.
• Growth rates of links as well as the link growth of independent peer pages.
• Again, this suggests that rapid link acquisition and the quality of peer pages are monitored
• Anchor text over a given period of time for keyword consistency.
• Inbound links from fresh pages… might be considered more important than links from stale pages.
• Google doesn't expect that new web sites have a large number of links so purchasing large numbers of brokered links will likely hurt you more than help you.

Google indicates that it is better for link growth to remain constant and naturally paced. In addition, the anchor text should be varied as much as possible.
• New web sites should not acquire too many new links; it’ll be tolerated if the links are from trusted sites but it may be considered spam.

So how do you build your link popularity / Page Rank and avoid penalties?

When it comes to linking, you should clearly avoid the hocus pocus or magic bullet linking schemes. If you participate in quick fix link exchange scams, use automated link exchange software or buy hundreds of links at once, chances are Google will interpret your efforts as a spam attempt and act accordingly.

Don’t get caught in this trap… the recovery period could be substantial since your host and IP range are also considered!

When you exchange links with other web sites, do it slowly and consistently.

Develop a link management and maintenance program. Schedule regular times every week to build the links to your site and vary the anchor text that points to your site.

Obviously, the links to your site should utilize your keywords. To avoid repetition use lateral keywords and keyword phrases in the anchor text since Google wants to see varied anchor text!

Your sites click through rate may now monitored through bookmarks, cache, favorites, and temporary files.

It’s no big secret that Google has always been suspected of rewarding sites with higher click through rates (very similar to what Google does with their AdWords program) so it shouldn’t come as a great surprise that Google still considers site stickiness and CTR tracking in their criterion.

What’s interesting though is Google is interested in tracking the behavior of web surfers through bookmarks, cache, favorites, and temporary files (most likely with the Google toolbar and/or the Google desktop search tool). Google’s Patent filing indicates Google might track the following information:
• Click through rates are monitored for changes in seasonality, fast increases, or other spike traffic in addition to increase or decrease trends.
• The volume of searches over time is recorded and monitored for
increases.
• The information regarding a web page's rankings are recorded and monitored for changes.
• Click through rates are monitored to find out if stale or fresh web pages are preferred for a search query.
• The traffic to a web page is recorded and monitored for changes… like Alexa.
• User behavior may be monitored through bookmarks, cache, favorites, and temporary files.
• Bookmarks and favorites could be monitored for both additions and deletions, and;
• The overall user behavior for trends and changes.
Since Google is capable of tracking the click-through rates to your web site, you should make sure that your web pages have attractive titles and utilize calls to action so that web surfers click on them in the search results.

It’s also important to keep your visitors there so make your web pages interesting enough so that web surfers stay some time on your web site. It might also help if your web site visitors added your web site to their bookmarks.

As you can see, Google’s new ranking criterion has evolved far beyond the reliance of criteria that can be readily or easily manipulated. One thing is for certain with Google, whatever direction search innovation is going; you can trust Google to be pioneering the way and setting new standards!

Copyright 2005 Lawrence Deon
http://www.rankingyourwaytothebank.com


Page Options:
format for printing  Format for Printer
email article  Email Page
add to your favorites   Add to Favorites
How would you rate the quality of this content?
Poor - - Excellent
Comments?
Overall Rating:
Comments Left:
Left on 11/16/2007 3:37:24 PM by Anonymous
Comments: go to <a href="http://mortgageloanshome.wordpress.com">mortgage loan lead</a>
No ratings available.
Left on 11/7/2007 8:36:54 PM by Anonymous
Comments: life settlement <a href="http://viaticals.golifesettlement.com">viatical life settlement broker </a>
No ratings available.
Left on 11/6/2007 10:21:47 PM by Anonymous
Comments: Live well after retirement, see how here...<a href="http://life-insurance-settlement.golifesettlement.com">life insurance settlement</a> <a href="http://www.golifesettlement.com">life settlement</a> <a href="http://www.golifesettlement.com">life settlement</a> <a href="http://life-settlements.golifesettlement.com">life settlements</a> <a href="http://viaticals.golifesettlement.com">viaticals</a> <a href="http://life-insurance-settlements.golifesettlement.com">life insurance settlements</a> <a href="http://senior-life-settlement.golifesettlement.com">senior life settlement - settlements</a> <a href="http://viatical-settlement.golifesettlement.com">viatical settlement - settlements</a> <a href="http://selling-a-life-insurance-policy.golifesettlement.com">selling a life insurance policy - sell</a> <a href="http://viatical.golifesettlement.com">viatical</a> <a href="http://senior-settlements.golifesettlement.com">senior settlements - settlement</a> <a href="http://life-settlements-alliance.golifesettlement.com">life settlements alliance - settlement</a> <a href="http://viatical-life-settlement .golifesettlement.com">viatical life settlement</a> <a href="http://viatical-assistance.golifesettlement.com">viatical assistance</a> <a href="http://life-insurance-viatical.golifesettlement.com">life insurance viatical</a> <a href="http://viatical-life-settlement-provider.golifesettlement.com">viatical life settlement provider - providers</a> <a href="http://viatical-life-settlement-brokers.golifesettlement.com">viatical life settlement brokers</a> <a href="http://viatical-life-settlement-broker.golifesettlement.com">viatical life settlement broker</a>
No ratings available.
Left on 10/10/2007 3:38:20 AM by Anonymous
Comments: games <a href="http://qwertyblogs.wordpress.com">online casino gambling site</a>
No ratings available.
Left on 10/6/2007 6:12:47 AM by Anonymous
Comments: . <a href="http://www.zbestcasinos.com/">slots online</a>
No ratings available.
Left on 10/5/2007 7:30:23 PM by Anonymous
Comments: games <a href="http://gamesstock.0catch.com">vegas style casinos online</a>
No ratings available.
Left on 10/4/2007 1:03:53 AM by Anonymous
Comments: canadian games<a href="http://trentonlint.wordpress.com/2007/08/02/trenton-loan/"></a> <a href="http://casinosincanada.741.com"></a> <a href="http://casinoscan.00freehost.com"></a> <a href="http://canadianblackjack.digitalzones.com"></a> <a href="http://blackjackroomscanada.createdollz.com"></a> <a href="http://vegasinfos.wordpress.com/"></a> <a href="http://www.aeonity.com/vegasinfo"></a> <a href="http://vegasnightlife.bloggles.info/2007/07/30/vegas-nightlife/"></a> <a href="http://vegasinsider.angelcities.com/index.htm"></a> <a href="http://lasvegasinformation.2pt.net/2007/07/30/las-vegas-information/"></a> <a href="http://kirktint.livejournal.com/"></a> <a href="http://yuriyuri.angelcities.com"></a> <a href="http://gregore.9cy.com"></a> <a href="http://xjingo.wordpress.com/2007/08/02/the-boat/"></a> <a href="http://www.aeonity.com/vesamesa"></a> <a href="http://trumped.galaxy99.net"></a> <a href="http://ruizso.wordpress.com/2007/08/02/ruiz-runners-on/"></a> <a href="http://www.fastpages.com/jimbojones/"></a> <a href="http://www.usapages.com/heavensrow/"></a> <a href="http://thedottedline.wordpress.com/2007/08/02/3/"></a> <a href="http://cruciouslucious.angelcities.com"></a> <a href="http://hurryup.galaxy99.net"></a> <a href="http://toughchoice.bloggles.info/2007/08/02/tough-choices/"></a> <a href="http://dhdhjc.0catch.com"></a>
No ratings available.
Left on 10/3/2007 9:18:38 AM by Anonymous
Comments: canadian games <a href="http://yuriyuri.angelcities.com">casinos british columbia canada</a>
No ratings available.
Left on 5/13/2007 12:37:40 PM by Anonymous
Comments: Loved it, booked it, blogged it!
Left on 3/31/2006 9:20:31 AM by Anonymous
Comments: Absolutely the best article I have read on Google's algo. As a new webmaster my own independent research is verified. Mannny, many thanks for for so much info in one place... chelsea
Left on 3/9/2006 12:44:53 AM by Anonymous
Comments: This would be a nice article if first you understood the fact that if Google patents an application or series of applications does not mean that they are in fact used in their search engine or that they are the rosetta stone to understanding Googles algo.

To the best of my knowldge and that of respected SEOs  NONE of these items that have been discussed are in use by Google and some of them probably never will be.
Left on 3/5/2006 12:11:22 PM by Anonymous
Comments: GOOGLE SUX ! THAT'S WHAT I THINK.
No ratings available.
Left on 7/5/2005 11:51:32 AM by Anonymous
Comments: Great information, I am a new webmaster and much of this I am still learning.  However, this has been expectionally informative.
Left on 7/3/2005 11:50:51 PM by Anonymous
Comments: Why don't you just lay it on the line - phakers are phucked, as they well deserve to be. Haven't any of these morons stymied by Google ever heard of the expression, "Honesty is the best policy." Hell, don't blamce Google if they see through the flim-flam some so-called SEO expert coded for you. Get a good book on web design and ask yourself WWJD as you upload your creation.
Left on 7/1/2005 10:39:52 AM by Anonymous
Comments: Im glad I found this! thanks for the info!
No ratings available.
Left on 5/29/2005 5:25:37 PM by Anonymous
Comments: Excellent article.
Page rank toolbars have stopped working today, perhaps they are going about implementing some of these changes?

Chris
http://www.philosophyarchive.com
Left on 5/24/2005 12:59:17 AM by Anonymous
Comments: Excellent, and of course bookmarked.
Left on 5/20/2005 7:32:09 AM by Anonymous
Comments: Hands down the best article I've read on Google.
Left on 5/19/2005 7:59:50 PM by Anonymous
Comments: Top notch explanation of the dampening filter. Great article.
Left on 5/19/2005 4:07:24 PM by Anonymous
Comments: When good sites get buried in google and spam with adsense goes to top maybe this is what google wants for $$$ mabe this part of google not broken just crooked.
Left on 5/19/2005 3:52:06 PM by Anonymous
Comments: Google is broken. Has been broken since February.  Google programmers aren't as great as everyone thinks they are - they are making lots of stupid errors and unintended goofs - and they don't seem to be able to fix it.
Left on 5/18/2005 1:51:57 AM by Anonymous
Comments: To the point !
One of the best articles I've read on this subject. Keep up the good work
Left on 5/16/2005 6:49:36 AM by Anonymous
Comments: Excellent article. keep up the good work.

http://www.bax-shop.nl
Left on 5/15/2005 10:32:10 PM by Anonymous
Comments: This is the most informative article I've read in a long time! Good jod.
Left on 5/15/2005 10:30:03 AM by Anonymous
Comments: Excellent article! Did you read this part of the article: "The filing unquestionable provides SEO’s with valuable insight into Google’s tightly guarded search intelligence... Unfortunately, the patent filing does not prioritize or conclusively confirm any specific method one way or the other."
Left on 5/15/2005 8:59:11 AM by Anonymous
Comments: The idea that just because a patent application has been filed you now have proof that Google is now doing all these wonderful and contradictory things. Google has file many search technology patents but, few of them ever see the light of day in actual rankings.
Left on 5/14/2005 10:41:56 AM by Anonymous
Comments: First class resource, and well explained.
Left on 5/14/2005 7:14:17 AM by Anonymous
Comments: Well explained easy to understand
Left on 5/13/2005 5:00:19 PM by Anonymous
Comments: Thanks for an excellent article!
Left on 5/12/2005 5:04:14 PM by Anonymous
Comments: again the question still remains, how does any of this stop a competitor from complete and utter sabatage?  Couldn't they simply order the link farm package, point it at my site, and walk away laughing?

No ratings available.
Left on 5/12/2005 2:25:12 PM by Anonymous
Comments: Wel nothing really new here. Seems to me Google is trying to improve their search result in ways that allow real content to surface. Fine with me. ;-P
Left on 5/12/2005 3:39:29 AM by Anonymous
Comments: Insightful
No ratings available.
Left on 5/12/2005 12:04:47 AM by Anonymous
Comments: Excellent! I look forward to your future updates...
Left on 5/11/2005 11:16:15 PM by Anonymous
Comments: That is an excellent article.  Thank you for writing it.
Left on 5/11/2005 9:31:26 PM by Anonymous
Comments: best article that i have read in a long time... thanks!
No ratings available.
Left on 5/11/2005 7:15:55 PM by Anonymous
Comments: Awesome! Why would the Big G bother with any of this if it wasn't to eliminate link spam?
Left on 5/11/2005 2:57:04 PM by Anonymous
Comments: Comments from the following blog: Patrick Santry's (aka wwwCoder) Blog, located at: http://blogs.wwwcoder.com/psantry/archive/0001/01/01/4123.aspx
No ratings available.
Left on 5/11/2005 11:51:38 AM by Anonymous
Comments: nice article on this matter at
http://www.wwwcoder.com/main/parentid/285/site/4748/266/default.aspx as you said. The authors slant terms help me to decide to disregard his opinions as just that. Just the last line alone dismisses his thought as spamvertising.
Left on 5/11/2005 11:32:57 AM by Anonymous
Comments: Hmm, how easy will it be to sabatouge a competitor by making him look like a spammer?  Hmm.....
No ratings available.
Left on 5/11/2005 7:43:52 AM by Anonymous
Comments: Another article on the Google patent: http://www.wwwcoder.com/main/parentid/285/site/4748/266/default.aspx
No ratings available.
Left on 5/10/2005 11:59:46 PM by Anonymous
Comments: One of the most informative articles I've read in some time.
Left on 5/10/2005 11:07:50 PM by Anonymous
Comments: Nice work. It's always good to get SEO info form those who actually have an understanding of it. 5 Kudos points from me.
No ratings available.
Left on 5/10/2005 10:24:17 PM by Anonymous
Comments: i think author meant to say first published march 31 2005. very informative article.
Left on 5/10/2005 9:58:27 PM by Anonymous
Comments: Very helpful. Basically real sites with real content should continue doing what they are now doing.

Left on 5/10/2005 9:47:59 PM by Anonymous
Comments: This is a fantastic article! On the contrary, I think the author understands patents extremely well by indicating "may" or "might". I seriously doubt Google would file a patent on technology and not employ it? I have also seen SEOs report mixed patent filing dates.
Left on 5/10/2005 8:07:14 PM by Anonymous
Comments: The author doesn't understand patents.  The fact that Google has filed a patent on an idea is in no way to be taken as proof that the idea in the patent is what Google does (or, for that matter, doesn't do) on their web site.
Left on 5/10/2005 7:53:21 PM by Anonymous
Comments: i second the 3rd comment. what more, i see no potential to stop farming anywhere. it's really just different ways to do it. e.g. Click through rates are monitored for changes in seasonality, fast increases, or other spike traffic in addition to increase or decrease trends. set up a bot net, and it is slashdot will look like a low traffic site.
Left on 5/10/2005 7:15:34 PM by Anonymous
Comments: How about not trying to second guess the search algorithm and simply JUST WRITE GOOD CONTENT
Left on 5/10/2005 6:27:12 PM by Anonymous
Comments: Thanks for this great article! :)
Left on 5/9/2005 2:47:21 PM by Anonymous
Comments: I looked over the Google patent application.  One of the salient facts about the application is that it was filed 12/31/03, with origins going back to 9/30/03.  Given this topic relates to what I understand is a dynamic cataloging technology, the fact that the patent application is over 18 months old (not 2 mos. old as the article indicates) limits the relevance of this analysis.
  

Send us your news or commentary! Our RSS feeds are used by hundreds of sites on the Web and syndicated by Google News, Moreover, and MSN News. Target your press release to thousands of technical professionals every day. Email us at press @ wwwcoder.com with your press release or information.


 Latest Articles
  

 Latest News
  

Spotlight
Syndication

 


 


Digg This
 


DotNetNuke Platinum Benefactor

  
 

 Terms Of Use | Privacy Statement
 Copyright 2008 - Santry Technology Solutions, Box 172, Girard, PA 16417, (814) 774-0970