Last year, potpiegirl found a document from Google, by searching the Google index, that was entitled “2011 Google Quality Raters Handbook”. Today doing the same thing, I found a new leaked, 161 pages Google document entitled :
“General Guidelines Version 3.27 – June 22, 2012￼￼”
I will not provide a link to this document, due to obvious reasons, but you can try to search, either on Google or the Social Media sites and you will probably find it.
What I will share with you in this post, is my opinion on the contents of this document.
The document is focused on
- identifying webspam (technical analysis – keyword stuffing,redirects etc/ authority of the blog,site/ commercial intent – thin affiliates)
- webpage rating (content type/ language/ flags)
- geo-location webpage rating (if a page is geographically related to the query)
- query (user intent rating)
Aproximately 60 pages talk about the “Rating Guidelines” and explain to the person that is going to be the rater the concepts. A few of them are:
- Understanding the Query
- Classification of User Intent: Action, Information, and Navigation – “Do-Know-Go”
- The Rating Scale
- Vital Pages for People Queries
The next 16 pages are about “URL Rating Tasks with User Locations”.
32+19 pages talk about “Page Quality Rating Guidelines” & “Rating Examples”.
Here is a screenshot of the Page Quality rating task page:
One particular thing that is really interesting, is an example about “Named Entity Queries”.
Snapshot talks by itself.
Part 5 (15 pages) of the document is about “Webspam Guidelines”.
This looks at things like:
- Technical Signals
- Fake vs Ok sites comparisons
- Commercial intent of webspam pages
The last chapter is about the EWQQ that evaluation system that the rater will use.
Part 7 and 8 are :
- Quick Guide to URL Rating
- Quick Guide to Webspam Recognition
To conclude this shows us that Google still uses an army of raters to train their algorithms and validate situations that the algorithm can not do by itself.
It is a pure user oriented view, from Google’s side and everything is related to the User Experience. This is used both for natural search and paid search but it seems they are mostly focused on the natural part.
Reading the document will provide both insight into the Google mindset and a technical view on some of the aspects of SEO.