Microsoft Publishes Duplicate Filter Patent
November 6th, 2006 by pamela
Microsoft published a patent last week which discusses some of the indicators that might be used to determine which results are shown and which to hide. With engines constantly striving to provide the most relevant results, avoiding duplicated results is key, a process which the patent application provides some insight into.
According to the abstract; “A system and method are provided for optimizing a set of search results typically produced in response to a query. The method may include detecting whether two or more results access equivalent content and selecting a single user-preferred result from the two or more results that access equivalent content. The method may additionally include creating a set of search results for display to a user, the set of search results including the single user-preferred result and excluding any other result that accesses the equivalent content. The system may include a duplication detection mechanism for detecting any results that access equivalent content and a user-preferred result selection mechanism for selecting one of the results that accesses the equivalent content as a user-preferred result.”
Fortunately it does go on to be a bit less geek speak and a little more easily digestible.
Taking into account various factors including URLs, navigation and domain extensions, this patent indicates how Microsoft might filter comparable results. Take a look at SEO by the Sea for an analysis of the patent.
November 6th, 2006 at 2:26 pm
This is a very interesting concept that has been about for many a year. Who gets the credit fora piece of content that many websites share?
Obviously there will be winners and losers within certain industries such as travel, where content is rarely unique.Many Affiliate sites may also suffer badly from these changes.
As the major search engines address these issues, there is no gaurantee that the original author of the content will claim their rightful position in the SERP’s. All search engines have a different perception of what the correct answer to a query should be.
If ever there was a call to websites to produce unique content and address natural search, it was now.