The basis of much of the intelligence on the Web is the hyperlink structure which represents an organising principle based on the human facility to be able to discriminate between relevant and irrelevant material. Second generation search engines like Google make use of this structure to infer the authority of particular web pages. However, the linking mechanism provided by HTML does not allow the author to express different types of links such as positive or negative endorsements of page content. Consequently, algorithms like PageRank produce rankings that do not capture the different intentions of web authors. In this paper, we review some of the initiatives for adding simple semantic extensions to the link mechanism. Using a large real world dataset, we demonstrate the different page rankings produced by considering extra semantic information in page links. We conclude that Web intelligence would benefit in adoption of languages that allow authors to easily encode simple semantic extensions to their hyperlinks.
Page-reRank: using trusted links to re-rank authority