google

Reviews
outstanding!
Rated 10 out of 10

October 09, 2008 (1 years 1 ago)
Great doc!

Shared by: Tara Sims
Stats
views:
410
rating:
10(1)
reviews:
1
posted:
8/3/2008
language:
English
pages:
0
How Did the Developers of Google™ Rely on Discrete Mathematics? Gerald Kruse Juniata College Huntingdon, PA USA kruse@juniata.edu The recent media attention during its Initial Public Offering served to underscore Google’s position as the most ubiquitous search engine. The main reason for Google’s popularity is its uncanny ability to provide useful search results. Although there have been many attempts to manipulate these search results, including the well known “Google Bombs”[1], it is safe to assume that most people are not aware that Google’s PageRank is determined using linear algebra and graph theory. When considering how to rank the importance, or PageRank, of web pages, the developers of Google decided to exploit the link structure of the World Wide Web. Rather than just ranking a page solely on its number of backlinks (number of pages pointing to it), the developers wanted to consider the popularity of the backlinks as well. For example, as Page, Brin, et. al. explain in [2], if a page has just one backlink, but it is from the Yahoo home page, then it should have a higher page rank than another page which has several backlinks which are from obscure sites. This leads to their intuitive definition: “A page has high rank if the sum of the ranks of its backlinks is high.” Let’s convert the above to a mathematical definition. We begin by letting u be a web page, R(u ) be the simple PageRank of u , Fu be the set of pages u points to, and B u be the set of pages pointing to u . This leads to number of links from u , and the intuitive definition of PageRank becomes R (u )  c N u  Fu , or the R (v ) . vBu N v  To demonstrate, we will calculate PageRank in an example. We start with the following directed graph containing three web pages, where a directed edge from one page (vertex) to another represents a link. Page A NA  2 Page B NB  1 We apply the PageRank formula to get Page C NC  1 R(C )   c 0  0   1    R( A)  R( B)  c  0  0  2   R( A) R( B)  R(C )  c   0 1  2  R( A)  .  0 0 1  R ( A)   R ( A)       1 0 0  R ( B )  . In this formulation it This system of equations can be written in matrix notation as:  R ( B )   c 2    R (C )   1 1 0  R (C )     2  is obvious that the PageRank vector is simply an eigenvector of the coefficient matrix. Solving and normalizing gives PageRanks of 0.4, 0.2, and 0.4 for pages A, B, and C, respectively. C has backlinks from both A and B, so it is a popular page. A has only one backlink, but it is from a popular page, so it has a high rank too. However, B has one backlink from a popular page, why doesn’t it have a high rank? Well, N A  2 , so A’s popularity is spread over two links. With the billions of pages on the web to rank, the eigenvector calculation could be problematic. Thankfully, from the definition of R(u ) , all entries in the coefficient matrix will be between 0 and 1 inclusive, so it is stochastic [3]. In addition, the developers added a random-surfer term to their PageRank formula [2], so it becomes R (u )  c R (v )  cE (u ) . vBu N v  This term guards against the rank propagation getting caught in a “rank-sink” endless loop, and it insures that the coefficient matrix is regular, which allows for a computationally easier iterative eigenvector calculation. References: [1] http://news.bbc.co.uk/2/hi/americas/3298443.stm [2] Page, Brin, Motwani, Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Stanford Digital Library Technologies Project. [3] Poole, David. Linear Algebra, A Modern Introduction. Brooks-Cole, 2003. Acknowledgements: The author would like to thank Henry Walker for distributing [2] at the 2003 MAA-PREP Workshop on Discrete Mathematics.

Related docs
Gains google with google money
Views: 7  |  Downloads: 0
google headquarters
Views: 63  |  Downloads: 1
Google Logos
Views: 236  |  Downloads: 15
Learn Google
Views: 106  |  Downloads: 11
Google Book
Views: 218  |  Downloads: 4
Google and Google
Views: 13  |  Downloads: 2
Google Phone
Views: 566  |  Downloads: 5
Searching tips -Google tools
Views: 91  |  Downloads: 20
nsknsk5
Views: 11  |  Downloads: 1
Google
Views: 8  |  Downloads: 3
nsknsk4
Views: 12  |  Downloads: 1
Google speaks
Views: 136  |  Downloads: 15
Google Offices Worldwide
Views: 393  |  Downloads: 17
premium docs
Other docs by Tara Sims
Kmart Black Friday 2009 AD
Views: 93  |  Downloads: 1
Toys R US 2009 Black Friday Flyer
Views: 94  |  Downloads: 4
Game Stop 2009 Black Friday Ad
Views: 46  |  Downloads: 0
Walmart Black Friday Ad 2009
Views: 343  |  Downloads: 10
Google Doodle Water on the Moon
Views: 32  |  Downloads: 0
Save 20 off 100 at Best Buy when You Use Paypal
Views: 250  |  Downloads: 0
SEC v GALLEON Management
Views: 107  |  Downloads: 2
Deck the Halls Checklist
Views: 68  |  Downloads: 0
Christmas Dinner Checklist
Views: 201  |  Downloads: 4