Advanced Topics in Databases The Anatomy of a Large

Reviews
Shared by: One Seven
Stats
views:
31
rating:
not rated
reviews:
0
posted:
4/23/2009
language:
English
pages:
0
Advanced Topics in Databases The Anatomy of a Large-Scale Hypertextual Web Search Engine Hasso-Plattner-Institut Potsdam Fachgebiet Informationssysteme Markus Güntert WS 2008/2009 20.01.2009 1 The Anatomy of a Large-Scale Hypertextual Web Search Engine • • • • Sergey Brin, Lawrence Page Computer Science Department, Stanford University 1998 veröffentlicht “an in-depth description […] – the first such detailed public description we know of to date” The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 2 Agenda • • • • • • • • • Autoren Ziele von Google Standpunkt 1998 PageRank weitere Hypertext-Informationen Architektur (vereinfacht) Suchanfragen Future Work Fazit The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 3 Autoren • Sergey Brin – * 1973 in Moskau, 1979 Migration in die USA – Bachelor of Science in Mathematik u. Informatik (University of Maryland, College Park), Masterstudium in Stanford – Promotion bis heute nicht fertig gestellt • Lawrence Page – * 1973 in Michigan – Bachelor of Science in Computer Engineering (University of Michigan), Masterstudium in Stanford – “The ultimate search engine would understand exactly what you mean and give back exactly what you want.” The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 4 Ziele von Google • Google ist Prototyp einer “large-scale search engine” – Crawling, Indexing, Sorting – Konzentration auf Qualität von Suchergebnissen – Berücksichtigung des Wachsens des Webs sowie des technologischen Fortschrittes – wissenschaftliche Arbeit im Bereich der Suchmaschinen – Zugänglichkeit für Jedermann – ~ 100 000 000 indizierte Webseiten • 2008: ~ 1 000 000 000 000 1 – ~ 10 000 000 Anfragen pro Tag • 2006: ~ 91 000 000 2 _______________ 1 http://googleblog.blogspot.com/2008/07/we-knew-web-was-big.html 2 http://searchenginewatch.com/2156461 5 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 Standpunkt 1998 • “human-maintained” Seiten wie Yahoo! – subjektiv, teuer, unvollständig, langsam • reines Keyword-Matching liefert viele schlechte Ergebnisse – kein (ausgereiftes) Ranking der Ergebnisse – Indizes werden größer – Aufnahmevermögen des Nutzers bleibt gleich • Manipulation von Suchmaschinen – “Junk results” The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 6 7 PageRank • Ziel: Priorisierung von Suchergebnissen • Menge verlinkter Dokumente anhand Struktur bewerten und gewichten – Seite gilt als wichtig, wenn viele Seiten auf sie verlinken – Seite gilt als wichtig, wenn wichtige Seiten auf sie verlinken • Prinzip des Zitats für Web adaptiert – subjektives Verständnis von Wichtigkeit kommt dem nahe The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 8 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 9 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 10 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 11 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 12 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 13 PageRank – Beispiel P1 P2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 14 PageRank – Berechnung r ( Pi ) = Pj ∈BPi ∑ r ( Pj ) Pj • Summe aller PageRanks von verlinkenden Seiten Pj – normalisiert • rekursiv! The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 15 PageRank – Berechnung • iterativer Ansatz P1 P2 rk +1 ( Pi ) = Pj ∈BPi ∑ rk ( Pj ) Pj Iteration 0 Iteration 1 Iteration 2 1 4 1 r0 ( P2 ) = 4 1 r0 ( P3 ) = 4 1 r0 ( P4 ) = 4 r0 ( P ) = 1 1 4 3 r1 ( P2 ) = 8 1 r1 ( P3 ) = 4 1 r1 ( P4 ) = 8 r1 ( P ) = 1 1 4 1 r2 ( P2 ) = 4 3 r2 ( P3 ) = 8 1 r2 ( P4 ) = 8 r2 ( P ) = 1 Rank nach 100 Iterationen 3 1 4 2 P3 P4 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 16 PageRank – intuitiv • Nachahmung eines zufällig durch das Netz surfenden Users – – – – zufällige Startseite Klicken von Links (ohne jemals zurück zu gehen) Anfordern einer neuen zufälligen Startseite möglich Wahrscheinlichkeit für Finden einer Seite entspricht deren PageRank The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 17 weitere Hypertext-Informationen • Anchor Text Large-Scale Hypertextual Web Search Engine – teilweise präzisere Beschreibungen von Webseiten – nicht-indizierbare Inhalte (Bilder, Audio, Video,…) • Visual Presentation Details – (relative) Schriftgröße – fett, kursiv etc. DAS ist wichtiger als das. • Location Information – URL, , <meta> – Proximität The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 18 Architektur (vereinfacht) 19 Architektur Crawler • automatisiertes Durchsuchen und Analysieren von Webseiten The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 20 Architektur Crawler Repository • Speicherung der gecrawlten Webseiten – in komprimierter Form • jede Webseite erhält eindeutige docID The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 21 Repository docID http://infolab.stanford.edu/~backrub/google.html The Anatomy of a Search Engine 1234 In this paper we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying results that existing systems … . . . . . . http://en.wikipedia.org/wiki/Google_search Google Search – Wikipedia 98765 Google search is a Web search engine owned by Google Inc. and is the most used search engine on the Web. Google receives several hundred million queries each day through its various services … The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 22 Architektur Crawler Repository Indexer • Parsen der Seiten im Repository • für jedes Dokument wird eine 'Hit List' von Wörtern angelegt, … – wordID The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 23 Hits • jedes Wort, das in einem Dokument vorkommt, ist ein Hit • plain Hits • fancy Hits – URL, <title>, <meta>, Anchor Text The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 24 Architektur Crawler Repository Indexer Barrels • … diese werden als Forward Index gespeichert – sortiert nach docID The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 25 Forward Index docID 1234 wordID anatomy search engine Google Hit . . . 98765 Web . . . Google search Google Web engine . . . The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 26 Architektur Crawler Repository Indexer Barrels Sorter • Reorganisation zu Inverted Index – sortiert nach wordID The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 27 Inverted Index wordID anatomy search docID 1234 1234 98765 1234 98765 98765 98765 1234 Hit engine Google Web 1234 98765 The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 28 Architektur Crawler Repository Indexer Barrels PageRank Links Sorter • Auswertung der Link-Struktur – Berechnung des PageRanks aller Webseiten – neue Links crawlen The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 29 Suchanfragen • One-Word-Query – Suchen von Hit List im Inverted Index • Lexicon (in-memory) The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 30 Lexicon wordID anatomy search docID 1234 1234 98765 1234 98765 98765 98765 1234 Hit … … … … … … … … … … wordID anatomy search engine Google Web engine Google Web 1234 98765 Lexicon (in-memory) Inverted Index The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 31 Suchanfragen • One-Word-Query – Suchen von Hit List im Inverted Index • Lexicon (in-memory) – Gewichtung • Informationen in den Hit Listen • Anzahl der Hits pro Dokument • PageRank • Multi-Word-Query – zusätzlich Betrachtung von Proximität • besser, wenn Wörter nahe beieinander sind The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 32 Future Work • Query Caching • Kontrolliertes Re-Crawling – statistische Informationen über Updates von Webseiten • Stemming • Boolsche Operatoren in Suchanfragen • Zusammenfassung der Ergebnisse • 2000 – 1 000 000 000 Webseiten indiziert – Internationalisierung – Ads • 2004 – Börsengang The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 33 Fazit • Primärziel sind hochqualitative Suchergebnisse im immer größer werdenden Web • Priorisierung der Suchergebnisse durch Auswertung von Hypertext-Informationen – PageRank – Anchor Text – etc. • Architektur von vornherein auf Skalierbarkeit ausgelegt The Anatomy of a Large-Scale Hypertextual Web Search Engine | Markus Güntert | 20.01.2009 34 </div><br /> <!-- google_ad_section_end --> <!--Start SearchLight--> <script type='text/javascript'> aster_cloud_id = '2422251'; aster_cloud_format = '728x90'; </script> <script type='text/javascript'> document.write(unescape("%3Cscript src='http://web.asterpix.com/media/js/searchlight.js' type='text/javascript'%3E%3C/script%3E")); </script> <!--End SearchLight--> </div> </div> </div> </div> <script type="text/javascript"> getEl('viewer-area').style.width = getEl('doc-info').clientWidth + 'px'; </script> <div class="rCol"> <div class="img-ad"><a href="/docs/AdTracker.ashx?adID=369&docID=5629645&url=http%3a%2f%2fwww.docstoc.com%2fstore" rel="nofollow" onclick="adClick('Docstoc, Inc. [en] ','DocStore doc banner - Generic','5629645')" ><img src="http://i.docstoc.com/ads/DocStore_AD_page (generic).jpg" alt=""></a></div> <div class="adword" id="ad-spot3"> </div> <!-- google_ad_section_start --> <div class="docs-box-header">Related docs</div><div class="docs-box"> <div class="doc"> <a href="/docs/9655228/Anatomy-And-Physiology-Topics" class="thumb" title="9655228"></a> <div class="titleStats"><a class="title" href="/docs/9655228/Anatomy-And-Physiology-Topics">Anatomy And Physiology Topics</a> <div class="stats"> Views: 73  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5623984/Advanced-Topics-in-Databases" class="thumb" title="5623984"></a> <div class="titleStats"><a class="title" href="/docs/5623984/Advanced-Topics-in-Databases">Advanced Topics in Databases</a> <div class="stats"> Views: 10  |  Downloads: 1</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5632949/Advanced-topics-in-databases-Introduction-to-Prolog-as-a" class="thumb" title="5632949"></a> <div class="titleStats"><a class="title" href="/docs/5632949/Advanced-topics-in-databases-Introduction-to-Prolog-as-a">Advanced topics in databases Introduction to Prolog as a</a> <div class="stats"> Views: 12  |  Downloads: 7</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5613428/Interfaces-for-Geographical-Databases-Presented-for-Advanced-Research-Topics" class="thumb" title="5613428"></a> <div class="titleStats"><a class="title" href="/docs/5613428/Interfaces-for-Geographical-Databases-Presented-for-Advanced-Research-Topics">Interfaces for Geographical Databases Presented for Advanced Research Topics</a> <div class="stats"> Views: 0  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/15095363/Advanced-Topics" class="thumb" title="15095363"></a> <div class="titleStats"><a class="title" href="/docs/15095363/Advanced-Topics">Advanced Topics</a> <div class="stats"> Views: 0  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5614530/IC-Advanced-research-topics-in-databases-Computer-Communication-and" class="thumb" title="5614530"></a> <div class="titleStats"><a class="title" href="/docs/5614530/IC-Advanced-research-topics-in-databases-Computer-Communication-and">IC Advanced research topics in databases Computer Communication and</a> <div class="stats"> Views: 3  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5624601/CIS750-–-Seminar-in-Advanced-Topics-in-Computer-Science" class="thumb" title="5624601"></a> <div class="titleStats"><a class="title" href="/docs/5624601/CIS750-–-Seminar-in-Advanced-Topics-in-Computer-Science">CIS750 – Seminar in Advanced Topics in Computer Science</a> <div class="stats"> Views: 12  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10287169/At-Large" class="thumb" title="10287169"></a> <div class="titleStats"><a class="title" href="/docs/10287169/At-Large">At Large</a> <div class="stats"> Views: 12  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/7641727/COMP3420-Advanced-Databases-and-Data-Mining" class="thumb" title="7641727"></a> <div class="titleStats"><a class="title" href="/docs/7641727/COMP3420-Advanced-Databases-and-Data-Mining">COMP3420 Advanced Databases and Data Mining</a> <div class="stats"> Views: 7  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/8197580/Anatomy-of-a-Report" class="thumb" title="8197580"></a> <div class="titleStats"><a class="title" href="/docs/8197580/Anatomy-of-a-Report">Anatomy of a Report</a> <div class="stats"> Views: 20  |  Downloads: 1</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/16841131/Anatomy-of-a-Database-System" class="thumb" title="16841131"></a> <div class="titleStats"><a class="title" href="/docs/16841131/Anatomy-of-a-Database-System">Anatomy of a Database System</a> <div class="stats"> Views: 0  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/7774674/Plant-Anatomy" class="thumb" title="7774674"></a> <div class="titleStats"><a class="title" href="/docs/7774674/Plant-Anatomy">Plant Anatomy </a> <div class="stats"> Views: 76  |  Downloads: 7</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/5613984/Databases-and-backups" class="thumb" title="5613984"></a> <div class="titleStats"><a class="title" href="/docs/5613984/Databases-and-backups">Databases and backups</a> <div class="stats"> Views: 16  |  Downloads: 1</div></div> <div class="clr"></div> </div> </div> <!-- google_ad_section_end --> <style type="text/css"> .window-premium-docs {background-color:#f2f2f2; border:solid 1px #b8b8b8; border-top:none; padding:3px 10px; } .window-premium-docs a {color:#04405e; font-size:13px; font-weight:bold;float:left; width:180px;clear:left;margin-bottom:10px; padding-right:10px;} .window-premium-docs b {color:#000; font-size:12px; font-weight:normal; display:block; float:left; clear:right;} </style> <div><img src="/i/header-premium-docs-250.png" alt="premium docs" /></div> <div class="window-premium-docs"> <a href="/docs/9550412/" onclick="adClick('Paid-Feat', 'Website Development Agreement', '5629645')">Website Development Agreement</a><b>$19.95</b><br /> <a href="/docs/10391193/" onclick="adClick('Paid-Feat', 'Website Design Non Disclosure', '5629645')">Website Design Non Disclosure</a><b>$14.95</b><br /> <a href="/docs/10391192/" onclick="adClick('Paid-Feat', 'Web Hosting Agreement', '5629645')">Web Hosting Agreement</a><b>$14.95</b><br /> <a href="/docs/11760281/" onclick="adClick('Paid-Feat', 'Website Vendor Evaluation Matrix', '5629645')">Website Vendor Evaluation Matrix</a><b>$19.95</b><br /> <a href="/docs/11760274/" onclick="adClick('Paid-Feat', 'Website Design RFP Template', '5629645')">Website Design RFP Template</a><b>$49.95</b><br /> <a href="/docs/11760266/" onclick="adClick('Paid-Feat', 'Web Audit Report Tool', '5629645')">Web Audit Report Tool</a><b>$19.95</b><br /> <a href="/docs/11760268/" onclick="adClick('Paid-Feat', 'Web Metrics Reporting Tool', '5629645')">Web Metrics Reporting Tool</a><b>$19.95</b> <br /> </div> <div class="docs-box-header">Other docs by <a href="/profile/oneforseven">One Seven</a></div><div class="docs-box"> <div class="doc"> <a href="/docs/9571/Sample-Market-Analysis-Mindshaker" class="thumb" title="9571"></a> <div class="titleStats"><a class="title" href="/docs/9571/Sample-Market-Analysis-Mindshaker">Sample Market Analysis Mindshaker</a> <div class="stats"> Views: 351  |  Downloads: 16</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10269/President-George-Washingtons-Farewell-Address-_1796_-Image-1" class="thumb" title="10269"></a> <div class="titleStats"><a class="title" href="/docs/10269/President-George-Washingtons-Farewell-Address-_1796_-Image-1">President George Washington's Farewell Address _1796_ Image 1</a> <div class="stats"> Views: 134  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10239/President-Franklin-Roosevelts-Annual-Message-_Four-Freedoms_-to-Congress-_1941_---2" class="thumb" title="10239"></a> <div class="titleStats"><a class="title" href="/docs/10239/President-Franklin-Roosevelts-Annual-Message-_Four-Freedoms_-to-Congress-_1941_---2">President Franklin Roosevelt's Annual Message _Four Freedoms_ to Congress _1941_ - 2</a> <div class="stats"> Views: 69  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9745/STAIRWAYS-AND-LADDERS-A-GUIDE-TO-OSHA-RULES" class="thumb" title="9745"></a> <div class="titleStats"><a class="title" href="/docs/9745/STAIRWAYS-AND-LADDERS-A-GUIDE-TO-OSHA-RULES">STAIRWAYS AND LADDERS A GUIDE TO OSHA RULES</a> <div class="stats"> Views: 369  |  Downloads: 2</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9108/Articles-of-Agreement-Relating-to-the-Surrender-of-the-Army-of-Northern-Virginia-_1865_---1" class="thumb" title="9108"></a> <div class="titleStats"><a class="title" href="/docs/9108/Articles-of-Agreement-Relating-to-the-Surrender-of-the-Army-of-Northern-Virginia-_1865_---1">Articles of Agreement Relating to the Surrender of the Army of Northern Virginia _1865_ - 1</a> <div class="stats"> Views: 186  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9462/SCHEDULE-H-CODEBTORS" class="thumb" title="9462"></a> <div class="titleStats"><a class="title" href="/docs/9462/SCHEDULE-H-CODEBTORS">SCHEDULE H CODEBTORS</a> <div class="stats"> Views: 107  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10274/FORM-4255-RECAPTURE-OF-INVESTMENT-CREDIT" class="thumb" title="10274"></a> <div class="titleStats"><a class="title" href="/docs/10274/FORM-4255-RECAPTURE-OF-INVESTMENT-CREDIT">FORM 4255 RECAPTURE OF INVESTMENT CREDIT</a> <div class="stats"> Views: 123  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9649/OSHA-METHYLENE-CHLORIDE" class="thumb" title="9649"></a> <div class="titleStats"><a class="title" href="/docs/9649/OSHA-METHYLENE-CHLORIDE">OSHA METHYLENE CHLORIDE</a> <div class="stats"> Views: 169  |  Downloads: 3</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10160/FORM-5754-STATEMENT-BY-PERSON-S-RECEIVING-GAMBLING-WINNINGS" class="thumb" title="10160"></a> <div class="titleStats"><a class="title" href="/docs/10160/FORM-5754-STATEMENT-BY-PERSON-S-RECEIVING-GAMBLING-WINNINGS">FORM 5754 STATEMENT BY PERSON S RECEIVING GAMBLING WINNINGS</a> <div class="stats"> Views: 132  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10242/FORM-5498-IRA-CONTRIBUTION-INFORMATION" class="thumb" title="10242"></a> <div class="titleStats"><a class="title" href="/docs/10242/FORM-5498-IRA-CONTRIBUTION-INFORMATION">FORM 5498 IRA CONTRIBUTION INFORMATION</a> <div class="stats"> Views: 256  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9608/Sample-Target-Market-VeriType" class="thumb" title="9608"></a> <div class="titleStats"><a class="title" href="/docs/9608/Sample-Target-Market-VeriType">Sample Target Market VeriType</a> <div class="stats"> Views: 497  |  Downloads: 4</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9871/FORMA-943PR-PLANILLA-PARA-LA-DECLARACION-ANUAL-DE-LA-CONTRIBUCION-FEDERAL-DEL-PATRONO-DE-EMPLEADOS-AGRICOLAS" class="thumb" title="9871"></a> <div class="titleStats"><a class="title" href="/docs/9871/FORMA-943PR-PLANILLA-PARA-LA-DECLARACION-ANUAL-DE-LA-CONTRIBUCION-FEDERAL-DEL-PATRONO-DE-EMPLEADOS-AGRICOLAS">FORMA 943PR PLANILLA PARA LA DECLARACION ANUAL DE LA CONTRIBUCION FEDERAL DEL PATRONO DE EMPLEADOS AGRICOLAS</a> <div class="stats"> Views: 1092  |  Downloads: 1</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9718/Sample-Operations-VeriType" class="thumb" title="9718"></a> <div class="titleStats"><a class="title" href="/docs/9718/Sample-Operations-VeriType">Sample Operations VeriType</a> <div class="stats"> Views: 253  |  Downloads: 1</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/10135/FORM-5500-SCHEDULE-I-FINANCIAL-INFORMATION-SMALL-PLAN-2005" class="thumb" title="10135"></a> <div class="titleStats"><a class="title" href="/docs/10135/FORM-5500-SCHEDULE-I-FINANCIAL-INFORMATION-SMALL-PLAN-2005">FORM 5500 SCHEDULE I FINANCIAL INFORMATION SMALL PLAN 2005</a> <div class="stats"> Views: 102  |  Downloads: 0</div></div> <div class="clr"></div> </div> <div class="doc"> <a href="/docs/9819/Federal-Income-Taxation-University-of-Texas" class="thumb" title="9819"></a> <div class="titleStats"><a class="title" href="/docs/9819/Federal-Income-Taxation-University-of-Texas">Federal Income Taxation University of Texas</a> <div class="stats"> Views: 279  |  Downloads: 7</div></div> <div class="clr"></div> </div> </div> </div> </div> </div> <div class="cls"></div><br /> <!-- google_ad_section_start(weight=ignore) --> <div id="footer-wrap"> <div id="footer"> <div class="l-cap"></div> <div class="inner"> <dl> <dt>About:</dt> <dd><a href="/about/" rel="nofollow">What is Docstoc?</a> | <a href="javascript:void window.open('/popterm.aspx?page_id=15','TC','width=800,height=500,menubar=0,location=0,resizable=1,scrollbars=1,status=0')" rel="nofollow">Docstoc Terms of Service</a> | <a href="javascript:void window.open('/popterm.aspx?page_id=102','TC','width=800,height=500,menubar=0,location=0,resizable=1,scrollbars=1,status=0')" rel="nofollow">DocStore Terms of Service</a> | <a href="javascript:void window.open('/popterm.aspx?page_id=22','TC','width=800,height=500,menubar=0,location=0,resizable=1,scrollbars=1,status=0')" rel="nofollow">Privacy Policy</a> | <a href="/FAQ/" rel="nofollow">FAQs</a></dd> </dl> <dl> <dt>Links:</dt> <dd><a href="http://blog.docstoc.com/" onclick="outbound('blog.docstoc.com')">Docstoc Blog</a> | <a href="/pro/" rel="nofollow">Join Pro</a> | <a href="/requests/" rel="nofollow">Requests</a> | <a href="/docsters/">Docsters</a> | <a href="/upload/" rel="nofollow">Upload</a> | <a href="/User/cmsstatic.aspx?page_id=100" rel="nofollow">DMCA Guidelines</a> | <a href="/DMCA/" rel="nofollow">DMCA Notification</a> | <a href="http://platform.docstoc.com/" rel="nofollow">Docstoc API</a> | <a href="/feeds/rssfeed.aspx" rel="nofollow">RSS Feeds</a> <a href="/feeds/rssfeed.aspx" rel="nofollow"><img src="/i/ico/rss-sm.gif" alt="RSS Feeds" /></a></dd> </dl> <dl> <dt>Contact:</dt> <dd><a href="/contact/" rel="nofollow">Contact Us</a> | <a href="http://docstoc.suggestionbox.com/" rel="nofollow">Suggest Features</a> | <a href="/jobs/" rel="nofollow">Join Our Team</a></dd> </dl> <dl> <dt>Share:</dt> <dd><a href="/Embed-Documents/">Embed Documents</a> | <a href="/OneClick/">Docstoc OneClick</a> | <a href="/Sync/">Docstoc Sync</a> </dd> </dl> </div> <div class="r-cap"></div> <div class="copyright">© Docstoc 2009. All rights reserved.</div> </div> </div> <!-- google_ad_section_end --> <br /> </div> <script type="text/javascript"> google_spot_id='google-ad-pos6'; google_ad_client='pub-7343870454215956'; google_ad_output='js'; google_max_num_ads='3'; google_ad_type='text'; google_ad_channel='2191561729+4366728107+7426139644'; google_feedback='on'; googleAdStyle='v'; googleAdChannel='2191561729+4366728107+7426139644'; google_hints = 'merge excel spreadsheets to a database'; </script> <script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script> <script type="text/javascript"> google_spot_id='google-ad-pos1'; google_ad_client='pub-7343870454215956'; google_ad_output='js'; google_max_num_ads='3'; google_ad_type='text_image_flash'; google_ad_channel='5056562550+4366728107+0606741701'; google_feedback='on'; googleAdStyle='728x90'; googleAdChannel='5056562550+4366728107+0606741701'; google_hints = 'merge excel spreadsheets to a database'; google_image_size='728x90'; </script> <script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script> <script type="text/javascript"> google_spot_id = 'ad-spot3' google_ad_client = 'pub-7343870454215956'; google_ad_output = 'js'; google_max_num_ads = '5'; google_ad_type = 'text_html'; google_ad_channel ="4366728107+3710666396+3243318210" google_feedback = 'on'; googleAdStyle = 'v'; </script> <script type="text/javascript" src="http://pagead2.googlesyndication.com/pagead/show_ads.js"></script> <script src="http://www.google-analytics.com/ga.js" type="text/javascript"></script> <script type="text/javascript"> var pageTracker =''; if (_gat && _gat!=null && _gat!=undefined){ pageTracker = _gat._getTracker("UA-1289929-1"); pageTracker._trackPageview(); } </script> <script type="text/javascript"> recordEndTime(); makeDD('ddTarget', 'ddDocuments', 'top'); makeDD('ddTargetMyDocs', 'ddMyDocs', 'top'); makeDD('ddTargetMyColl', 'ddMyColls', 'top');</script> <script type="text/javascript"> //<![CDATA[ var CurrentUsername = ''; var CurrentUserID = '-10'; DocumentID = 5629645;DocumentUrl='http://www.docstoc.com/docs/5629645/Advanced-Topics-in-Databases-The-Anatomy-of-a-Large';DocumentOwnerID='588974';DocumentOwnerUsername='oneforseven';DocumentFilename='Advanced_Topics_in_Databases_The_Anatomy_of_a_Large_Scale.pdf';DocumentType='pdf';DocumentTitle='Advanced Topics in Databases The Anatomy of a Large';DocumentCategoryID=4;DocumentSubCategoryID=162;DocumentError=0;DocumentRotation=0;DocumentSubSubCategoryID=0;adView('Docstoc, Inc. [en] ','DocStore doc banner - Generic','5629645'); //]]> </script> </form> </body> </html>