This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Search Engine Coverage of the OAI-PMH Corpus
March/April 2006 (vol. 10 no. 2)
pp. 66-73
Frank McCown, Old Dominion University
Xiaoming Liu, Los Alamos National Laboratory
Michael L. Nelson, Old Dominion University
Mohammad Zubair, Old Dominion University
Having indexed much of the "surface" Web, search engines are now using various approaches to index the "deep" Web. At the same time, institutional repositories and digital libraries are adopting the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to expose their holdings. The authors harvested nearly 10 million records from OAI-PMH repositories. From these records, they extracted 3.3 million unique resource URLs and then conducted searches on samples from this collection to determine how much of the OAI-PMH corpus the three major search engines have indexed.
Index Terms:
OAI PMH, search engines, indexing, deep web
Citation:
Frank McCown, Xiaoming Liu, Michael L. Nelson, Mohammad Zubair, "Search Engine Coverage of the OAI-PMH Corpus," IEEE Internet Computing, vol. 10, no. 2, pp. 66-73, March-April 2006, doi:10.1109/MIC.2006.41
Usage of this product signifies your acceptance of the Terms of Use.