Search For:

Displaying 1-11 out of 11 total
Machine Learning for Information Architecture in a Large Governmental Website
Found in: Digital Libraries, Joint Conference on
By Miles Efron, Gary Marchionini, Jonathan Elsas, Junliang Zhang
Issue Date:June 2004
pp. 151-159
This paper describes ongoing research into the application of machine learning techniques for improving access to governmental information in complex digital libraries. Under the auspices of the GovStat Project, our goal is to identify a small number of se...
 
Link Attachment (Preferential and Otherwise) in Contributor-Run Digital Libraries
Found in: Digital Libraries, Joint Conference on
By Miles Efron, Donald Sizemore
Issue Date:May 2003
pp. 369
Ibiblio is digital library whose materials are submitted and maintained by volunteer contributors. This study analyzes the emergence of hyperlinked structures within the ibiblio collection. In the context of ibiblio, we analyze the suitablility of Barabasi...
 
Statistical Translation Language Model for Twitter Search
Found in: Proceedings of the 2013 Conference on the Theory of Information Retrieval (ICTIR '13)
By ChengXiang Zhai, Maryam Karimzadehgan, Miles Efron
Issue Date:September 2013
pp. 121-124
With the prevalence of social media applications, an increasing number of internet users are actively publishing text information on-line. This influx provides a wealth of text information on those users. Ranking in social media poses different challenges ...
     
SIGIR 2013 workshop on time aware information access (#TAIA2013)
Found in: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '13)
By Fernando Diaz, Kira Radinsky, Maarten de Rijke, Milad Shokouhi, Miles Efron, Susan Dumais
Issue Date:July 2013
pp. 1137-1137
Web content increasingly reflects the current state of the physical and social world, manifested both in traditional news media sources along with user-generated publishing sites such as Twitter, Foursquare, and Facebook. At the same time, web searching in...
     
Query representation for cross-temporal information retrieval
Found in: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '13)
By Miles Efron
Issue Date:July 2013
pp. 383-392
This paper addresses the problem of long-term language change in information retrieval (IR) systems. IR research has often ignored lexical drift. But in the emerging domain of massive digitized book collections, the risk of vocabulary mismatch due to langu...
     
Improving retrieval of short texts through document expansion
Found in: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '12)
By Katrina Fenlon, Miles Efron, Peter Organisciak
Issue Date:August 2012
pp. 911-920
Collections containing a large number of short documents are becoming increasingly common. As these collections grow in number and size, providing effective retrieval of brief texts presents a significant research problem. We propose a novel approach to im...
     
Estimation methods for ranking recent information
Found in: Proceedings of the 34th international ACM SIGIR conference on Research and development in Information (SIGIR '11)
By Gene Golovchinsky, Miles Efron
Issue Date:July 2011
pp. 495-504
Temporal aspects of documents can impact relevance for certain kinds of queries. In this paper, we build on earlier work of modeling temporal information. We propose an extension to the Query Likelihood Model that incorporates query-specific information to...
     
Hashtag retrieval in a microblogging environment
Found in: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval (SIGIR '10)
By Miles Efron
Issue Date:July 2010
pp. 787-788
Microblog services let users broadcast brief textual messages to people who "follow" their activity. Often these posts contain terms called hashtags, markers of a post's meaning, audience, etc. This poster treats the following problem: given a user's state...
     
Generative model-based metasearch for data fusion in information retrieval
Found in: Proceedings of the 2009 joint international conference on Digital libraries (JCDL '09)
By Miles Efron
Issue Date:June 2009
pp. 1-1
"Data fusion" refers to the problem in information retrieval (IR) where several lists of documents ranked against a query are to be merged into a single ranked list for presentation to a user. Data fusion is also known as "metasearch." In a digital library...
     
Model-averaged latent semantic indexing
Found in: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '07)
By Miles Efron
Issue Date:July 2007
pp. 755-756
This poster introduces a novel approach to information retrieval that uses statistical model averaging to improve latent semantic indexing (LSI). Instead of choosing a single dimensionality $k$ for LSI , we propose using several models of differing dimensi...
     
The liberal media and right-wing conspiracies: using cocitation information to estimate political orientation in web documents
Found in: Proceedings of the Thirteenth ACM conference on Information and knowledge management (CIKM '04)
By Miles Efron
Issue Date:November 2004
pp. 390-398
This paper introduces a simple method for estimating <i>cultural orientation</i>, the affiliation of online entities in a polarized field of discourse. In particular, cocitation information is used to estimate the political orientation of hypertext...
     
 1