The Community and the Crowd: Multimedia Benchmark Dataset Development
Found in: IEEE MultiMedia
By Martha Larson, Mohammad Soleymani, Maria Eskevich, Pavel Serdyukov, Roeland Ordelman, Gareth Jones
Issue Date:July 2012
pp. 15-23
The MediaEval Multimedia Benchmark leveraged community cooperation and crowdsourcing to develop a large Internet video dataset for its Genre Tagging and Rich Speech Retrieval tasks.
 
Through-the-looking glass: utilizing rich post-search trail statistics for web search
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Alexey Tolstikov, Mikhail Shakhray, Gleb Gusev, Pavel Serdyukov
Issue Date:October 2013
pp. 1897-1900
With increasing popularity of browser toolbars, the challenge of employing user behavior data stored in their logs rises in its importance. The analysis of post-click search trails was shown to provide important knowledge about user experience, helpful for...
     
Predicting the impact of expansion terms using semantic and user interaction features
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Anton Bakhtin, Yury Ustinovskiy, Pavel Serdyukov
Issue Date:October 2013
pp. 1825-1828
Query expansion for Information Retrieval is a challenging task. On the one hand, low quality expansion may hurt either recall, due to vocabulary mismatch, or precision, due to topic drift, and therefore reduce user satisfaction. On the other hand, utilizi...
     
Speller performance prediction for query autocorrection
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Alexey Baytin, Marina Panina, Pavel Serdyukov, Irina Galinskaya
Issue Date:October 2013
pp. 1821-1824
Query speller is an indispensable part of any modern search engine. In this paper we define the problem of speller performance prediction and apply it to the task of query spelling autocorrection. As candidates for query autocorrection we used the suggesti...
     
Modeling clicks beyond the first result page
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Maarten de Rijke, Aleksandr Chuklin, Pavel Serdyukov
Issue Date:October 2013
pp. 1217-1220
Most modern web search engines yield a list of documents of a fixed length (usually 10) in response to a user query. The next ten search results are usually available in one click. These documents either replace the current result page or are appended to t...
     
Evaluating aggregated search using interleaving
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Aleksandr Chuklin, Anne Schuth, Pavel Serdyukov, Katja Hofmann, Maarten de Rijke
Issue Date:October 2013
pp. 669-678
A result page of a modern web search engine is often much more complicated than a simple list of "ten blue links." In particular, a search engine may combine results from different sources (e.g., Web, News, and Images), and display these as grouped results...
     
Using historical click data to increase interleaving sensitivity
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Craig Macdonald, Pavel Serdyukov, Eugene Kharitonov, Iadh Ounis
Issue Date:October 2013
pp. 679-688
Interleaving is an online evaluation method to compare two alternative ranking functions based on the users' implicit feedback. In an interleaving experiment, the results from two ranking functions are merged in a single result list and presented to the us...
     
Personalization of web-search using short-term browsing context
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Pavel Serdyukov, Yury Ustinovskiy
Issue Date:October 2013
pp. 1979-1988
Search and browsing activity is known to be a valuable source of information about user's search intent. It is extensively utilized by most of modern search engines to improve ranking by constructing certain ranking features as well as by personalizing sea...
     
Intent models for contextualising and diversifying query suggestions
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Craig Macdonald, Eugene Kharitonov, Pavel Serdyukov, Iadh Ounis
Issue Date:October 2013
pp. 2303-2308
The query suggestion or auto-completion mechanisms help users to type less while interacting with a search engine. A basic approach that ranks suggestions according to their frequency in query logs is suboptimal. Firstly, many candidate queries with the sa...
     
Timely crawling of high-quality ephemeral new content
Found in: Proceedings of the 22nd ACM international conference on information & knowledge management (CIKM '13)
By Damien Lefortier, Egor Samosvat, Liudmila Ostroumova, Pavel Serdyukov
Issue Date:October 2013
pp. 745-750
In this paper, we study the problem of timely finding and crawling of ephemeral new pages, i.e., for which user traffic grows really quickly right after they appear, but lasts only for several days (e.g., news, blog and forum posts). Traditional c...
     
Fresh BrowseRank
Found in: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '13)
By Andrei Khropov, Gleb Gusev, Maxim Zhukovskiy, Pavel Serdyukov
Issue Date:July 2013
pp. 1029-1032
In the last years, a lot of attention was attracted by the problem of page authority computation based on user browsing behavior. However, the proposed methods have a number of limitations. In particular, they run on a single snapshot of a user browsing gr...
     
Click model-based information retrieval metrics
Found in: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '13)
By Aleksandr Chuklin, Maarten de Rijke, Pavel Serdyukov
Issue Date:July 2013
pp. 493-502
In recent years many models have been proposed that are aimed at predicting clicks of web search users. In addition, some information retrieval evaluation metrics have been built on top of a user model. In this paper we bring these two directions together ...
     
User model-based metrics for offline query suggestion evaluation
Found in: Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '13)
By Craig Macdonald, Eugene Kharitonov, Iadh Ounis, Pavel Serdyukov
Issue Date:July 2013
pp. 633-642
Query suggestion or auto-completion mechanisms are widely used by search engines and are increasingly attracting interest from the research community. However, the lack of commonly accepted evaluation methodology and metrics means that it is not possible t...
     
Nontrivial landmark recommendation using geotagged photos
Found in: ACM Transactions on Intelligent Systems and Technology (TIST)
By Alan Hanjalic, Martha Larson, Pavel Serdyukov, Yue Shi
Issue Date:June 2013
pp. 1-27
Online photo-sharing sites provide a wealth of information about user behavior and their potential is increasing as it becomes ever-more common for images to be associated with location information in the form of geotags. In this article, we propose a nove...
     
Recency-sensitive model of web page authority
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Andrei Raigorodskii, Dmitry Vinogradov, Gleb Gusev, Maxim Zhukovskiy, Pavel Serdyukov
Issue Date:October 2012
pp. 2627-2630
Traditional link-based web ranking algorithms run on a single web snapshot without concern of the dynamics of web pages and links. In particular, the correlation of web pages freshness and their classic PageRank is negative (see [11]). For this reason, in ...
     
Session-based query performance prediction
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Andrey Kustarev, Anna Mazur, Pavel Serdyukov, Yury Ustinovskiy
Issue Date:October 2012
pp. 2563-2566
Search sessions are known to be a rich source of diverse valuable information for individual query analysis. In this paper, we address the problem of query performance prediction by utilizing the entire logical search sessions containing the given query. G...
     
Demographic context in web search re-ranking
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Eugene Kharitonov, Pavel Serdyukov
Issue Date:October 2012
pp. 2555-2558
In this paper we study usefulness of user's demographical context for improving ranking of ambiguous queries. Context-aware relevance model is learnt from implicit user behaviour by using a simple yet general modification of a state-of-art click model whic...
     
Prediction of retweet cascade size over time
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Alexey Umnov, Andrey Kupavskii, Andrey Kustarev, Gleb Gusev, Liudmila Ostroumova, Pavel Serdyukov, Svyatoslav Usachev
Issue Date:October 2012
pp. 2335-2338
Retweet cascades play an essential role in information diffusion in Twitter. Popular tweets reflect the current trends in Twitter, while Twitter itself is one of the most important online media. Thus, understanding the reasons why a tweet becomes popular i...
     
Query recommendation for children
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Djoerd Hiemstra, Ingmar Weber, Pavel Serdyukov, Sergio Duarte Torres
Issue Date:October 2012
pp. 2010-2014
One of the biggest problems that children experience while searching the web occurs during the query formulation process. Children have been found to struggle formulating queries based on keywords given their limited vocabulary and their difficulty to choo...
     
Empirical validation of the Buckley-Osthus model for the web host graph: degree and edge distributions
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Andrei Raigorodskii, Dmitry Vinogradov, Evgeniy Grechnikov, Gleb Gusev, Liudmila Ostroumova, Maxim Zhukovskiy, Pavel Serdyukov, Yuri Pritykin
Issue Date:October 2012
pp. 1577-1581
We consider the Buckley-Osthus implementation of preferential attachment and its ability to model the web host graph in two aspects. One is the degree distribution that we observe to follow the power law, as often being the case for real-world graphs. Anot...
     
Optimizing parameters of the expected reciprocal rank
Found in: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '12)
By Pavel Serdyukov, Yury Logachev
Issue Date:August 2012
pp. 1123-1124
Most popular IR metrics are parameterized. Usually parameters of these metrics are chosen on the basis of general considerations and not adjusted by experiments with real users. Particularly, the parameters of the Expected Reciprocal Rank measure are the n...
     
How query extensions reflect search result abandonments
Found in: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '12)
By Aleksandr Chuklin, Pavel Serdyukov
Issue Date:August 2012
pp. 1087-1088
It is often considered that high abandonment rate corresponds to poor IR system performance. However several studies suggested that there are so called good abandonments, i.e. situations when search engine result page contains enough details to satisfy the...
     
Gender-aware re-ranking
Found in: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval (SIGIR '12)
By Eugene Kharitonov, Pavel Serdyukov
Issue Date:August 2012
pp. 1081-1082
In this paper we study usefulness of users' gender information for improving ranking of ambiguous queries in personalized and non-contextual settings. This study is performed as a sequence of offline re-ranking experiments and it demonstrates that the prop...
     
WSCD 2012: workshop on web search click data 2012
Found in: Proceedings of the fifth ACM international conference on Web search and data mining (WSDM '12)
By Nick Craswell, Georges Dupret, Pavel Serdyukov
Issue Date:February 2012
pp. 771-772
WSCD2012 is the second workshop on Web Search Click Data, following WSCD2009. It is a forum for new research relating to Web search usage logs and for discussing desirable properties of publicly released search log datasets. Research relating to search log...
     
The where in the tweet
Found in: Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM '11)
By Arjen P. de Vries, Carsten Eickhoff, Martha Larson, Pavel Serdyukov, Wen Li
Issue Date:October 2011
pp. 2473-2476
Twitter is a widely-used social networking service which enables its users to post text-based messages, so-called tweets. POI tags on tweets can show more human-readable high-level information about a place rather than just a pair of coordinates. In this p...
     
Smoothing NDCG metrics using tied scores
Found in: Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM '11)
By Andrey Kustarev, Evgeny Grechnikov, Ilya Segalovich, Pavel Serdyukov, Yury Logachev, Yury Ustinovsky
Issue Date:October 2011
pp. 2053-2056
One of promising directions in research on learning to rank concerns the problem of appropriate choice of the objective function to maximize by means of machine learning algorithms. We describe a novel technique of smoothing an arbitrary ranking metric and...
     
Recency ranking by diversification of result set
Found in: Proceedings of the 20th ACM international conference on Information and knowledge management (CIKM '11)
By Andrey Styskin, Fedor Romanenko, Fedor Vorobyev, Pavel Serdyukov
Issue Date:October 2011
pp. 1949-1952
In this paper, we propose a web search retrieval approach which automatically detects recency sensitive queries and increases the freshness of the ordinary document ranking by a degree proportional to the probability of the need in recent content. We propo...
     
Web page classification on child suitability
Found in: Proceedings of the 19th ACM international conference on Information and knowledge management (CIKM '10)
By Arjen P. de Vries, Carsten Eickhoff, Pavel Serdyukov
Issue Date:October 2010
pp. 1425-1428
Children spend significant amounts of time on the Internet. Recent studies showed, that during these periods they are often not under adult supervision. This work presents an automatic approach to identifying suitable web pages for children based on topica...
     
Entity ranking using Wikipedia as a pivot
Found in: Proceedings of the 19th ACM international conference on Information and knowledge management (CIKM '10)
By Arjen De Vries, Jaap Kamps, Pavel Serdyukov, Rianne Kaptein
Issue Date:October 2010
pp. 69-78
In this paper we investigate the task of Entity Ranking on the Web. Searchers looking for entities are arguably better served by presenting a ranked list of entities directly, rather than a list of web pages with relevant but also potentially redundant inf...
     
Using Flickr geotags to predict user travel behaviour
Found in: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval (SIGIR '10)
By Arjen P. de Vries, Maarten Clements, Marcel J.T. Reinders, Pavel Serdyukov
Issue Date:July 2010
pp. 851-852
We propose a method to predict a user's favourite locations in a city, based on his Flickr geotags in other cities. We define a similarity between the geotag distributions of two users based on a Gaussian kernel convolution. The geotags of the most similar...
     
Query log analysis in the context of information retrieval for children
Found in: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval (SIGIR '10)
By Djoerd Hiemstra, Pavel Serdyukov, Sergio Duarte Torres
Issue Date:July 2010
pp. 847-848
In this paper we analyze queries and sessions intended to satisfy children's information needs using a large-scale query log. The aim of this analysis is twofold: i) To identify differences between such queries and sessions, and general queries and session...
     
Linking Wikipedia to the web
Found in: Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval (SIGIR '10)
By Jaap Kamps, Pavel Serdyukov, Rianne Kaptein
Issue Date:July 2010
pp. 839-840
We investigate the task of finding links from Wikipedia pages to external web pages. Such external links significantly extend the information in Wikipedia with information from the Web at large, while retaining the encyclopedic organization of Wikipedia. W...
     
Placing Flickr photos on a map
Found in: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval (SIGIR '09)
By Pavel Serdyukov, Roelof van Zwol, Vanessa Murdock
Issue Date:July 2009
pp. 435-435
In this paper we investigate generic methods for placing photos uploaded to Flickr on the World map. As primary input for our methods we use the textual annotations provided by the users to predict the single most probable location where the image was take...
     
Modeling multi-step relevance propagation for expert finding
Found in: Proceeding of the 17th ACM conference on Information and knowledge management (CIKM '08)
By Djoerd Hiemstra, Henning Rode, Pavel Serdyukov
Issue Date:October 2008
pp. 1001-1001
An expert finding system allows a user to type a simple text query and retrieve names and contact information of individuals that possess the expertise expressed in the query. This paper proposes a novel approach to expert finding in large enterprises or i...
     
The search for expertise: to the documents and beyond
Found in: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '08)
By Pavel Serdyukov
Issue Date:July 2008
pp. 2-2
Natural language processing techniques are believed to hold a tremendous potential to supplement the purely quantitative methods of text information retrieval. This has led to the emergence of a large number of NLP-based IR research projects over the last ...
     
Combining document- and paragraph-based entity ranking
Found in: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '08)
By Djoerd Hiemstra, Henning Rode, Pavel Serdyukov
Issue Date:July 2008
pp. 2-2
We study entity ranking on the INEX entity track and propose a simple graph-based ranking approach that enables to combine scores on document and paragraph level. The combined approach improves the retrieval results not only on the INEX testset, but simila...
     
Modeling expert finding as an absorbing random walk
Found in: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '08)
By Djoerd Hiemstra, Henning Rode, Pavel Serdyukov
Issue Date:July 2008
pp. 2-2
We introduce a novel approach to expert finding based on multi-step relevance propagation from documents to related candidates. Relevance propagation is modeled with an absorbing random walk. The evaluation on the two official Enterprise TREC data sets dem...
     
Exploiting sequential dependencies for expert finding
Found in: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '08)
By Djoerd Hiemstra, Henning Rode, Pavel Serdyukov
Issue Date:July 2008
pp. 2-2
We propose an expert finding method based on assumption of sequential dependence between a candidate expert and the query terms in the scope of a document. We assume that the strength of relation of a candidate to the document's content depends on its posi...
     
Generative modeling of persons and documents for expert search
Found in: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '07)
By Djoerd Hiemstra, Maarten Fokkinga, Pavel Serdyukov, Peter M. G. Apers
Issue Date:July 2007
pp. 827-828
In this paper we address the task of automatically finding an expert within the organization, known as the expert search problem. We present the theoretically-based probabilistic algorithm which models retrieved documents as mixtures of expert candidate la...
     