Issue No. 08 - August (2010 vol. 22)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2009.145
Shikui Wei , Beijing Jiaotong University, Beijing
Yao Zhao , Beijing Jiaotong University, Beijing
Zhenfeng Zhu , Beijing Jiaotong University, Beijing
Nan Liu , Beijing Jiaotong University, Beijing
Analysis on click-through data from a very large search engine log shows that users are usually interested in the top-ranked portion of returned search results. Therefore, it is crucial for search engines to achieve high accuracy on the top-ranked documents. While many methods exist for boosting video search performance, they either pay less attention to the above factor or encounter difficulties in practical applications. In this paper, we present a flexible and effective reranking method, called CR-Reranking, to improve the retrieval effectiveness. To offer high accuracy on the top-ranked results, CR-Reranking employs a cross-reference (CR) strategy to fuse multimodal cues. Specifically, multimodal features are first utilized separately to rerank the initial returned results at the cluster level, and then all the ranked clusters from different modalities are cooperatively used to infer the shots with high relevance. Experimental results show that the search quality, especially on the top-ranked results, is improved significantly.
Clustering, image/video retrieval, multimedia databases.
S. Wei, Y. Zhao, Z. Zhu and N. Liu, "Multimodal Fusion for Video Search Reranking," in IEEE Transactions on Knowledge & Data Engineering, vol. 22, no. , pp. 1191-1199, 2009.