In this paper, we propose a technique to retrieve the images using the ?search by similarity? method with the help of multimodal keywords. Multimodal keywords consist of low-level MPEG-7 color descriptors and textual keywords. The visual keywords and textual keywords are combined together and the image collection is represented as a matrix, which is similar in representation to a term-document matrix. Using LSI (latent semantic indexing), we demonstrate that the visual keywords, when combined with textual keywords can improve the retrieval results to a great degree.
Citation:
Rajeev Agrawal, William Grosky, Farshad Fotouhi, "Image Retrieval Using Multimodal Keywords," ism, pp.817-822, Eighth IEEE International Symposium on Multimedia (ISM'06), 2006