Aug. 30, 2004 to Sept. 3, 2004
Marta Gatius , Technical University of Catalunya, Barcelona
Manuel Bertran , Technical University of Catalunya, Barcelona
Horacio Rodr?guez , Technical University of Catalunya, Barcelona
Web documents present new challenges to conventional Information Retrieval (IR) technologies. This paper describes how these challenges are faced in FameIR, a multilingual multimedia IR shell. In this shell Cross-Language IR (CLIR) and query expansion are performed using EuroWordNet (EWN), the best developed and most widely used lexical resource for several languages. Techniques to extract information from Web documents, Wrapper Generation (WG) techniques, are used to access a finer information granularity than the whole Web page. By combining IR and WG techniques with the use of EWN, FameIR provides a powerful facility to perform CLIR from multimedia Web documents.
Marta Gatius, Manuel Bertran, Horacio Rodr?guez, "Multilingual and Multimedia Information Retrieval from Web Documents", DEXA, 2004, 2012 23rd International Workshop on Database and Expert Systems Applications, 2012 23rd International Workshop on Database and Expert Systems Applications 2004, pp. 20-24, doi:10.1109/DEXA.2004.1333443