The Community for Technology Leaders
2013 IEEE 13th International Conference on Data Mining Workshops (2006)
Hong Kong, China
Dec. 18, 2006 to Dec. 22, 2006
ISBN: 0-7695-2702-7
pp: 185-189
Andreas Kupfer , TU Braunschweig, Germany
Silke Eckstein , TU Braunschweig, Germany
Brigitte Mathiak , TU Braunschweig, Germany
Tatjana Scope , TU Braunschweig, Germany
Britta Stormann , TU Braunschweig, Germany
The aim of literature retrieval is to find significant papers on a given topic. In previous publications, we examined the use of choosing these papers based on the pictures they include. To refine this approach we seek to employ picture classification to further narrow down the number of interesting pictures presented. This can be useful, for example, when looking for the results of specific experiments. The classification can also be useful as a data cleansing step, to omit all unnecessary pictures not used as a figure. We use a method originally designed to distinguish between photos and computer-generated pictures on the web. We show that this method can not only be used to distinguish between raw data and derived representation figures, we can also reliably eliminate non-figure pictures in the document, like text pages and logos. We tested this approach on two different data sets with different topics and different non-figure problems, both with satisfactory results.
Andreas Kupfer, Silke Eckstein, Brigitte Mathiak, Tatjana Scope, Britta Stormann, "Using image classification for biomedical literature retrieval", 2013 IEEE 13th International Conference on Data Mining Workshops, vol. 00, no. , pp. 185-189, 2006, doi:10.1109/ICDMW.2006.168
78 ms
(Ver 3.3 (11022016))