Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06)
Using image classification for biomedical literature retrieval
Hong Kong, China
December 18-December 22
ISBN: 0-7695-2702-7
The aim of literature retrieval is to find significant papers on a given topic. In previous publications, we examined the use of choosing these papers based on the pictures they include. To refine this approach we seek to employ picture classification to further narrow down the number of interesting pictures presented. This can be useful, for example, when looking for the results of specific experiments. The classification can also be useful as a data cleansing step, to omit all unnecessary pictures not used as a figure. We use a method originally designed to distinguish between photos and computer-generated pictures on the web. We show that this method can not only be used to distinguish between raw data and derived representation figures, we can also reliably eliminate non-figure pictures in the document, like text pages and logos. We tested this approach on two different data sets with different topics and different non-figure problems, both with satisfactory results.
Citation:
Brigitte Mathiak, Andreas Kupfer, Tatjana Scope, Britta Stormann, Silke Eckstein, "Using image classification for biomedical literature retrieval," icdmw, pp.185-189, Sixth IEEE International Conference on Data Mining - Workshops (ICDMW'06), 2006