Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 2
Identifying Story and Preview Images in News Web Pages
Edinburgh, Scotland
August 03-August 06
ISBN: 0-7695-1960-1
The World Wide Web provides an increasingly powerful and popular publication mechanism. Web documents often contain a large number of images serving various different purposes. This paper focuses on images that are associated with a story or preview to a story. Such images often accompany the key content on a web page, thus their identification is important for applications such as web page summarization and mobile access. We present a novel algorithm for automatic identification of story/preview images which combines features extracted from both the image itself and the surrounding text. The effectiveness of this algorithm is demonstrated by experimental results on over 1500 images collected from 25 news web sites.
Citation:
Jianying Hu, Amit Bagga, "Identifying Story and Preview Images in News Web Pages," icdar, vol. 2, pp.640, Seventh International Conference on Document Analysis and Recognition (ICDAR'03) - Volume 2, 2003