22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008)
Retrieval of Personal Web Documents by Extracting Subjective Expressions
March 25-March 28
ISBN: 978-0-7695-3096-3
This paper presents a method for gathering Japanese web documents that contain personal opinions. The method can serve as a pre-processing step for opinion-mining applications. To find personal documents on the Web, we focus on four kinds of subjective expressions: (1) negative-meaning expressions, (2) final particles, (3) interjections, and (4) specific symbols such as face marks. By measuring the frequencies of these subjective expressions in a document, the method classifies web documents as personal or impersonal. It also assigns each document a score that indicates the accuracy of its classification result. We experimentally confirmed the effectiveness of the proposed method using 1200 web documents. The results show that the precision and recall of the proposed classification are 0.70 and 0.87, respectively. In addition, we confirmed that personal documents can be easily obtained by gathering documents that are given high scores.
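The frequency-based classification described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the cue lists, the per-character scoring formula, the threshold of 0.05, and all function names below are assumptions chosen for illustration, and plain substring counting stands in for whatever linguistic analysis the paper actually uses.

```python
# Hypothetical cue lists, one per category named in the abstract.
# The paper's real lexicons are not given here; these are placeholders.
CUE_GROUPS = {
    "negative": ["ない", "嫌い", "最悪"],
    "final_particles": ["よね", "だね", "かな"],
    "interjections": ["ああ", "えっ", "うーん"],
    "symbols": ["(笑)", "(^_^)", "!?"],
}


def count_cues(text, cues):
    """Count non-overlapping substring occurrences of each cue in text."""
    return sum(text.count(cue) for cue in cues)


def subjectivity_score(text):
    """Return cue occurrences per character (an assumed scoring formula)."""
    if not text:
        return 0.0
    total = sum(count_cues(text, cues) for cues in CUE_GROUPS.values())
    return total / len(text)


def classify(text, threshold=0.05):
    """Label a document 'personal' or 'impersonal' and return its score.

    The threshold is an arbitrary illustrative value, not from the paper.
    """
    score = subjectivity_score(text)
    label = "personal" if score >= threshold else "impersonal"
    return label, score


def rank_by_score(documents):
    """Sort documents from highest to lowest score."""
    return sorted(documents, key=subjectivity_score, reverse=True)


if __name__ == "__main__":
    docs = [
        "本製品の仕様は以下の通りです。",
        "うーん、この映画は最悪だよね(笑)",
    ]
    for doc in rank_by_score(docs):
        print(classify(doc), doc)
```

Ranking by score, as in `rank_by_score`, mirrors the abstract's observation that personal documents can be collected by taking the highest-scoring ones. A realistic version would likely need a Japanese morphological analyzer so that, for example, final particles are counted only in sentence-final position.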
Index Terms:
WWW, Personal Web Pages, Subjective Expressions
Citation:
Takahiro Hayashi, Koji Abe, Rikio Onai, "Retrieval of Personal Web Documents by Extracting Subjective Expressions," ainaw, pp.1187-1192, 22nd International Conference on Advanced Information Networking and Applications - Workshops (aina workshops 2008), 2008