|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery
Web Page Classification Based on a Least Square Support Vector Machine with Latent Semantic Analysis
October 18-October 20
ISBN: 978-0-7695-3305-6
| ASCII Text | x | ||
| Yong Zhang, Bin Fan, Long-bin Xiao, "Web Page Classification Based on a Least Square Support Vector Machine with Latent Semantic Analysis," Fuzzy Systems and Knowledge Discovery, Fourth International Conference on, vol. 2, pp. 528-532, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008. | |||
| BibTex | x | ||
| @article{ 10.1109/FSKD.2008.259, author = {Yong Zhang and Bin Fan and Long-bin Xiao}, title = {Web Page Classification Based on a Least Square Support Vector Machine with Latent Semantic Analysis}, journal ={Fuzzy Systems and Knowledge Discovery, Fourth International Conference on}, volume = {2}, year = {2008}, isbn = {978-0-7695-3305-6}, pages = {528-532}, doi = {http://doi.ieeecomputersociety.org/10.1109/FSKD.2008.259}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - Fuzzy Systems and Knowledge Discovery, Fourth International Conference on TI - Web Page Classification Based on a Least Square Support Vector Machine with Latent Semantic Analysis SN - 978-0-7695-3305-6 SP528 EP532 A1 - Yong Zhang, A1 - Bin Fan, A1 - Long-bin Xiao, PY - 2008 KW - web page classification KW - least square support vector machine KW - latent semantic analysis KW - web page expression KW - noise reduction VL - 2 JA - Fuzzy Systems and Knowledge Discovery, Fourth International Conference on ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/FSKD.2008.259
Chinese web page classification (WPC) has been considered as a hot research area in data mining. In order to effectively classify web pages, we present a web page categorization based on a least square support vector machine (LS-SVM) with latent semantic analysis (LSA). LSA uses Singular Value Decom- postion (SVD) to obtain latent semantic structure of original term-document matrix solving the polysemous and synonymous keywords problem. LS-SVM is an effective method for learning the classification knowledge from massive data, especially on condition of high cost in getting labeled classical examples. We adopt a novel method of web page expression, and make use of summarization algorithm to reduce the noise of web pages. A preliminary experimental comparison is made showing encouraging results.
Index Terms:
web page classification, least square support vector machine, latent semantic analysis, web page expression, noise reduction
Citation:
Yong Zhang, Bin Fan, Long-bin Xiao, "Web Page Classification Based on a Least Square Support Vector Machine with Latent Semantic Analysis," fskd, vol. 2, pp.528-532, 2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery, 2008
Usage of this product signifies your acceptance of the Terms of Use.
