36th Annual Hawaii International Conference on System Sciences (HICSS'03) - Track 4
Big Island, Hawaii
January 06-January 09
ISBN: 0-7695-1874-5
BibTeX:
@inproceedings{10.1109/HICSS.2003.1174251,
  author = {Longzhuang Li and Yi Shang and Wei Zhang and Hongchi Shi},
  title = {A General Method for Statistical Performance Evaluation},
  booktitle = {36th Annual Hawaii International Conference on System Sciences (HICSS'03) - Track 4},
  volume = {4},
  year = {2003},
  isbn = {0-7695-1874-5},
  pages = {108c},
  doi = {http://doi.ieeecomputersociety.org/10.1109/HICSS.2003.1174251},
  publisher = {IEEE Computer Society},
  address = {Los Alamitos, CA, USA},
}
In this paper, we propose a general method for statistical performance evaluation. The method incorporates various statistical metrics and automatically selects an appropriate one according to the problem parameters. Through simulation, we empirically compare the performance of five representative statistical metrics under different conditions: expected loss, the Friedman statistic, interval-based selection, probability of win, and probably approximately correct. In the experiments, expected loss is the best for small means, such as 1 or 2, and probably approximately correct is the best in all other cases. We also apply the general method to compare the performance of HITS-based algorithms that combine four relevance scoring methods, VSM, Okapi, TLS, and CDR, using a set of broad topic queries. Among the four relevance scoring methods, CDR is statistically the best when it is combined with a HITS-based algorithm.
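As an illustration of one of the five metrics named above, the following is a minimal sketch of the textbook Friedman statistic for comparing k methods over n queries. It is not taken from the paper: the function names, the score matrix, and the omission of a tie correction are all assumptions made for this example, and the paper's own formulation may differ.

```python
def average_ranks(values):
    """Rank values ascending (1 = smallest); tied values share the average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # mean of 1-based positions pos+1 .. end+1
        for i in order[pos:end + 1]:
            ranks[i] = shared
        pos = end + 1
    return ranks


def friedman_statistic(scores):
    """Textbook Friedman chi-square statistic, without tie correction.

    scores[i][j] is the score of method j on query i (hypothetical layout).
    """
    n = len(scores)      # number of queries (blocks)
    k = len(scores[0])   # number of methods (treatments)
    rank_sums = [0.0] * k
    for row in scores:
        for j, r in enumerate(average_ranks(row)):
            rank_sums[j] += r
    return (12.0 / (n * k * (k + 1))) * sum(r * r for r in rank_sums) - 3.0 * n * (k + 1)


# Made-up scores for three methods on four queries, for illustration only.
example_scores = [
    [0.41, 0.55, 0.62],
    [0.38, 0.49, 0.70],
    [0.45, 0.44, 0.58],
    [0.30, 0.52, 0.61],
]
print(friedman_statistic(example_scores))
```

A larger value indicates that the methods' rank sums differ more than chance would suggest; under the usual null hypothesis the statistic is compared against a chi-square distribution with k - 1 degrees of freedom.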
Citation:
Longzhuang Li, Yi Shang, Wei Zhang, Hongchi Shi, "A General Method for Statistical Performance Evaluation," 36th Annual Hawaii International Conference on System Sciences (HICSS'03) - Track 4, vol. 4, p. 108c, 2003.
