The Effect of Cluster Size, Dimensionality, and the Number of Clusters on Recovery of True Cluster Structure
January 1983 (vol. 5 no. 1)
pp. 40-47
An evaluation of four clustering methods and four external criterion measures was conducted with respect to the effect of the number of clusters, dimensionality, and relative cluster sizes on the recovery of true cluster structure. The four methods were the single link, complete link, group average (UPGMA), and Ward's minimum variance algorithms. The results indicated that the four criterion measures were generally consistent with each other, and two highly similar pairs were identified among them. The first pair consisted of the Rand and corrected Rand statistics, and the second pair was the Jaccard and the Fowlkes and Mallows indexes. With respect to the methods, recovery was found to improve as the number of clusters increased and as the number of dimensions increased. The relative cluster size factor produced differential performance effects, with Ward's procedure providing the best recovery when the clusters were of equal size. The group average method gave equivalent or better recovery when the clusters were of unequal size.
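The external criterion measures named in the abstract all compare a recovered partition with the true one by counting pairs of objects. The following is a minimal sketch of that idea, not the authors' code: it uses the standard textbook definitions of the Rand, Jaccard, and Fowlkes and Mallows indexes. The function names and the example labels are illustrative assumptions, and the corrected Rand statistic is left out because the abstract does not say which correction was applied.

```python
from itertools import combinations
from math import sqrt


def pair_counts(true_labels, found_labels):
    """Classify every pair of objects by how the two partitions treat it.

    Returns (a, b, c, d) where
      a = pairs placed together in both partitions,
      b = pairs together in the true partition only,
      c = pairs together in the recovered partition only,
      d = pairs separated in both partitions.
    """
    if len(true_labels) != len(found_labels):
        raise ValueError("label sequences must have the same length")
    a = b = c = d = 0
    for i, j in combinations(range(len(true_labels)), 2):
        same_true = true_labels[i] == true_labels[j]
        same_found = found_labels[i] == found_labels[j]
        if same_true and same_found:
            a += 1
        elif same_true:
            b += 1
        elif same_found:
            c += 1
        else:
            d += 1
    return a, b, c, d


def rand_index(true_labels, found_labels):
    """Proportion of pairs on which the two partitions agree."""
    a, b, c, d = pair_counts(true_labels, found_labels)
    total = a + b + c + d
    return (a + d) / total if total else 1.0


def jaccard_index(true_labels, found_labels):
    """Like Rand, but ignores pairs separated in both partitions."""
    a, b, c, _ = pair_counts(true_labels, found_labels)
    denom = a + b + c
    return a / denom if denom else 1.0


def fowlkes_mallows_index(true_labels, found_labels):
    """Geometric mean of a/(a+b) and a/(a+c)."""
    a, b, c, _ = pair_counts(true_labels, found_labels)
    if a + b == 0 or a + c == 0:
        return 0.0
    return a / sqrt((a + b) * (a + c))


if __name__ == "__main__":
    # Hypothetical example: six objects, one object assigned to the wrong cluster.
    true_labels = [0, 0, 0, 1, 1, 1]
    found_labels = [0, 0, 1, 1, 1, 1]
    print(pair_counts(true_labels, found_labels))
    print(rand_index(true_labels, found_labels))
    print(jaccard_index(true_labels, found_labels))
    print(fowlkes_mallows_index(true_labels, found_labels))
```

For the hypothetical labels above, the pair counts are (4, 2, 3, 6), giving a Rand index of 10/15, a Jaccard index of 4/9, and a Fowlkes and Mallows index of 4/sqrt(42). The Jaccard and the Fowlkes and Mallows indexes both omit the count d of pairs separated in both partitions, which is one plausible reason they would behave as a similar pair.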
Citation:
Glenn W. Milligan, S. C. Soon, Lisa M. Sokol, "The Effect of Cluster Size, Dimensionality, and the Number of Clusters on Recovery of True Cluster Structure," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 5, no. 1, pp. 40-47, Jan. 1983, doi:10.1109/TPAMI.1983.4767342

