Issue No. 06 - November/December (2002 vol. 14)
<p><b>Abstract</b>—The generation of a set of rules underlying a classification problem is performed by applying a new algorithm called Hamming Clustering (HC). It reconstructs the <scp>and-or</scp> expression associated with any Boolean function from a training set of samples. The basic kernel of the method is the generation of clusters of input patterns that belong to the same class and are close to each other according to the Hamming distance. Inputs which do not influence the final output are identified, thus automatically reducing the complexity of the final set of rules. The performance of HC has been evaluated through a variety of artificial and real-world benchmarks. In particular, its application in the diagnosis of breast cancer has led to the derivation of a reduced set of rules solving the associated classification problem.</p>
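The clustering idea described in the abstract can be illustrated with a short sketch. This is not the authors' HC algorithm, only a simplified greedy illustration under stated assumptions: inputs are bit strings of equal length, the training set is consistent (no pattern is labeled both ways), and positive patterns are merged nearest-first by Hamming distance. The function names `hamming`, `covers`, `grow_cube`, and `rules` are hypothetical. A `*` in a resulting cube marks an input that does not influence the output for that rule.

```python
def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length bit strings differ."""
    return sum(x != y for x, y in zip(a, b))


def covers(cube: str, pattern: str) -> bool:
    """True if `pattern` matches `cube`, where '*' is a don't-care position."""
    return all(c in ("*", p) for c, p in zip(cube, pattern))


def grow_cube(seed: str, positives: list[str], negatives: list[str]) -> str:
    """Greedily absorb positive patterns into one cube, nearest to the seed first.

    Merging a pattern turns every position where it disagrees with the cube
    into '*'. A merge is rejected if the widened cube would cover a negative.
    (Assumed simplification; the paper's actual procedure may differ.)
    """
    cube = seed
    for p in sorted(positives, key=lambda q: hamming(seed, q)):
        widened = "".join(c if c == b else "*" for c, b in zip(cube, p))
        if not any(covers(widened, n) for n in negatives):
            cube = widened
    return cube


def rules(positives: list[str], negatives: list[str]) -> list[str]:
    """Cover every positive pattern with cubes; each cube is one AND term,
    and the list as a whole is read as their OR."""
    remaining, result = list(positives), []
    while remaining:
        cube = grow_cube(remaining[0], remaining, negatives)
        result.append(cube)
        # The cube always covers its own seed, so `remaining` shrinks each pass.
        remaining = [p for p in remaining if not covers(cube, p)]
    return result


# Training set for f(x1, x2, x3) = x1 AND x2, where x3 is irrelevant.
positives = ["110", "111"]
negatives = ["000", "001", "010", "011", "100", "101"]
print(rules(positives, negatives))
```

For this training set the sketch prints `['11*']`, a single rule "x1 AND x2" in which the third input has been dropped as irrelevant.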
Index Terms: Rule generation, Hamming clustering, knowledge discovery, Boolean function approximation, generalization.
Marco Muselli, Diego Liberati, "Binary Rule Generation via Hamming Clustering", IEEE Transactions on Knowledge & Data Engineering, vol. 14, no. 6, pp. 1258-1268, November/December 2002, doi:10.1109/TKDE.2002.1047766