This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
Redundancy Detection in Semistructured Case Bases
May/June 2001 (vol. 13 no. 3)
pp. 513-518

Abstract—With the dramatic proliferation of case-based reasoning systems in commercial applications, many case bases are now becoming legacy systems. They represent a significant portion of an organization's assets, but they are large and difficult to maintain. One of the contributing factors is that these case bases are often large and yet unstructured or semistructured; they are represented in natural language text. Adding to the complexity is the fact that the case bases are often authored and updated by different people from a variety of knowledge sources, making it highly likely for a case base to contain redundant and inconsistent knowledge. In this paper, we present methods and a system for maintaining large and semistructured case bases. We focus on a difficult problem in case base maintenance: redundancy detection. This problem is particularly pervasive when one deals with a semistructured case base. We will discuss an information-retrieval-based algorithm and an implemented system for solving this problem. As the ability to contain the knowledge acquisition problem is of paramount importance, our method allows one to express relevant domain expertise for detecting redundancy naturally and effortlessly. Empirical evaluations of the system demonstrate the effectiveness of the methods in several large domains.

[1] D. Aha, “Case-Based Learning Algorithms,” Proc. 1991 DARPA Case-Based Reasoning Workshop, vol. 1, pp. 147-158, 1991.
[2] W.B. Frakes and R. Baeza-Yates, Information Retrieval Data Structures&Algorithmss.Englewood Cliffs, N.J.: Prentice Hall, 1992.
[3] D. Gelbart and J.C. Smith, “Towards Combining Automated Text Retrieval and Case-Based Expert Legal Advice,” Law Technology J., vol. 1, pp. 19-24, 1992.
[4] H. Kaindl, “Verification and Validation of Knowledge-Based Systems Using Semiformal Representation,” Proc. AAAI 96 Workshop Verification and Validation of Knowledge-Based Systems, pp. 7-16, 1996.
[5] J.L. Kolodner, Case-Based Reasoning, Morgan Kaufmann, San Francisco, Calif., 1993.
[6] J. Kolodner and R.L. Simpson, “The MEDIATOR: Analysis of an Early Case-Based Problem Solver,” Cognitive Science, vol. 13, no. 4, pp. 507-549, 1989.
[7] D. Leake, Case-Based Reasoning—Expriences, Lessons and Future Directions. AAAI Press/MIT Press, 1996.
[8] P. Maes, "Agents That Reduce Work and Information Overload," Comm. ACM, Vol. 37, No. 7, July 1994, p. 31-40.
[9] T. Nguyen, M. Czerwinski, and D. Lee, “Compaq Quicksource—Providing the Consumer with the Power of AI,” AI Magazine, 1993.
[10] A.D. Preece, “Towards a Methodology for Evaluating Expert Systems,” Expert Systems, vol. 7, no. 4, pp. 215-233, 1990.
[11] G. Salton and M. McGill, Introduction to Modern Information Retrieval, McGraw Hill, New York, 1983.
[12] J. Shavlik, “Finding Genes by Case-Based Reasoning in the Presence of Noisy Case Boundaries,” Proc. 1991 DARPA Workshop Case-Based Reasoning, vol. 1, pp. 291-303, 1991.
[13] H. Shimazu and Y. Takashima, “Detecting Discontinuities in Case-Bases,” Proc. 13th Nat'l Conf. Aritifical Intelligence, vol. 1, pp. 690-695, 1996.
[14] E. Simoudis, “Using Case-Based Retrieval for Customer Technical Support,” IEEE Expert, vol. 7, no. 5, pp. 7-13, 1992.
[15] B. Smyth and M. Keane, “Remembering to Forget: A Competence-Preserving Case Deletion Policy for Case-Based Reasoning Systems,” Proc. Int'l Joint Conf. Artificial Intelligence, vol. 1, pp. 377-382, 1995.
[16] I. Watson, Applying Case-Based Reasoning: Techniques for Enterprise Systems. Morgan Kaufmann, 1997.
[17] Q. Yang, E. Kim, and K. Racine, “Caseadvisor: Supporting Interactive Problem Solving and Case Base Maintenance for Help Desk Applications,” Proc. Int'l Joint Conf. Artificial Intelligence (IJCAI 97 Workshop on Practical Applications of CBR), Aug. 1997.

Index Terms:
Data mining, knowledge and case base maintenance, knowledge acquisition in expert systems.
Citation:
Kirsti Racine, Qiang Yang, "Redundancy Detection in Semistructured Case Bases," IEEE Transactions on Knowledge and Data Engineering, vol. 13, no. 3, pp. 513-518, May-June 2001, doi:10.1109/69.929905
Usage of this product signifies your acceptance of the Terms of Use.