Issue No.01 - Jan. (2014 vol.26)
pp: 69-82
Mikel Larranaga , University of the Basque Country, Donostia
Angel Conde , University of the Basque Country, Donostia
Inaki Calvo , University of the Basque Country, Donostia
Jon A. Elorriaga , University of the Basque Country, Donostia
Ana Arruarte , University of the Basque Country, Donostia
ABSTRACT
Technology-supported learning systems have proved to be helpful in many learning situations. These systems require an appropriate representation of the knowledge to be learned, the Domain Module. Authoring the Domain Module is cost- and labor-intensive, but its development cost might be reduced by using semiautomatic Domain Module authoring techniques and promoting knowledge reuse. DOM-Sortze is a system that uses natural language processing techniques, heuristic reasoning, and ontologies for the semiautomatic construction of the Domain Module from electronic textbooks. To determine how it might help in the Domain Module authoring process, it was tested with an electronic textbook, and the knowledge it gathered was compared with the Domain Module that instructional designers had developed manually. This paper presents DOM-Sortze and describes the experiment carried out.
INDEX TERMS
Ontologies, Buildings, Educational institutions, Dictionaries, Data mining, Ontology design, Knowledge acquisition, Domain engineering
CITATION
Mikel Larranaga, Angel Conde, Inaki Calvo, Jon A. Elorriaga, Ana Arruarte, "Automatic Generation of the Domain Module from Electronic Textbooks: Method and Validation", IEEE Transactions on Knowledge & Data Engineering, vol. 26, no. 1, pp. 69-82, Jan. 2014, doi:10.1109/TKDE.2013.36