The Community for Technology Leaders
RSS Icon
Subscribe
Issue No.04 - April (2013 vol.25)
pp: 805-819
Patrice Buche , UMR INRA IATE and LIRMM, Montpellier
Juliette Dibie-Barthelemy , INRA Metarisk, Paris
Liliana Ibanescu , INRA Metarisk, Paris
Lydie Soler , INRA Metarisk, Paris
ABSTRACT
In this paper, we present the design of ONDINE system which allows the loading and the querying of a data warehouse opened on the Web, guided by an Ontological and Terminological Resource (OTR). The data warehouse, composed of data tables extracted from Web documents, has been built to supplement existing local data sources. First, we present the main steps of our semiautomatic method to annotate data tables driven by an OTR. The output of this method is an XML/RDF data warehouse composed of XML documents representing data tables with their fuzzy RDF annotations. We then present our flexible querying system which allows the local data sources and the data warehouse to be simultaneously and uniformly queried, using the OTR. This system relies on SPARQL and allows approximate answers to be retrieved by comparing preferences expressed as fuzzy sets with fuzzy RDF annotations.
INDEX TERMS
Ontologies, Resource description framework, Semantics, Data mining, OWL, XML, Data warehouses, knowledge modeling, Knowledge and data engineering tools and techniques, XML/XSL/RDF, uncertainty, "fuzzy", and probabilistic reasoning, representations, data structures, and transforms
CITATION
Patrice Buche, Juliette Dibie-Barthelemy, Liliana Ibanescu, Lydie Soler, "Fuzzy Web Data Tables Integration Guided by an Ontological and Terminological Resource", IEEE Transactions on Knowledge & Data Engineering, vol.25, no. 4, pp. 805-819, April 2013, doi:10.1109/TKDE.2011.245
REFERENCES
[1] P. Buche and O. Haemmerlé, "Towards a Unified Querying System of Both Structured and Semi-Structured Imprecise Data Using Fuzzy Views," Proc. Linguistic on Conceptual Structures: Logical Linguistic, and Computational Issues (ICCS), pp. 207-220, 2000.
[2] P. Buche, C. Dervin, O. Haemmerlé, and R. Thomopoulos, "Fuzzy Querying of Incomplete, Imprecise, and Heterogeneously Structured Data in the Relational Model Using Ontologies and Rules," IEEE Trans. Fuzzy Systems, vol. 13, no. 3, pp. 373-383, June 2005.
[3] G. Hignette, P. Buche, J. Dibie-Barthélemy, and O. Haemmerlé, "An Ontology-Driven Annotation of Data Tables," Proc. WISE Workshops Web Data Integration and Management for Life Sciences, pp. 29-40, 2007.
[4] G. Hignette, P. Buche, J. Dibie-Barthélemy, and O. Haemmerlé, "Fuzzy Annotation of Web Data Tables Driven by a Domain Ontology," Proc. Sixth European Semantic Web Conf. The Semantic Web: Research and Applications (ESWC), pp. 638-653, 2009.
[5] P. Buche, J. Dibie-Barthélemy, and H. Chebil, "Flexible Sparql Querying of Web Data Tables Driven by an Ontology," Proc. Eight Int'l Conf. Flexible Query Answering Systems (FQAS), pp. 345-357, 2009.
[6] P. Cimiano, P. Buitelaar, J. McCrae, and M. Sintek, "Lexinfo: A Declarative Model for the Lexicon-Ontology Interface," J. Web Semantics, vol. 9, no. 1, pp. 29-51, 2011.
[7] J. McCrae, D. Spohr, and P. Cimiano, "Linking Lexical Resources and Ontologies on the Semantic Web with Lemon," Proc. Eight Extended Semantic Web Conf. The Semantic Web: Research and Applications (ESWC), pp. 245-259, 2011.
[8] T. Declerck and P. Lendvai, "Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems," Proc. Seventh Int'l Conf. Language Resources and Evaluation (LREC '10), 2010.
[9] A. Reymonet, J. Thomas, and N. Aussenac-Gilles, "Modelling Ontological and Terminological Resources in OWL DL," Proc. OntoLex 2007 - Workshop associated with ISWC '07, Sixth Int'l Semantic Web Conf. (ISWC '07), 2007.
[10] C. Roche, M. Calberg-Challot, L. Damas, and P. Rouard, "Ontoterminology - A New Paradigm for Terminology," Proc. Int'l Conf. Knowledge Eng. and Ontology Development (KEOD), pp. 321-326. 2009,
[11] A. Reymonet, J. Thomas, and N. Aussenac-Gilles, "Ontology Based Information Retrieval: An Application to Automotive Diagnosis," Proc. Int'l Workshop Principles of Diagnosis, pp. 9-14, 2009.
[12] N. Noy, A. Rector, P. Hayes, and C. Welty, "Defining Nary Relations on the Semantic Web W3C Working Group Note," http://www.w3.org/TRswbp-n-aryRelations, 2012.
[13] R. Yangarber, W. Lin, and R. Grishman, "Unsupervised Learning of Generalized Names," Proc. Int'l Conf. Computational Linguistics, pp. 1-7, 2002.
[14] C.J. van Rijsbergen, Information Retrieval. Butterworth, 1979.
[15] J.C. Platt, Fast Training of Support Vector Machines Using Sequential Minimal Optimization, pp. 185-208. MIT Press, 1999.
[16] L. Zadeh, "Fuzzy Sets," Information and Control, vol. 8, pp. 338-353, 1965.
[17] L. Zadeh, "Fuzzy Sets as a Basis for a Theory of Possibility," Fuzzy Sets and Systems, vol. 1, pp. 3-28, 1978.
[18] D. Dubois and H. Prade, "The Three Semantics of Fuzzy Sets," Fuzzy Sets and Systems, vol. 90, pp. 141-150, 1997.
[19] D. Dubois and H. Prade, Possibility Theory - An Approach to Computerized Processing of Uncertainty. Plenum Press, 1988.
[20] M. Baziz, M. Boughanem, H. Prade, and G. Pasi, "A Fuzzy Logic Approach to Information Retrieval Using a Ontology-Based Representation of Documents," Fuzzy Logic and the Semantic Web, vol. 1, pp. 363-377, 2006.
[21] Y. Liu, K. Bai, P. Mitra, and C.L. Giles, "Tableseer: Automatic Table Metadata Extraction and Searching in Digital Libraries," Proc. ACM/IEEE-CS Seventh Joint Conf. Digital Libraries (JCDL), pp. 91-100, 2007.
[22] M.J. Cafarella, A.Y. Halevy, Y. Zhang, D.Z. Wang, and E. Wu, "Uncovering the Relational Web," Proc. 11th Int'l Workshop Web and Databases (WebDB), 2008.
[23] M.J. Cafarella, A.Y. Halevy, D.Z. Wang, E. Wu, and Y. Zhang, "Webtables: Exploring the Power of Tables on the Web," Proc. VLDB Endowment, vol. 1, no. 1, pp. 538-549, 2008.
[24] M. van Assem, H. Rijgersberg, M. Wigham, and J. Top, "Converting and Annotating Quantitative Data Tables," Proc. Ninth Int'l Semantic Web Conf. The Semantic Web, pp. 16-31, 2010.
[25] S. Tenier, Y. Toussaint, A. Napoli, and X. Polanco, "Instantiation of Relations for Semantic Annotation," Proc. Int'l Conf. Web Intelligence, pp. 463-472, 2006.
[26] D.W. Embley, C. Tao, and S.W. Liddle, "Automatically Extracting Ontologically Specified Data from HTML Tables of Unknown Structure," Proc. 21st Int'l Conf. Conceptual Modeling (ER), pp. 322-337. 2002,
[27] A. Campi, E. Damiani, S. Guinea, S. Marrara, G. Pasi, and P. Spoletini, "A Fuzzy Extension for the Xpath Query Language," Proc. Seventh Int'l Conf. Flexible Query Answering Systems (FQAS), pp. 210-221, 2006.
[28] C.A. Hutardo, A. Poulovassilis, and P.T. Wood, "A Relaxed Approach to Rdf Querying," Proc. Fifth Int'l Conf. The Semantic Web (ISWC), vol. 4273, pp. 314-328, 2006.
[29] O. Corby, R. Dieng-Kuntz, C. Faron-Zucker, and F. Gandon, "Searching the Semantic Web: Approximate Query Processing Based on Ontologies," IEEE Intelligent Systems J., vol. 21, no. 1, pp. 20-27, Jan.-Feb. 2006.
[30] J.Z. Pan, G.B. Stamou, G. Stoilos, S. Taylor, and E. Thomas, "Scalable Querying Services over Fuzzy Ontologies," Proc. 17th Int'l Conf. World Wide Web (WWW), pp. 575-584, 2008.
[31] P. Buche, O. Couvert, J. Dibie-Barthélemy, G. Hignette, E. Mettler, and L. Soler, "Flexible Querying of Web Data to Simulate Bacterial Growth in Food," Food Microbiology, vol. 28, no. 4, pp. 685-693, 2011.
[32] H. Rijgersberg, M. Wigham, and J.L. Top, "How Semantics Can Improve Engineering Processes: A Case of Units Measure and Quantities," Advanced Eng. Informatics, vol. 25, no. 2, pp. 276-287, 2011.
19 ms
(Ver 2.0)

Marketing Automation Platform Marketing Automation Tool