Complex SQL Predicates as Quantifiers
By Antonio Badia, Anna Wagner
Issue Date:July 2014
We propose a logical framework to analyze complex predicates (those involving a subquery) in SQL. We propose a new operator in the relational algebra for handling such predicates, and study its properties and how it combines with traditional relational ope...
Databases as Social Entities
By Antonio Badia
Issue Date:September 2012
Databases can be seen as social entities, since they are used by a group of users, not a single one. This article describes methods to transform this group into a social network and then use the added social dimension for collaborative tasks. The author de...
Opaque Attribute Alignment
By Jennifer Sleeman, Rafael Alonso, Hua Li, Art Pope, Antonio Badia
Issue Date:April 2012
Ontology alignment describes a process of mapping ontological concepts, classes and attributes between different ontologies providing a way to achieve interoperability. While there has been considerable research in this area, most approaches that rely upon...
A Web Usage Mining Framework for Mining Evolving User Profiles in Dynamic Web Sites
By Olfa Nasraoui, Maha Soliman, Esin Saka, Antonio Badia, Richard Germain
Issue Date:February 2008
In this paper, we present a complete framework and findings in mining web usage patterns from Web log files of a real website that has all the challenging aspects of real-life web usage mining, including evolving user profiles and external data describing ...
Redundancy Awareness in SQL Queries
By Bin Cao, Antonio Badia
Issue Date:April 2006
In this paper, we study SQL queries with aggregate subqueries that share common tables and conditions with the outer query. While several approaches can deal with such queries, they have limited applicability. We propose the redundancy awareness method to ...
Conceptual Modeling for Semistructured Data
By Antonio Badia
Issue Date:December 2002
We review the more widely used models in Conceptual Modeling for Information Systems (Entity-Relationship and UML), and argue that they do not effectively support the modeling of semistructured data. As a consequence, structured and semistructured data cannot ...
Is formalizing events necessary for full exploitation
By Antonio Badia
Issue Date:October 2010
Part of what makes event extraction difficult is the lack of a clear definition of the event concept. We argue that a good definition would be a positive contribution to Information Extraction and its applications. We give a preliminary discussion of a log...
SQL query optimization through nested relational algebra
By Antonio Badia, Bin Cao
Issue Date:August 2007
Most research work on optimization of nested queries focuses on aggregate subqueries. In this article, we show that existing approaches are not adequate for nonaggregate subqueries, especially for those having multiple subqueries and certain comparison ope...
Non-linear prefixes in query languages
By Antonio Badia, Stijn Vansummeren
Issue Date:June 2007
In first-order logic there are two main extensions to quantification: generalized quantifiers and non-linear prefixes. While generalized quantifiers have been explored from a database perspective, non-linear prefixes have not, most likely because of complex...
Focused crawling: experiences in a real world project
By Antonio Badia, Olfa Nasraoui, Tulay Muezzinoglu
Issue Date:May 2006
The web crawler space is often delimited into two general areas: full-web crawling and focused crawling. We present netSifter, a crawler system which integrates features from these two areas to provide an effective mechanism for web-scale crawling. netSift...
Graph building as a mining activity: finding links in the small
By Antonio Badia, Mehmed Kantardzic
Issue Date:August 2005
Many analyses of data proceed by building a graph out of the data set and then using social network theory and similar tools on the result. However, there is no theory concerning the construction of the graph itself, even though this is a very important pr...
A nested relational approach to processing SQL subqueries
By Antonio Badia, Bin Cao
Issue Date:June 2005
One of the most powerful features of SQL is the use of nested queries. Most research work on the optimization of nested queries focuses on aggregate subqueries. However, the solutions proposed for non-aggregate subqueries are still limited, especially for ...
Adding subqueries to MySQL, what does it take to have a decision-support engine?
By Antonio Badia, Bin Cao, Matt Chanda
Issue Date:November 2002
MySQL is an OLTP system without the ability to handle SQL subqueries. As part of our project in query optimization, we add subquery processing to MySQL through rewriting. First, we incorporate the ability to handle subqueries in the FROM clause; then rewri...
Providing better support for a class of decision support queries
By Antonio Badia, Dirk van Gucht, Sudhir G. Rao
Issue Date:June 1996
Relational database systems do not effectively support complex queries containing quantifiers (quantified queries), which are becoming increasingly important in decision support applications. Generalized quantifiers provide an effective way of expressing suc...