The Community for Technology Leaders
2014 IEEE 30th International Conference on Data Engineering Workshops (ICDEW) (2014)
Chicago, IL, USA
March 31, 2014 to April 4, 2014
ISBN: 978-1-4799-3481-2
TABLE OF CONTENTS

USB label (PDF)

pp. 1

USB welcome (PDF)

pp. 1

Hub page (PDF)

pp. 1

Session list (PDF)

pp. 1

Table of contents (PDF)

pp. 1-10

About CP (PDF)

pp. 1

BIIIG: Enabling business intelligence with integrated instance graphs (Abstract)

Andre Petermann , Department of Computer Science, University of Leipzig Augustusplatz 10, 04109 Leipzig, Germany
Martin Junghanns , Department of Computer Science, University of Leipzig Augustusplatz 10, 04109 Leipzig, Germany
Robert Muller , Faculty of Media, University of Applied Sciences Leipzig Karl-Liebknecht-Str. 145, 04277 Leipzig, Germany
Erhard Rahm , Department of Computer Science, University of Leipzig Augustusplatz 10, 04109 Leipzig, Germany
pp. 4-11

Graph databases for large-scale healthcare systems: A framework for efficient data management and data services (Abstract)

Yubin Park , University of Texas at Austin, TX, USA
Mallikarjun Shankar , Oak Ridge National Laboratory, TN, USA
Byung-Hoon Park , Oak Ridge National Laboratory, TN, USA
Joydeep Ghosh , University of Texas at Austin, TX, USA
pp. 12-19

On isomorphic matching of large disk-resident graphs using an XQuery engine (Abstract)

Carlos R. Rivero , Department of Computer Science University of Idaho, USA
Hasan M. Jamil , Department of Computer Science University of Idaho, USA
pp. 20-27

Typing query languages for data graphs (Abstract)

Dario Colazzo , LAMSADE, Université Paris Dauphine, Place du Maréchal de Lattre de Tassigny, 75 775 Paris Cedex 16, France
Carlo Sartiani , DIMIE, Università della Basilicata Via dell'Ateneo Lucano 10 - Potenza - Italy
pp. 28-31

Privacy-preserving reachability query services for sparse graphs (Abstract)

Peipei Yi , Hong Kong Baptist University
Zhe Fan , Hong Kong Baptist University
Shuxiang Yin , Fudan University
pp. 32-35

A hashtags dictionary from crowdsourced definitions (Abstract)

Merieme Ghenname , LT2C, Telecom Saint-Etienne, Université Jean Monnet 42000, Saint-Etienne France
Julien Subercaze , LT2C, Telecom Saint-Etienne, Université Jean Monnet 42000, Saint-Etienne France
Christophe Gravier , LT2C, Telecom Saint-Etienne, Université Jean Monnet 42000, Saint-Etienne France
Frederique Laforest , LT2C, Telecom Saint-Etienne, Université Jean Monnet 42000, Saint-Etienne France
Mounia Abik , LeRMA ENSIAS, Université Mohammed V Souissi 10000, Rabat Morocco
Rachida Ajhoun , LeRMA ENSIAS, Université Mohammed V Souissi 10000, Rabat Morocco
pp. 39-44

In schema matching, even experts are human: Towards expert sourcing in schema matching (Abstract)

Tomer Sagi , Technion - Israel Institute of Technology, Haifa, Israel
Avigdor Gal , Technion - Israel Institute of Technology, Haifa, Israel
pp. 45-49

Semantic management of Enterprise Integration Patterns: A use case in Smart Grids (Abstract)

Om P. Patri , Department of Computer Science, University of Southern California, Los Angeles, CA
Anand V. Panangadan , Ming-Hsieh Department of Electrical Engineering, University of Southern California, Los Angeles, CA
Vikrambhai S. Sorathia , Kensemble Tech Labs LLP, Gandhinagar, India
Viktor K. Prasanna , Ming-Hsieh Department of Electrical Engineering, University of Southern California, Los Angeles, CA
pp. 50-55

Bootstrapping Wikipedia to answer ambiguous person name queries (Abstract)

Toni Gruetze , Hasso Plattner Institute, Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam, Germany
Gjergji Kasneci , Hasso Plattner Institute, Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam, Germany
Zhe Zuo , Hasso Plattner Institute, Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam, Germany
Felix Naumann , Hasso Plattner Institute, Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam, Germany
pp. 56-61

Mapping abstract queries to big data web resources for on-the-fly data integration and information retrieval (Abstract)

Hasan M. Jamil , Department of Computer Science, University of Idaho, Moscow, Idaho, USA
pp. 62-67

Scholarly big data information extraction and integration in the CiteSeerχ digital library (Abstract)

Kyle Williams , Information Sciences and Technology, Pennsylvania State University, University Park, PA 16802, USA
Jian Wu , Information Sciences and Technology, Pennsylvania State University, University Park, PA 16802, USA
Sagnik Ray Choudhury , Information Sciences and Technology, Pennsylvania State University, University Park, PA 16802, USA
Madian Khabsa , Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802, USA
C. Lee Giles , Computer Science and Engineering, Pennsylvania State University, University Park, PA 16802, USA
pp. 68-73

Aggregation of similarity measures in schema matching based on generalized mean (Abstract)

Faten A. Elshwimy , Department of Computer Engineering, Tanta University, Egypt
Alsayed Algergawy , Department of Computer Engineering, Tanta University, Egypt
Amany Sarhan , Department of Computer Engineering, Tanta University, Egypt
Elsayed A. Sallam , Department of Computer Engineering, Tanta University, Egypt
pp. 74-79

A tool for personal data extraction (Abstract)

Daniela Vianna , Department of Computer Science Rutgers University, Piscataway, NJ 08854-8019
Alicia-Michelle Yong , Department of Computer Science Rutgers University, Piscataway, NJ 08854-8019
Chaolun Xia , Department of Computer Science Rutgers University, Piscataway, NJ 08854-8019
Amelie Marian , Department of Computer Science Rutgers University, Piscataway, NJ 08854-8019
Thu Nguyen , Department of Computer Science Rutgers University, Piscataway, NJ 08854-8019
pp. 80-83

Reconciling malware labeling discrepancy via consensus learning (Abstract)

Ting Wang , IBM T.J. Watson Research Center
Xin Hu , IBM T.J. Watson Research Center
Shicong Meng , IBM T.J. Watson Research Center
Reiner Sailer , IBM T.J. Watson Research Center
pp. 84-89

SortingHat: A framework for deep matching between classes of entities (Abstract)

Sumant Kulkarni , International Institute of Information Technology Bangalore 26/C, Electronics City, Bangalore, India 560100
Srinath Srinivasa , International Institute of Information Technology Bangalore 26/C, Electronics City, Bangalore, India 560100
Jyotiska Nath Khasnabish , International Institute of Information Technology Bangalore 26/C, Electronics City, Bangalore, India 560100
Kartikay Nagal , International Institute of Information Technology Bangalore 26/C, Electronics City, Bangalore, India 560100
Sandeep G Kurdagi , International Institute of Information Technology Bangalore 26/C, Electronics City, Bangalore, India 560100
pp. 90-93

Leveraging in-memory technology for interactive analyses of point-of-sales data (Abstract)

David Schwalb , Hasso Plattner Institute
Martin Faust , Hasso Plattner Institute
Jens Krueger , SAP AG
Hasso Plattner , Hasso Plattner Institute
pp. 97-102

Overlap versus partition: Marketing classification and customer profiling in complex networks of products (Abstract)

Diego Pennacchioli , IMT - Lucca, P.za San Ponziano, 6, Lucca, Italy
Michele Coscia , CID - Harvard University, 79 JFK Street, Cambridge, MA, USA
Dino Pedreschi , KDDLab University of Pisa, Largo B. Pontecorvo, 3, Pisa, Italy
pp. 103-110

Harnessing the crowds for multi-channel marketing monitoring (Abstract)

Haggai Roitman , IBM Research - Haifa, Haifa University Campus, Israel 31905
Gilad Barkai , IBM Research - Haifa, Haifa University Campus, Israel 31905
David Konopnicki , IBM Research - Haifa, Haifa University Campus, Israel 31905
Michal Shmueli-Scheuer , IBM Research - Haifa, Haifa University Campus, Israel 31905
pp. 111-114

Characterizing comparison shopping behavior: A case study (Abstract)

Mona Gupta , Department of Computer Science and Engineering, Indian Institute of Technology, New Delhi, India
Happy Mittal , Department of Computer Science and Engineering, Indian Institute of Technology, New Delhi, India
Parag Singla , Department of Computer Science and Engineering, Indian Institute of Technology, New Delhi, India
Amitabha Bagchi , Department of Computer Science and Engineering, Indian Institute of Technology, New Delhi, India
pp. 115-122

A provenance-based approach to manage long term preservation of scientific data (Abstract)

Renato Beserra Sousa , Institute of Computing, Unicamp, Av. Albert Einstein, 1251, Campinas/SP - Brasil
Daniel Cintra Cugler , Institute of Computing, Unicamp, Av. Albert Einstein, 1251, Campinas/SP - Brasil
Joana Esther Gonzales Malaverri , Institute of Computing, Unicamp, Av. Albert Einstein, 1251, Campinas/SP - Brasil
Claudia Bauzer Medeiros , Institute of Computing, Unicamp, Av. Albert Einstein, 1251, Campinas/SP - Brasil
pp. 162-133

PDS4: A model-driven planetary science data architecture for long-term preservation (Abstract)

John S. Hughes , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
Daniel Crichton , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
Sean Hardman , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
Emily Law , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
Ronald Joyner , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
Paul Ramirez , Jet Propulsion Laboratory, California Institute of Technology 4800 Oak Grove Drive, Pasadena, CA 91109 U.S.A.
pp. 134-141

Data and Software Preservation for Open Science (DASPOS) (Abstract)

Michael D. Hildreth , Physics Department, University of Notre Dame Notre Dame, IN, USA
pp. 142-146

Method and components for creating scientific workflow (Abstract)

Yuan Lin , Université Montpellier 2, LIRMM UMR 5506, CC477, 161 rue Ada, 34095 Montpellier Cedex 5 - France
Isabelle Mougenot , Université de Montpellier II, 2 Place Eugne Bataillon 34095 Montpellier Cedex 5, France
Therese Libourel , Université de Montpellier II, 2 Place Eugne Bataillon 34095 Montpellier Cedex 5, France
pp. 147-153

A validation framework for the long term preservation of high energy physics data (Abstract)

Dmitri Ozerov , Deutsches Elektronen Synchrotron Notkestrasse 85, 22607 Hamburg, Germany
David M. South , Deutsches Elektronen Synchrotron Notkestrasse 85, 22607 Hamburg, Germany
pp. 154-158

Towards improvements on the quality of service for multi-tenant RDBMS in the cloud (Abstract)

Leonardo O. Moreira , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
Victor A. E. Farias , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
Flavio R. C. Sousa , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
Gustavo A. C. Santos , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
Jose G. R. Maia , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
Javam C. Machado , Departament of Computer Science, Federal University of Ceará, Fortaleza, Brazil
pp. 162-169

PolarDBMS: Towards a cost-effective and policy-based data management in the cloud (Abstract)

Ilir Fetai , Department of Informatics and Mathematics University of Basel Switzerland
Filip-M. Brinkmann , Department of Informatics and Mathematics University of Basel Switzerland
Heiko Schuldt , Department of Informatics and Mathematics University of Basel Switzerland
pp. 170-177

SLA-driven workload management for cloud databases (Abstract)

Dimokritos Stamatakis , Brandeis University, Waltham, MA, USA
Olga Papaemmanouil , Brandeis University, Waltham, MA, USA
pp. 178-181

Parallel join executions in RAMCloud (Abstract)

Christian Tinnefeld , Hasso Plattner Institute, University of Potsdam, Germany
Donald Kossmann , Systems Group, ETH Zurich, Switzerland
Joos-Hendrik Boese , SAP AG, Walldorf, Germany
Hasso Plattner , Hasso Plattner Institute, University of Potsdam, Germany
pp. 182-190

Neighbor-base similarity matching for graphs (Abstract)

Hang Zhang , School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China 150001
Hongzhi Wang , School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China 150001
Jianzhong Li , School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China 150001
Hong Gao , School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China 150001
pp. 191-198

Data stream partitioning re-optimization based on runtime dependency mining (Abstract)

Emeric Viel , System Software Laboratories, Fujitsu Laboratories Ltd., Kawasaki, Japan
Haruyasu Ueda , System Software Laboratories, Fujitsu Laboratories Ltd., Kawasaki, Japan
pp. 199-206

Curracurrong cloud: Stream processing in the cloud (Abstract)

Vasvi Kakkad , School of Information Technologies, The University of Sydney, Sydney NSW 2006 Australia
Akon Dey , School of Information Technologies, The University of Sydney, Sydney NSW 2006 Australia
Alan Fekete , School of Information Technologies, The University of Sydney, Sydney NSW 2006 Australia
Bernhard Scholz , School of Information Technologies, The University of Sydney, Sydney NSW 2006 Australia
pp. 207-214

Orestes: A scalable Database-as-a-Service architecture for low latency (Abstract)

Felix Gessert , Computer Science Department, University of Hamburg Vogt-Kölln Straße 33, 22527 Hamburg, Germany
Florian Bucklers , Computer Science Department, University of Hamburg Vogt-Kölln Straße 33, 22527 Hamburg, Germany
Norbert Ritter , Computer Science Department, University of Hamburg Vogt-Kölln Straße 33, 22527 Hamburg, Germany
pp. 215-222

YCSB+T: Benchmarking web-scale transactional databases (Abstract)

Akon Dey , School of Information Technologies, University of Sydney, Sydney NSW 2006 Australia
Alan Fekete , School of Information Technologies, University of Sydney, Sydney NSW 2006 Australia
Raghunath Nambiar , Cisco Systems, Inc., 275 East Tasman Drive, San Jose, CA 95134 USA
Uwe Rohm , School of Information Technologies, University of Sydney, Sydney NSW 2006 Australia
pp. 223-230

Benchmarking cloud-based tagging services (Abstract)

Tanu Malik , Computation Institute, University of Chicago and Argonne National Laboratory, 5735 S Ellis. Ave, Chicago, IL 60637
Kyle Chard , Computation Institute, University of Chicago and Argonne National Laboratory, 5735 S Ellis. Ave, Chicago, IL 60637
Ian Foster , Computation Institute, University of Chicago and Argonne National Laboratory, 5735 S Ellis. Ave, Chicago, IL 60637
pp. 231-238

Extending contexts with ontologies for multidimensional data quality assessment (Abstract)

Mostafa Milani , Carleton University, School of Computer Science, Ottawa, Canada
Leopoldo Bertossi , Carleton University, School of Computer Science, Ottawa, Canada
Sina Ariyan , Carleton University, School of Computer Science, Ottawa, Canada
pp. 242-247

Assigning global relevance scores to DBpedia facts (Abstract)

Philipp Langer , Hasso Plattner Institute (HPI) Potsdam, Germany
Patrick Schulze , Hasso Plattner Institute (HPI) Potsdam, Germany
Stefan George , Hasso Plattner Institute (HPI) Potsdam, Germany
Matthias Kohnen , Hasso Plattner Institute (HPI) Potsdam, Germany
Tobias Metzke , Hasso Plattner Institute (HPI) Potsdam, Germany
Ziawasch Abedjan , Hasso Plattner Institute (HPI) Potsdam, Germany
Gjergji Kasneci , Hasso Plattner Institute (HPI) Potsdam, Germany
pp. 248-253

Balloon Fusion: SPARQL rewriting based on unified co-reference information (Abstract)

Kai Schlegel , University of Passau Innstraβe 43, 94032 Passau, Germany
Florian Stegmaier , University of Passau Innstraβe 43, 94032 Passau, Germany
Sebastian Bayerl , University of Passau Innstraβe 43, 94032 Passau, Germany
Michael Granitzer , University of Passau Innstraβe 43, 94032 Passau, Germany
Harald Kosch , University of Passau Innstraβe 43, 94032 Passau, Germany
pp. 254-259

LODHub — A platform for sharing and integrated processing of linked open data (Abstract)

Stefan Hagedorn , Databases & Information Systems Group, Technische Universität Ilmenau, Germany
Kai-Uwe Sattler , Databases & Information Systems Group, Technische Universität Ilmenau, Germany
pp. 260-262

RQ-RDF-3X: Going beyond triplestores (Abstract)

Jyoti Leeka , IIIT-Delhi Delhi, India
Srikanta Bedathur , IIIT-Delhi Delhi, India
pp. 263-268

On reflection in Linked Data management (Abstract)

George H. L. Fletcher , Eindhoven University of Technology The Netherlands
pp. 269-271

How to generate query parameters in RDF benchmarks? (Abstract)

Andrey Gubichev , TU Munich, Germany
Renzo Angles , Universidad de Talca, Chile, VU University Amsterdam, Netherlands
Peter Boncz , CWI, Netherlands
pp. 272-274

Towards automated personalized data storage (Abstract)

John Lange , Department of Computer Science, University of Pittsburgh Pittsburgh, PA 15260, USA
Alexandros Labrinidis , Department of Computer Science, University of Pittsburgh Pittsburgh, PA 15260, USA
Panos K. Chrysanthis , Department of Computer Science, University of Pittsburgh Pittsburgh, PA 15260, USA
pp. 278-283

Cinderella — Adaptive online partitioning of irregularly structured data (Abstract)

Kai Herrmann , Database Technology Group, Technische Universität Dresden, 01062 Dresden, Germany
Hannes Voigt , Database Technology Group, Technische Universität Dresden, 01062 Dresden, Germany
Wolfgang Lehner , Database Technology Group, Technische Universität Dresden, 01062 Dresden, Germany
pp. 284-291

Interactive data exploration based on user relevance feedback (Abstract)

Kyriaki Dimitriadou , Brandeis University, Waltham, MA, USA
Olga Papaemmanouil , Brandeis University, Waltham, MA, USA
Yanlei Diao , University of Massachusetts Amherst, Amherst, MA, USA
pp. 292-295

Auto-scaling techniques for elastic data stream processing (Abstract)

Thomas Heinze , SAP AG Chemnitzer Str. 48, 01069 Dresden, Germany
Valerio Pappalardo , SAP AG Chemnitzer Str. 48, 01069 Dresden, Germany
Zbigniew Jerzak , SAP AG Chemnitzer Str. 48, 01069 Dresden, Germany
Christof Fetzer , TU Dresden, Systems Engineering Group Noethnitzer Str. 46, 01187 Dresden, Germany
pp. 296-302

Automated operator placement in distributed Data Stream Management Systems subject to user constraints (Abstract)

Cory Thoma , Department of Computer Science, University of Pittsburgh
Alexandros Labrinidis , Department of Computer Science, University of Pittsburgh
Adam J. Lee , Department of Computer Science, University of Pittsburgh
pp. 310-316

Automatic user steering for interactive data exploration (Abstract)

Kyriaki Dimitriadou , Brandeis University, Waltham, MA, USA
pp. 320-324

2B or not 2B and everything in between — novel evaluation methods for matching problems (Abstract)

Tomer Sagi , Technion - Israel Institute of Technology, Haifa, Israel
pp. 325-329

Trip similarity computation for context-aware travel recommendation exploiting geotagged photos (Abstract)

Zhenxing Xu , College of Computer Science, Zhejiang University, Hangzhou, P.R. China
pp. 330-334

Towards optimization of RDF analytical queries on MapReduce (Abstract)

Padmashree Ravindra , Department of Computer Science, North Carolina State University
pp. 335-339

Predictive query processing on moving objects (Abstract)

Abdeltawab M. Hendawi , Department of Computer Science and Engineering, University of Minnesota, Minneapolis, MN 55455, USA
pp. 340-344

Aggregates caching for enterprise applications (Abstract)

Stephan Muller , Hasso Plattner Institute, University of Potsdam, August-Bebel-Str. 88, D-14482 Potsdam
pp. 345-349

Analysis and detection of low quality information in social networks (Abstract)

De Wang , College of Computing, Georgia Institute of Technology, Atlanta, Georgia, United States. 30332-0765
pp. 350-354

User-driven refinement of imprecise queries (Abstract)

Bahar Qarabaqi , College of Computer and Information Science, Northeastern University, Boston, USA
pp. 355-359
104 ms
(Ver 3.3 (11022016))