The Community for Technology Leaders
Green Image
Issue No. 07 - July (2015 vol. 27)
ISSN: 1041-4347
pp: 1796-1809
Beth Trushkowsky , Department of Computer Science, Harvey Mudd College, Claremont, CA
Tim Kraska , Department of Computer Science, Brown University, Providence, RI
Michael J. Franklin , AMP Lab at UC Berkeley, Berkeley, CA
Purnamrita Sarkar , Department of Statistics and Data Science at UT Austin, Austin, TX
Venketaram Ramachandran , Amazon, Seattle, WA
ABSTRACT
Hybrid human/computer database systems promise to greatly expand the usefulness of query processing by incorporating the crowd for data gathering and other tasks. Such systems raise many implementation questions. Perhaps the most fundamental issue is that the closed world assumption underlying relational query semantics does not hold in such systems. As a consequence, the meaning of even simple queries can be called into question. Furthermore, query progress monitoring becomes difficult due to non-uniformities in the arrival of crowdsourced data and peculiarities of how people work in crowdsourcing systems. To address these issues, we develop statistical tools that enable users and systems developers to reason about query completeness. These tools can also help drive query execution and crowdsourcing strategies. We evaluate our techniques using experiments on a popular crowdsourcing platform.
INDEX TERMS
Estimation, Crowdsourcing, Query processing, Computers, Sociology,user interfaces, Database design, modeling and management
CITATION
Beth Trushkowsky, Tim Kraska, Michael J. Franklin, Purnamrita Sarkar, Venketaram Ramachandran, "Crowdsourcing Enumeration Queries: Estimators and Interfaces", IEEE Transactions on Knowledge & Data Engineering, vol. 27, no. , pp. 1796-1809, July 2015, doi:10.1109/TKDE.2014.2339857
91 ms
(Ver 3.3 (11022016))