DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/MIC.2012.95
Evaluation is instrumental in developing and managing effective information retrieval systems and in ensuring high levels of user satisfaction. A number of publications have shown that crowdsourcing is a viable way to obtain relevance assessments. What is less well understood are the limits of crowdsourcing for the assessment task, particularly for domain-specific search. We present results comparing relevance assessments gathered through crowdsourcing with those gathered from a domain expert, for evaluating different search engines in a large government archive. While crowdsourced judgments rank the tested search engines in the same order as expert judgments, crowdsourced workers appear unable to distinguish different levels of highly accurate search results in the way that expert assessors can. We examine the nature of this limitation in crowdsourced workers for this experiment and discuss the viability of crowdsourcing for evaluating search in specialist settings.
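The kind of comparison described in the abstract can be illustrated with a small sketch. The abstract does not state which effectiveness measure or rank correlation the authors used, so precision at k and Kendall's tau are assumptions here, and the document IDs, system names, and judgments are hypothetical toy data, not results from the paper. The sketch shows how two sets of judgments can order systems identically while one of them separates the systems less clearly.

```python
from itertools import combinations

def precision_at_k(ranked_docs, relevant, k):
    """Fraction of the top-k retrieved documents judged relevant."""
    return sum(1 for doc in ranked_docs[:k] if doc in relevant) / k

def kendall_tau(scores_a, scores_b):
    """Kendall's tau (tau-a) between two score dicts over the same systems."""
    systems = list(scores_a)
    concordant = discordant = 0
    for s, t in combinations(systems, 2):
        sign = (scores_a[s] - scores_a[t]) * (scores_b[s] - scores_b[t])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(systems) * (len(systems) - 1) / 2
    return (concordant - discordant) / n_pairs

# Hypothetical toy data: one query, three systems, top-5 results each.
runs = {
    "A": ["d1", "d2", "d3", "d4", "d8"],
    "B": ["d1", "d2", "d5", "d8", "d9"],
    "C": ["d1", "d8", "d9", "d10", "d11"],
}
# Assumption for illustration: the crowd is slightly more lenient than the expert.
expert_relevant = {"d1", "d2", "d3", "d4"}
crowd_relevant = expert_relevant | {"d5"}

K = 5
expert_scores = {s: precision_at_k(r, expert_relevant, K) for s, r in runs.items()}
crowd_scores = {s: precision_at_k(r, crowd_relevant, K) for s, r in runs.items()}

# Same ordering of systems under both sets of judgments...
tau = kendall_tau(expert_scores, crowd_scores)
# ...but a smaller score gap between the two best systems under crowd judgments.
expert_gap = expert_scores["A"] - expert_scores["B"]
crowd_gap = crowd_scores["A"] - crowd_scores["B"]

print(f"tau={tau:.2f} expert_gap={expert_gap:.2f} crowd_gap={crowd_gap:.2f}")
```

In this toy case both sets of judgments agree on the ordering of the systems, yet the crowd judgments shrink the difference between the two strongest ones, which is the sort of effect a rank correlation alone would not reveal.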
Citation:
Paul Clough, Mark Sanderson, Jiayu Tang, Tim Gollins, Amy Warner, "Examining the limits of crowdsourcing for relevance assessment," IEEE Internet Computing, 28 June 2012. IEEE Computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/MIC.2012.95>

