This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
2009 13th International Conference on Computer Supported Cooperative Work in Design
A collaborative approach to building evaluated web pages datasets
Santiago, Chile
April 22-April 24
ISBN: 978-1-4244-3534-0
Ricardo Barros, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Jose A. Rodrigues Nt., COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Heraldo J. A. Carneiro Filho, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Fabricio R. S. Ferreira, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Oliverio C. Fernandes, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Carlos Eduardo P. Silva, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Andre L. G. Ribeiro, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Geraldo B. Xexeo, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
Jano M. de Souza, COPPE, Graduate School of Engineering, UFRJ - Federal University of Rio de Janeiro, Brazil
In order to evaluate information retrieval algorithms it is imperative to use a dataset as a test database. However, access to such datasets is often difficult and expensive, since building them is a time-consuming and costly task. This paper presents a collaborative approach to dataset creation that uses a data quality evaluation technique based on fuzzy theory, to assist users in selecting suitable web documents for their datasets. These documents are automatically captured by a crawler and assessed on information derived from their metadata.
Citation:
Ricardo Barros, Jose A. Rodrigues Nt., Heraldo J. A. Carneiro Filho, Fabricio R. S. Ferreira, Oliverio C. Fernandes, Carlos Eduardo P. Silva, Andre L. G. Ribeiro, Geraldo B. Xexeo, Jano M. de Souza, "A collaborative approach to building evaluated web pages datasets," cscwd, pp.668-673, 2009 13th International Conference on Computer Supported Cooperative Work in Design, 2009
Usage of this product signifies your acceptance of the Terms of Use.