|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
2008 Fourth IEEE International Conference on eScience
Scalable Semantics
December 07-December 12
ISBN: 978-0-7695-3535-7
| ASCII Text | x | ||
| Andrew Newman, Yuan-Fang Li, Jane Hunter, "Scalable Semantics ," eScience, IEEE International Conference on, pp. 111-118, 2008 Fourth IEEE International Conference on eScience, 2008. | |||
| BibTex | x | ||
| @article{ 10.1109/eScience.2008.23, author = {Andrew Newman and Yuan-Fang Li and Jane Hunter}, title = {Scalable Semantics }, journal ={eScience, IEEE International Conference on}, volume = {0}, year = {2008}, isbn = {978-0-7695-3535-7}, pages = {111-118}, doi = {http://doi.ieeecomputersociety.org/10.1109/eScience.2008.23}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - eScience, IEEE International Conference on TI - Scalable Semantics SN - 978-0-7695-3535-7 SP111 EP118 A1 - Andrew Newman, A1 - Yuan-Fang Li, A1 - Jane Hunter, PY - 2008 KW - RDF KW - RDF molecules KW - MapReduce KW - distributed processing KW - data integration VL - 0 JA - eScience, IEEE International Conference on ER - | |||
Semantic inferencing and querying across large-scale RDF triple stores is notoriously slow. Our objective is to expedite this process by employing Google's MapReduce framework to implement scale-out distributed querying and reasoning. This approach requires RDF graphs to be decomposed into smaller units that are distributed across computational nodes. RDF Molecules appear to offer an ideal approach – providing an intermediate level of granularity between RDF graphs and triples. However, the original RDF molecule definition has inherent limitations that will adversely affect performance. In this paper, we propose a number of extensions to RDF molecules (hierarchy and ordering) to overcome these limitations. We then present some implementation details for our MapReduce-based RDF molecule store. Finally we evaluate the benefits of our approach in the context of the Bio-MANTA project – an application that requires integration and querying across large-scale protein-protein interaction datasets.
Index Terms:
RDF, RDF molecules, MapReduce, distributed processing, data integration
Citation:
Andrew Newman, Yuan-Fang Li, Jane Hunter, "Scalable Semantics ," escience, pp.111-118, 2008 Fourth IEEE International Conference on eScience, 2008
Usage of this product signifies your acceptance of the Terms of Use.
