This Article 
   
 Share 
   
 Bibliographic References 
   
 Add to: 
 
Digg
Furl
Spurl
Blink
Simpy
Google
Del.icio.us
Y!MyWeb
 
 Search 
   
SCBXP: An Efficient CAM-Based XML Parsing Technique in Hardware Environments
November 2011 (vol. 22 no. 11)
pp. 1879-1887
Fadi El-Hassan, University of Ottawa, Ottawa
Dan Ionescu, University of Ottawa, Ottawa
The underlying technologies of web information and distributed systems often require efficient XML parsing. Even though new software-based XML parsing techniques improve XML processing, the verbose nature of XML does not help to achieve the substantial improvements that are desired. In some systems, such as mobile devices, the restricted memory resources exacerbate the problems associated with XML processing. In this paper, we present a novel XML parsing technique—titled SCBXP—that is designed to achieve high performance in hardware-based environments. In addition, the parsing technique provides a natural way of checking for full well formedness and partial validation, thereby taking advantage of our CAM-based architecture and the inherent parallel features of the hardware. Furthermore, the efficiency of XML parsing is maintained even when memory resources are limited. The SCBXP technique architecture makes use of 1) a content-addressable memory that must be configured with a skeleton of the XML document being parsed, 2) a finite state machine that controls FIFOs, in order to align XML data properly, 3) multiple state machines acting on the multilevel nature of XML, and 4) dual-port memory modules. The results of testing the SCBXP technique, implemented on an FPGA, demonstrate that a processing rate of at least 2 bytes of XML data can be performed during each clock cycle.

[1] Extensible Markup Language (XML) 1.0 (Fifth Ed.), http://www.w3.org/TR/2008PER-xml-20080205 , 2008.
[2] F. El-Hassan and D. Ionescu, "SCBXP: An Efficient Hardware-Based XML Parsing Technique," Proc. Fifth Southern Conf. Programmable Logic (SPL '09), Apr. 2009.
[3] H. Sugano, S. Fujimoto, G. Klyne, A. Bateman, W. Carr, and J. Peterson, "Presence Information Data Format (PIDF)," RFC 3863, IETF, Aug. 2004.
[4] ModelSim, http:/www.model.com/, Oct. 2010.
[5] Altera. Section I. Stratix Device Family Data Sheet, http://www.altera.com/literature/hb/stxstratix_section_1_vol_1.pdf , Oct. 2010.

Index Terms:
XML processing, XML parsing, field programmable gate arrays, content addressable memory.
Citation:
Fadi El-Hassan, Dan Ionescu, "SCBXP: An Efficient CAM-Based XML Parsing Technique in Hardware Environments," IEEE Transactions on Parallel and Distributed Systems, vol. 22, no. 11, pp. 1879-1887, Nov. 2011, doi:10.1109/TPDS.2011.51
Usage of this product signifies your acceptance of the Terms of Use.