Proceedings Eighth Working Conference on Reverse Engineering (2001)
Oct. 2, 2001 to Oct. 5, 2001
Leon Moonen , CWI
Source model extraction---the automated extraction of information from system artifacts---is a common phase in reverse engineering tools. One of the major challenges of this phase is creating extractors that can deal with irregularities in the artifacts that are typical for the reverse engineering domain (for example, syntactic errors, incomplete source code, language dialects and embedded languages).This paper proposes a solution in the form of island grammars, a special kind of grammars that combine the detailed specification possibilities of grammars with the liberal behavior of lexical approaches. We show how island grammars can be used to generate robust parsers that combine the accuracy of syntactical analysis with the speed, flexibility and tolerance usually only found in lexical analysis. We conclude with a discussion of the development of Mangrove, a generator for source model extractors based on island grammars and describe its application to a number of case studies.
Island grammars, parser generation, source model extraction, partial parsing, fuzzy parsing, reverse engineering, program analysis.
L. Moonen, "Generating Robust Parsers using Island Grammars," Proceedings Eighth Working Conference on Reverse Engineering(WCRE), Suttgart, Germany, 2001, pp. 13.