Codebox Software
A Top-Down Parser written in JavaScript
Published:
Here is a simple top-down parser written in JavaScript. Every so often I need to write a parser, and keep forgetting how the recursion works. This is a simple- and general-as-possible implementation that I can refer back to in the future.
To use the parser, call the buildParser() function with a single string argument containing the grammar.
Each production rule in the grammar should be on a separate line, and the symbol should be followed by the character sequence ->,
and then by the substitutions. The symbol | is used as a delimiter on the RHS of a rule if there are multiple valid substitutions.
By default the start symbol is START, and ε is used to represent an empty string,
although alternate values can be supplied when calling buildParser(). The parser checks for left-recursion in the
grammar, and will throw an error if it finds any - this prevents nasty infinite loops. Whitespace between tokens is ignored when parsing,
so input values of 1+2 and 1 + 2 are handled identically.
I have used several ES6 language features in the code, so this won't work in older browsers.