Parser Objects
ParserObjects is a library for parsers and parser combinators. Combinators are the same general algorithm as Recursive Descent, but using objects instead of functions to represent grammar production rules. Some benefits of this approach include:
- Dynamic: You can easily construct or dynamically modify parser object graphs at runtime (normal Recursive Descent uses method references, which are generally set at compile time)
- Declarative: Declaration of parsers can closely resemble a pseudo-BNF grammar for easy readability
The Wikipedia page on Parsing gives a good overview of parsing in general and the page on Parser Combinators cover a lot of the important topics and also point to other resources for continued reading. See the Test Suite Examples for examples of use and implementation patterns.
Project Goals
The ParserObjects project intends to be an easy, usable and flexible parsing solution for the .NET ecosystem. The goal of this project is to be an easy on-ramp for developers who need to parse data but aren’t necessarily experts in parsing theory. You don’t need to know BNF or CFG or the details of any parsing algorithm in order to get started with ParserObjects. Parsers should be easy to build, the code should be easy to read, and the examples should be easy to understand.
This library also intends to showcase the Combinators approach to parsing, and the underlying ethos of being able to compose parsers together from reusable components.
Key Concepts and Abstractions
ParserObjects defines a few important abstractions.
ISequence<T>
ISequence<T>
is very similar to IEnumerable<T>
/IEnumerator<T>
. It allows you to stream items from a source one by one, but it also provides a few additional features which IEnumerable<T>
and IEnumerator<T>
don’t include. See the page on Sequence reference and usage for detailed information. Sequences are the basic input type for parsers.
IResult<T>
and IMultiResult<T>
The IResult<T>
and IMultiResult<T>
abstractions represent the result of a parser execution. Results communicate success or failure. On success it includes the derived value and on failure it includes an error message. The result also includes metadata about how many inputs were consumed and the location in the input stream where the match occurred, among other things.
IParser<TInput, TOutput>
and IMultiParser<TInput, TOutput>
IParser<TInput, TOutput>
and IMultiParser<TInput, TOutput>
and related interfaces represent a parser object. These are the core component of the ParserObjects library. Parsers take an ISequence<TInput>
of input values and return an IResult<TOutput>
or IMultiResult<TOutput>
indicating success or failure.
See the page on Parser Usage for information about using parsers.
- Core Parsers Reference
- Logical Parsers Reference
- String/Character Parser Reference
- Programming Parser Reference
- Pratt Parser Reference
- Multi Parsers Reference
- Earley Parser Reference
Examples
See the following pages for examples:
There are also several small examples of usable in the test suite.