File re2parser.hh

Parser transforming re2 regular expressions to their corresponding automata representations.

Enums

enum class Encoding

Values:

enumerator Utf8

enumerator Latin1

namespace mata

Main namespace including structs and algorithms for all automata.

In particular, this includes:

Alphabets,
Formula graphs and nodes,
Mintermization,
Closed sets.

namespace parser

Parser from .mata format to automata (currently Nfa and Afa are supported).

This includes parsing either from files or from other streams (strings, etc.).

Functions

nfa::Nfa create_nfa(const std::string &pattern, bool use_epsilon = false, Symbol epsilon_value = 306, bool use_reduce = true, Encoding encoding = Encoding::Latin1)

Creates an NFA from a regular expression using the RE2 parser.

The syntax of the supported regular expressions is described at https://github.com/google/re2/wiki/Syntax, with the following further limitations:

1) If you use UTF-8 encoding, the created NFA has transitions over byte values instead of full symbols. For example, the character Ā, whose Unicode code point is U+0100 and which is represented in UTF-8 as the two bytes c4 80, is encoded by two transitions: one over c4 followed by one over 80.
2) The created automaton represents the language of the regex and is not expected to be used for regex matching. Therefore, constructs such as ^, $, etc. are ignored in the regex.
