TinyPG doesn't properly parse keyword based grammar #20

jrleek · 2015-05-22T08:55:30Z

I could be wrong, but I'm pretty sure this is a bug in TinyPG. It isn't able to properly parse this grammar:

EOF -> @"^\s*$";
[Skip] WHITESPACE -> @"\s+";
LIST -> "LIST";
END -> "END";
IDENTIFIER -> @"[a-zA-Z_][a-zA-Z0-9_]*";
Expr -> LIST IDENTIFIER+ END;
Start -> (Expr)+ EOF;

The resulting parser cannot parse this:

LIST foo BAR Baz END

because it greedily lexes END as an IDENTIFIER, instead of properly as the END keyword.
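
To make the ambiguity concrete, here is a small Python sketch (not TinyPG code; `matching_tokens` is a hypothetical helper) that lists which token patterns from the grammar accept each word of the input. END is accepted by both the END and the IDENTIFIER pattern, so the outcome depends entirely on which patterns the scanner is asked to try.

```python
import re

# Token patterns following the grammar above, in declaration order.
TOKENS = [
    ("LIST", re.compile(r"LIST")),
    ("END", re.compile(r"END")),
    ("IDENTIFIER", re.compile(r"[a-zA-Z_][a-zA-Z0-9_]*")),
]


def matching_tokens(word):
    """Return the names of all token patterns that match the whole word."""
    return [kind for kind, pattern in TOKENS if pattern.fullmatch(word)]


for word in "LIST foo BAR Baz END".split():
    print(word, matching_tokens(word))
```

The keywords are the only words with more than one candidate, which is why they are the ones that get mis-lexed.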

Theoistic · 2016-08-16T15:33:23Z

Here is an example from the Simple-CIL-compiler project.
The identifier has to match single words except the ones listed, which means you have to build the excluded keyword tokens into the identifier pattern itself.
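
One common way to build the exclusions into the pattern is a negative lookahead. The sketch below uses Python's `re` module to show the idea; it is not necessarily the exact pattern used in Simple-CIL-compiler, and the keyword list (LIST, END) is taken from the grammar in this issue. The .NET regex engine also supports `(?!...)` and `\b`, so a similar pattern should be usable in a TinyPG terminal, but that is an assumption worth testing.

```python
import re

# IDENTIFIER that refuses to match when the whole word is a reserved keyword.
# (?!(?:LIST|END)\b) fails only if a keyword is followed by a word boundary,
# so longer names such as ENDING or LIST_1 are still identifiers.
IDENTIFIER = re.compile(r"(?!(?:LIST|END)\b)[a-zA-Z_][a-zA-Z0-9_]*")


def is_identifier(word):
    """True if the whole word is accepted by the keyword-excluding pattern."""
    return IDENTIFIER.fullmatch(word) is not None


for word in ["foo", "BAR", "END", "LIST", "ENDING", "LIST_1"]:
    print(word, is_identifier(word))
```

The drawback is that every new keyword has to be added to the identifier pattern by hand.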

Hope that helps.

ultrasuperpingu · 2017-01-25T17:53:24Z

First of all, sorry for my English, which is far from perfect. I hope this post is still understandable.

I had the same issue. Because I wasn't trying to match a particular grammar, I just modified mine as a workaround (here, I would simply replace the LIST and END tokens with something like [ and ]). But I looked in the code to see why this wasn't working. It is because of the "Partial Context Sensitive/Ambiguous Grammars" feature (take a look at the documentation here). The parser asks the scanner to look ahead only for the tokens it expects, but if the rule has OneOrMultiple cardinality (+), the expected token list does not contain the first terminal tokens of the rule that follows. That means that, in this example, while parsing the IDENTIFIER+ rule, the lookahead only looks for IDENTIFIER tokens, matches END as an identifier, and consumes it.

The solution here would be to also pass the first terminals of the following rule(s?) as expected tokens. I will give it a try and, if it works, propose it as a pull request.
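
Here is a rough Python sketch of the behaviour described above and of the proposed fix. It is not TinyPG's code: `look_ahead`, `parse_expr` and the `include_follow` flag are hypothetical names, and the matching rule (longest match wins, ties go to the token declared first) is an assumption about how the scanner chooses among expected tokens.

```python
import re

# Token patterns in declaration order, following the grammar in this issue.
TOKENS = [
    ("LIST", re.compile(r"LIST")),
    ("END", re.compile(r"END")),
    ("IDENTIFIER", re.compile(r"[a-zA-Z_][a-zA-Z0-9_]*")),
]
WHITESPACE = re.compile(r"\s+")


def look_ahead(text, pos, expected):
    """Return (kind, lexeme, new_pos) for the best match among `expected`, or None.

    Assumption: longest match wins; on a tie the token declared first wins.
    """
    ws = WHITESPACE.match(text, pos)
    if ws:
        pos = ws.end()
    best = None
    for kind, pattern in TOKENS:
        if kind not in expected:
            continue
        m = pattern.match(text, pos)
        if m and (best is None or len(m.group()) > len(best[1])):
            best = (kind, m.group(), m.end())
    return best


def parse_expr(text, include_follow):
    """Parse `LIST IDENTIFIER+ END`; return the identifiers or raise ValueError."""
    tok = look_ahead(text, 0, {"LIST"})
    if tok is None:
        raise ValueError("expected LIST")
    pos = tok[2]
    # Current behaviour: only IDENTIFIER is expected inside the + loop.
    # Proposed fix: also expect the first terminal of what follows (END).
    loop_expected = {"IDENTIFIER", "END"} if include_follow else {"IDENTIFIER"}
    names = []
    while True:
        tok = look_ahead(text, pos, loop_expected)
        if tok is None or tok[0] != "IDENTIFIER":
            break
        names.append(tok[1])
        pos = tok[2]
    if not names:
        raise ValueError("expected IDENTIFIER")
    if look_ahead(text, pos, {"END"}) is None:
        raise ValueError("expected END")
    return names


print(parse_expr("LIST foo BAR Baz END", include_follow=True))
```

With `include_follow=False` the loop swallows END as a fourth identifier and the parse fails with "expected END"; with `include_follow=True` the scanner sees END as a keyword and the loop stops in time.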

Edit: Pull request submitted.

ultrasuperpingu mentioned this issue Jan 27, 2017

Fix for Issue # 20 #22

Closed
