Added a simple lexer. #4

skx · 2020-06-15T05:07:20Z

This pull-request, once complete, will replace our inline parsing with a standalone brainfuck lexer.

The lexer will produce a stream of tokens, each carrying a count of how many times that token was repeated consecutively.

So given an input string of "<<<<<>>>>>" we'll expect to see:

{Type: "<", Repeat: 5}
{Type: ">", Repeat: 5}
{Type: EOF, Repeat: 1}
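As a rough illustration of the collapsing described above, here is a minimal sketch in Go. The `Token` field names follow the example output; the `Tokenize` function and the `EOF` constant are hypothetical names chosen for this sketch and are not taken from the pull-request itself. This version does not yet skip unknown characters.

```go
package main

import "fmt"

// Token mirrors the shape shown in the example: an instruction and its run length.
type Token struct {
	Type   string
	Repeat int
}

// EOF is a hypothetical marker for the end of the input.
const EOF = "EOF"

// Tokenize collapses each run of identical characters into a single token
// and appends a final EOF token.
func Tokenize(input string) []Token {
	var tokens []Token
	for i := 0; i < len(input); {
		// Advance j to the end of the run that starts at i.
		j := i
		for j < len(input) && input[j] == input[i] {
			j++
		}
		tokens = append(tokens, Token{Type: string(input[i]), Repeat: j - i})
		i = j
	}
	return append(tokens, Token{Type: EOF, Repeat: 1})
}

func main() {
	for _, tok := range Tokenize("<<<<<>>>>>") {
		fmt.Printf("{Type: %q, Repeat: %d}\n", tok.Type, tok.Repeat)
	}
}
```

Storing the run length on the token means a later compilation stage can handle a whole run in one step instead of visiting each character.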

Trivial test cases are included, though they are not exhaustive. This will close #3.

This lexer will parse programs, skipping unknown characters, and collapsing repeated occurrences of identical tokens. The result will be a stream of values which can be compiled.

I've updated the lexer to skip newlines and unknown characters, and to stop collapsing repeated loop opens/closes, since we need to count those separately. Our code generation now uses the lexer.

Oftentimes brainfuck programs are formatted in a "readable" fashion with newlines, and spaces. If we have an input-program that looks like this:

---------------------------------
---------------------------------
+++----

We'd parse that naively as:

{Type: "-", Repeat: X}
{Type: "-", Repeat: X}
{Type: "+", Repeat: X}
{Type: "-", Repeat: X}

i.e. The first two lines would count as two tokens. If we remove the newlines before we process the input we instead get:

{Type: "-", Repeat: X}
{Type: "+", Repeat: X}
{Type: "-", Repeat: X}
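The behaviour described in these commit messages could be sketched as follows: drop every character that is not a brainfuck instruction before tokenizing, collapse runs of `+`, `-`, `<` and `>`, and always emit loop and input/output instructions with a repeat count of one. This is only a sketch under those assumptions; `Lex` and `EOF` are hypothetical names, and the actual lexer in the pull-request may be structured quite differently.

```go
package main

import (
	"fmt"
	"strings"
)

// Token is an instruction together with its run length.
type Token struct {
	Type   string
	Repeat int
}

// EOF is a hypothetical marker for the end of the input.
const EOF = "EOF"

// Lex filters out non-instruction bytes, then collapses runs of + - < >.
// The instructions [ ] . , are never collapsed.
func Lex(input string) []Token {
	// Keep only the eight instruction characters, so that newlines,
	// spaces and comments cannot split a run.
	var b strings.Builder
	for i := 0; i < len(input); i++ {
		if strings.IndexByte("+-<>[].,", input[i]) >= 0 {
			b.WriteByte(input[i])
		}
	}
	src := b.String()

	var tokens []Token
	for i := 0; i < len(src); {
		j := i + 1
		// Only the four arithmetic/pointer instructions may be merged.
		if strings.IndexByte("+-<>", src[i]) >= 0 {
			for j < len(src) && src[j] == src[i] {
				j++
			}
		}
		tokens = append(tokens, Token{Type: string(src[i]), Repeat: j - i})
		i = j
	}
	return append(tokens, Token{Type: EOF, Repeat: 1})
}

func main() {
	program := "-----\n-----\n+++----\n[[..]]"
	for _, tok := range Lex(program) {
		fmt.Printf("{Type: %q, Repeat: %d}\n", tok.Type, tok.Repeat)
	}
}
```

With this input the two lines of `-` merge into a single token with a repeat count of ten, while each `[`, `]` and `.` stays a separate token.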

skx added 4 commits June 15, 2020 08:04

Added a simple lexer.

d935893

Updated to use our lexer.

5f8de77

We can't repeat INPUT/OUTPUT instructions either

c387b3b

skx merged commit e3bc7f8 into master Jun 15, 2020

skx deleted the 3-lexer branch June 15, 2020 07:18
