Parsley 4 Major Changes #98

j-mie6 · 2021-10-05T14:22:03Z

This PR consolidates the Parsley 4 major API changes ready for the eventual release of next release series.

The scope of Parsley 4's changes are massive: probably the largest shift in Parsley's history! The aims of this project have been the following:

User-Facing Changes

Major Changes

Major `parsley.token` Changes

Minor Changes

Patch Changes

Improved the implementation of persist
Made the <|> combinator more consistent when the (internal) JumpTable optimisation applied
General improvements to semantic preservation of optimisations on parsers
Error semantics adjusted to be consistent with Why does label act only on the first set of hints? mrkkrp/megaparsec#482
Better unicode support for error messages
The .unexpected, .!, .collectMsg, .guardAgainst, .filter, .collect, .filterMap, .filterOut, and .filterNot combinators now have amended semantics by default, and sets the caret width to be the thing that was parsed. (behaviour needs documentation)

Other Changes

Major documentation overhaul, with the entire documentation rewritten and standardised, with examples throughout. This should be much friendlier for newcomers to the library.
Changed the versioning policy to be consistent with Scala's SemVer policy: M.m.p represents binary back-compat, source back-compat and then patch. This should make the library far more stable in the wild, and the policy is enforced by CI.
More than 4 registers may be now used simultaneously

Internal Changes

Simplified Cont to not require a given result type at the operation level, this allows the same instance to be used in different places
Split the internal AST into two halves: a fully strict AST called StrictParsley and a partially lazy AST called LazyParsley. This divides the work of processing a tree into a frontend and a backend. This improves performance and allows for more advanced use of mutable structure in the backend. This is also much more maintainable.
Moved datastructures into parsley.internal.collections
AST normalisation is used, replacing old <|> and *>, <* nodes with Choice and Seq constructors, which can normalise the tree in linear time as opposed to polynomial time.
Instructions have been reworked:
- Many instructions have been renamed to have more meaningful names
- Stateful instructions have been removed (with the exception of CalleeSave, which is a special case)
- Call has been much more optimised, and GoSub has been removed as a result: in general the return mechanism is greatly simplified and improved
- Applied TCO more aggressively across parsers, since stateful instruction preservation is no longer a concern
- Most handlers have been split into two instructions to remove conditional statements in these instructions
- Several performance improvements to instructions and code generation across the board
The Context#status flag has been replaced by Context#good and Context#running, which allows for a tighter loop in Context#run(). In addition, Context#fail() can only be called when Context#good is already false: this avoids some redundant work being performed by failure handlers which re-fail.

* Removed redundant parameter in expression, whoops! * Fixed for scala 3... I think they have bugs with co-variance... * Modernised the design of the precedence, this new scheme works a bit nicer on Scala 3 * Updated documentation

* Removed strictness on main non-operator combinators * User API is fully strict now, internal is lazy * Fixed 2.12 * Unary AST-nodes strict * Amazing, looks like we've hit a Scala 3 bug... reverting * uncurried Binary * fully strict left on Binary

…apshot names are more faithful to the semver. Version names are checked before release

…, implemented getMax radix method.

* Broke out into infix and chain * Make zipped strict, and added mixed precedence

…e warnings

…t less cluttered

…ondition that residual input is non-empty

j-mie6 added the major This change would affect break backwards compatibility label Oct 5, 2021

j-mie6 force-pushed the parsley-4 branch from 25f4e49 to e8bda7a Compare November 26, 2021 15:39

j-mie6 added 24 commits January 24, 2022 20:11

Parsley 4 expression simplification (#97)

ff22dc0

* Removed redundant parameter in expression, whoops! * Fixed for scala 3... I think they have bugs with co-variance... * Modernised the design of the precedence, this new scheme works a bit nicer on Scala 3 * Updated documentation

Line length

25579b3

enforced deprecation

5cdd0c7

more enforced deprecation

0071bbf

Fixed tests

086ca97

Strict API (#100)

47952e6

* Removed strictness on main non-operator combinators * User API is fully strict now, internal is lazy * Fixed 2.12 * Unary AST-nodes strict * Amazing, looks like we've hit a Scala 3 bug... reverting * uncurried Binary * fully strict left on Binary

line lengths and wildcard import

cd65ed7

snapshot releases enabled for parsley 4

cff0b23

Renamed Ops to SmartOps

37c64dd

Corrected documentation

262dd49

Reset revision number on errors to Revision0, updated TestErrorBuilder

0b3eda9

Parsley 4 adheres to early semver, which is now checked in CI. The sn…

bd5b132

…apshot names are more faithful to the semver. Version names are checked before release

Fixed syntax error in CI [no release]

dcb86e5

Moved dynverAssertVersion only on relevant CI parts

c206169

Forced deprecation of BitSet, improved the EscapeToken by using Radix…

431f23c

…, implemented getMax radix method.

Restructured escape a little

a01f8a0

Added some comments

51d696e

Removed revision system, it isn't compatible with semver

0bab92a

Generalised getMax so it works for BufferedIterator

669518b

Simplified the iterator helpers

695616b

Added an Iterable to make 2.12 happy

9442421

Correctly supported a wider range of haskell caret escapes

9ec4047

Renamed anyChar to item

dc9d8a9

Fixed broken native tests

e2cde70

j-mie6 force-pushed the parsley-4 branch from 5b70714 to e2cde70 Compare January 24, 2022 20:23

Improved the deep-embedding code somewhat

cc65ec7

j-mie6 added this to the Parsley 4 milestone Feb 23, 2022

Reshuffled Chains (#111)

0181296

* Broke out into infix and chain * Make zipped strict, and added mixed precedence

j-mie6 added 28 commits November 21, 2022 23:53

Noted problem with UnexpectDesc width [skip ci]

fc28f6d

fail can now adjust the caret width

647533e

Fixed some documentation, hid non-released functions, fixed some styl…

5b15273

…e warnings

Some missing doc filled in [skip release]

9453616

More docs for names and space

d0298db

Moved ADT cases into parent object for Token and TokenSpan, it's a bi…

090a5b1

…t less cluttered

Documentation for token types

d8f7abf

token extractors fully documented; use require for preconditions

e1a3028

error doc complete, removed Token.EndOfInput, as it violates the prec…

b1cdada

…ondition that residual input is non-empty

Fixed mocked builder to have length max int

4fe95f2

generic bridge traits documented

4e94cd1

Predicate documented, some extra requirements on the descs

839c04b

Documented separators

e6de62c

Documented enclosing

b3ab04a

Space documentation done

8e14d43

Added texts for requirements of descriptions

82da81f

Extra testing for space in progress

0cda27e

Added missing EOF comment functionality

057bd40

Tests for mixed comment eof

c3ca7c1

Finished space tests

31ac39c

Name documentation complete

91985e5

Documentation for combined, and most of integer and real

4e8dd55

Improved improved unexpectedWhen

820fd69

Finished documentation, but no examples for reals

8a3e905

Updated readme [skip ci]

4686da8

Switched back to Iterable[Char] for the unexpectedToken API

c4833b1

removed indexedSeq from internals too

866aa26

Fixed scala 2.12 tests

33509cb

j-mie6 merged commit 51ccb79 into master Nov 30, 2022

j-mie6 deleted the parsley-4 branch November 30, 2022 20:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Parsley 4 Major Changes #98

Parsley 4 Major Changes #98

j-mie6 commented Oct 5, 2021 •

edited

Loading

Parsley 4 Major Changes #98

Parsley 4 Major Changes #98

Conversation

j-mie6 commented Oct 5, 2021 • edited Loading

User-Facing Changes

Major Changes

Major parsley.token Changes

Minor Changes

Patch Changes

Other Changes

Internal Changes

j-mie6 commented Oct 5, 2021 •

edited

Loading

Major `parsley.token` Changes