Implement Message and Interchange high-level containers #29

JocelynDelalande · 2020-09-08T17:44:38Z

This PR fixes #19

Limitations

Functional groups are not implemented, and I do not plan to implement them soon myself (their use is optional, and I have never used or seen them in the wild).
Syntax / consistency checks are kinda lousy; IMHO this is not the time to take them any further: that could happen as part of more general work on Segment specialization/validation (huge work ahead, if so).

TODO

(things I want to do before merging)

Accept optional segments in Message / Interchange headers
Decide on API breakage strategy (see below)

API breakage strategy discussion

Compared to what we discussed in #19, I have two tasks pending :

move from_file() to Interchange class
move UNA handling to Interchange class

I wanted to stop and discuss here because those changes are going to break the previous API. I see three options:

Option 1 : prioritize compatibility

Leave it as-is, more or less: the API is a bit less clean, but backward compatibility is ensured

Option 2 : prioritize API cleanness

Totally remove the SegmentCollection identifier (optionally replace it with something that throws an exception with a comprehensive message)
If needed: create a RawSegmentCollection that merely inherits from AbstractSegmentsContainer and keeps the possibility to parse arbitrary buckets of segments

Option 3 : progressive breakage

Publish a version following option 1, but add a deprecation warning when a SegmentCollection is instantiated
Publish a version (major one, because API breakage) following option 2
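The first step of Option 3 could look roughly like the sketch below. It is only an illustration of the deprecation-warning idea, not the code of this PR: the class bodies are simplified stand-ins and the warning text is made up.

```python
import warnings


class AbstractSegmentsContainer:
    """Simplified stand-in for the shared base class (assumption)."""

    def __init__(self):
        self.segments = []


class RawSegmentCollection(AbstractSegmentsContainer):
    """Replacement for loosely structured bunches of segments."""


class SegmentCollection(AbstractSegmentsContainer):
    """Old name, still usable during the transition release."""

    def __init__(self):
        # Warn at instantiation time; stacklevel=2 points at the caller's line.
        warnings.warn(
            "SegmentCollection is deprecated; use Interchange or "
            "RawSegmentCollection instead",
            DeprecationWarning,
            stacklevel=2,
        )
        super().__init__()
```

Existing code keeps working in that release and only sees a DeprecationWarning; the later breaking release can then drop the class entirely.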

WDYT? As of now, I myself have a preference for Option 3

Any comments on the PR are welcome, even if this is WIP (design and most of the code is here already).

nerdoc · 2020-09-08T18:13:34Z

First, without having seen the code: wow, I'm impressed. It's still hard for me to believe that anyone else already finds this code useful for "production" - because for me it's still in "eats your dog" mode. ;-)
But I'm more than happy that you seem to really dive into it and help me with things I barely have time for at the moment...
That's open source.

So, to the "options" - API stability is no issue ATM, as it really is meant as "pre-alpha". But, nevertheless, it would be fair to choose option 3, for anyone who already uses the low level API. And it's GoodProgrammingPractice anyway. So Option 3 should be the way to go.
I'll have a look at the code soon. Thanks for your valuable input.

nerdoc · 2020-09-08T18:26:45Z

One thing I see when I look at e.g. Message/Interchange: you use parameters of the __init__() function to initialize the created Interchange. But they are only the mandatory parameters of this segment; there are many more. We have to be extremely careful designing this API, as it will be hard to change later, once it has been stable for some time.
So, either it is necessary to implement ALL of the positions of a segment, including the optionals.
This maybe could be done by

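from typing import Tuple

# AbstractSegmentsContainer is the base class introduced in this PR.
class Message(AbstractSegmentsContainer):
    def __init__(
            self,
            reference_number: str,
            identifier: Tuple,
            *options  # optional header elements, in positional order
    ):
        ...  # body omitted in this sketch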

with gooooood documentation.
This way, optional elements could be added later without problems, and without breaking the API here.

And yes, there would be NO possibility whatsoever of type checking when using str, tuples etc.
I must think about that (your opinions?) - I wanted to implement all the type checking functionality first - but maybe that's a bunch of work, doesn't help much in the first place and can be done later too. Maybe pydifact is never going to be finished/stable if doing AllTheThingsCompletelyRightTheFirstTime(TM) ;-)

nerdoc · 2020-09-08T18:37:51Z

And I see you are using the SyntaxError Exception. I already created EDISyntaxError for this purpose - BUT - maybe it's not really a good name. And maybe Python's SyntaxError could be used for that too.

exception SyntaxError
Raised when the parser encounters a syntax error.

That is what the Python docs say - but I think we can use it here too.
So I will replace my EDISyntaxError with SyntaxError in one of the next commits.

JocelynDelalande · 2020-09-09T09:17:32Z

So I will replace my EDISyntaxError with SyntaxError in one of the next commits.

Or maybe just keep EDISyntaxError but make it inherit SyntaxError ?

nerdoc · 2020-09-10T07:34:06Z

Is there any recommendation to use SyntaxError only for Python syntax? I didn't find any. But naturally, I think using EDISyntaxError is the better way, so it's easier to distinguish. I'll make it inherit from SyntaxError.
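A minimal sketch of that idea, assuming nothing beyond the two exception names discussed here (parse_header is a made-up helper, only there to raise the error):

```python
class EDISyntaxError(SyntaxError):
    """Raised for EDIFACT syntax problems; still a SyntaxError for callers."""


def parse_header(tag: str) -> str:
    # Hypothetical helper: accepts only an interchange header tag.
    if tag != "UNB":
        raise EDISyntaxError(f"expected UNB, got {tag!r}")
    return tag
```

Callers can catch EDISyntaxError to handle EDIFACT problems specifically, while code that already catches the built-in SyntaxError keeps working, since the subclass is matched by the parent's except clause.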

JocelynDelalande · 2020-09-10T09:01:03Z

So, either it is necessary to implement ALL of the positions of a segment, including the optionals.
This maybe could be done by
class Message(AbstractSegmentsContainer):
    def __init__(
            self,
            reference_number: str,
            identifier: Tuple,
            *options
    ):
with gooooood documentation.

Yes, that is on my TODO for this PR :). But I would prefer to use the *options stared format but rather a tuple/list : if for one reason or another we have to add an argument, it would totally break existing code.

nerdoc · 2020-09-10T13:07:39Z

"I would prefer to use the *options stared format" - you mean "not use"?
But then, better use a dict than a tuple. So let's use **options. Then it is even more compatible.
I see no big problem in *options either, because the only problem would be if newer software uses an older pydifact lib which has not implemented "option x" - then it crashes. Using a tuple, or better a dict, it is easier to check whether a certain key is there. We have to check anyway, because blindly accessing a dict's key without knowing it is there does not work, and all these options are optional.

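from typing import Tuple

# AbstractSegmentsContainer is the base class introduced in this PR.
class Message(AbstractSegmentsContainer):
    def __init__(
            self,
            reference_number: str,
            identifier: Tuple,
            **options  # optional header elements, looked up by name
    ):
        ...  # body omitted in this sketch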
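To make the "check if a certain key is there" point concrete, here is a small self-contained sketch. It assumes a simplified Message without the real base class, and the option name used in the usage note is only an illustration, not an agreed key:

```python
from typing import Any, Dict, Tuple


class Message:
    """Simplified stand-in; the class in the PR derives from AbstractSegmentsContainer."""

    def __init__(self, reference_number: str, identifier: Tuple, **options: Any):
        self.reference_number = reference_number
        self.identifier = identifier
        # Every option is optional, so unknown or missing keys are tolerated.
        self.options: Dict[str, Any] = dict(options)

    def option(self, name: str, default: Any = None) -> Any:
        # dict.get avoids a KeyError when the caller never passed the option.
        return self.options.get(name, default)
```

With this shape, Message("42", ("ORDERS", "D", "96A", "UN"), common_access_reference="abc").option("common_access_reference") returns the stored value, while asking for an option that was never passed yields the default instead of crashing.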

JocelynDelalande · 2020-09-10T13:19:29Z

I'd prefer to allow adding arbitrary segment sections, rather than listing and handling all optional fields specifically. See my implementation in 5047490 WDYT ?

JocelynDelalande · 2020-09-10T13:20:15Z

(moreover, limiting the supported sections may force us to support all versions of the standard with their specificities, which is not a way I want to take as of now :-p).

nerdoc · 2020-09-10T13:33:33Z

Hm. I think you want to support the newest version (4). Due to my own requirements, and why I started pydifact, I need to support the v3. That's why I created the syntax directory with the v[1-3].py files... (they're stubs for now)
So yes, I'd like to support more than one version, at least keep that in mind...
What is not necessary is to support them within one program. So IMHO it should be possible to use from pydifact.syntax.v4 import XYZSegment and use v4 then. But we would then have to implement classes for the Segments of each version or, if they differ only slightly, put that class in syntax/common.py, handle the differences in the class and import it in the v3 and v4 namespaces.

JocelynDelalande · 2020-09-10T13:35:17Z

Hm. I think you want to support the newest version (4). Due to my own requirements, and why I started pydifact, I need to support the v3. That's why I created the syntax directory with the v[1-3].py files... (they're stubs for now)
So yes, I'd like to support more than one version, at least keep that in mind...
What is not necessary is to support them within one program. So IMHO it should be possible to use from pydifact.syntax.v4 import XYZSegment and use v4 then. But we would then have to implement classes for the Segments of each version or, if they differ only slightly, put that class in syntax/common.py, handle the differences in the class and import it in the v3 and v4 namespaces.

OK, thanks for the explanation, but within the scope of this PR, does my liberal implementation of optional elements suit you?

nerdoc · 2020-09-16T19:04:09Z

Oh, yes, sorry for the late answer. For this PR this is ok, if you don't mind if I change the API later...

JocelynDelalande · 2020-09-28T11:50:28Z

Oh, yes, sorry for the late answer. For this PR this is ok, if you don't mind if I change the API later...

Yes, sure. By the way, I'd be in favor of using semantic versioning : bump the major version when we break the API.

JocelynDelalande · 2020-09-28T13:35:28Z

"I would prefer to use the *options stared format" - you mean "not use"?

yes, sorry.

Supporting arbitrary segments using dicts seems a bit inconsistent to me: as I am not going to document the allowed parameters for now (and they may vary from one version of EDIFACT to another), telling users to pick arbitrary keys seems weird. So tuple for now, and we may switch to dict or named args later :)
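Roughly, the tuple/list approach amounts to something like the sketch below. It is a simplified illustration, not the PR's code: the parameter name extra_header_elements and the header_elements() helper are assumptions made for the example.

```python
from typing import List, Sequence, Tuple, Union

# An element is either a simple value or a composite (sequence of components).
Element = Union[str, Sequence[str]]


class Message:
    def __init__(
        self,
        reference_number: str,
        identifier: Tuple[str, ...],
        extra_header_elements: Sequence[Element] = (),
    ):
        self.reference_number = reference_number
        self.identifier = identifier
        self.extra_header_elements = list(extra_header_elements)

    def header_elements(self) -> List[Element]:
        # Mandatory elements first, then whatever the caller appended, unvalidated.
        return [
            self.reference_number,
            list(self.identifier),
            *self.extra_header_elements,
        ]
```

The extra elements are passed through in order without being named or checked, which is why no list of allowed keys has to be documented for each version of the standard.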

JocelynDelalande · 2020-09-28T14:24:26Z

Addressed remaining issues and filled in the changelog. I think it is good to go :)

nerdoc · 2020-09-29T05:09:10Z

Regarding semantic versioning - definitely. I just started with a 0.0 version as it's pre-alpha. So there is no major version until stable, and the API can change any time... But to address the changes we could use minor versions for "major" changes too until 1.0.0

JocelynDelalande · 2020-09-29T07:42:10Z

Regarding semantic versioning - definitely. I just started with a 0.0 version as it's pre-alpha. So there is no major version until stable, and the API can change any time... But to address the changes we could use minor versions for "major" changes too until 1.0.0

OK, not a big fan, but you are the maintainer, and I do not want to waste our time with endless discussion on the versioning scheme, so let's go for that :).

OK to target SegmentCollection removal for v1.0 as I wrote in that PR, or before ?

nerdoc · 2020-09-30T18:48:21Z

OK, not a big fan, but you are the maintainer, and I do not want to waste our time with endless discussion on the versioning scheme, so let's go for that :).

OK to target SegmentCollection removal for v1.0 as I wrote in that PR, or before ?

I maybe just misunderstood - I'm a BIG fan of SemVer. I just thought that UNTIL we reach 1.0.0 - API can change anyway. Because if not, I would have to change the major version with each API break - but the first break would then have to be 1.0.0 - which is not desirable - because 1.0.0 means stable.
See here, explaining:

Before publishing your first, useable version, you might find yourself incrementing the middle and the last digit to keep track of Alpha/Beta releases. Only once you’re ready for a proper, first release, should you start versioning from 1.0.0.

That's exactly what I want.
So, increase patch number with every release/change, and minor with greater breaks, UNTIL 1.0.0 - and then strictly semantic versioning...
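Spelled out as code, the rule could look like this. It is just a sketch of the scheme described above; the function name and the three change labels are made up for the illustration.

```python
from typing import Tuple

Version = Tuple[int, int, int]


def next_version(current: Version, change: str) -> Version:
    """Return the next version for a 'fix', 'feature' or 'breaking' change."""
    if change not in ("fix", "feature", "breaking"):
        raise ValueError(f"unknown change type: {change!r}")
    major, minor, patch = current
    if major == 0:
        # Before 1.0.0: breaks bump the minor number, anything else the patch.
        if change == "breaking":
            return (0, minor + 1, 0)
        return (0, minor, patch + 1)
    # From 1.0.0 on: strict semantic versioning.
    if change == "breaking":
        return (major + 1, 0, 0)
    if change == "feature":
        return (major, minor + 1, 0)
    return (major, minor, patch + 1)
```

Under this rule a breaking change on top of 0.1.0 gives 0.2.0 rather than 1.0.0, so 1.0.0 stays reserved for the first stable release.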

nerdoc · 2020-09-30T18:49:57Z

OK to target SegmentCollection removal for v1.0 as I wrote in that PR, or before ?

I would remove that before. No one should use that any more when 1.0 is near ;-)
Remember, this is (pre)alpha software ATM...

JocelynDelalande · 2020-10-01T08:59:57Z

OK for everything, thanks for taking the time to explain :). So if I got it correctly, I think my PR is ready with the latest push.

0.1 with introduction of Interchange/Message containers
0.2 with API breakage

Sounds good to you ?

JocelynDelalande · 2020-10-01T16:21:03Z

Fixed a few leftovers (nitpicking) and a bug

nerdoc · 2020-10-01T20:52:38Z

Yes, this sounds fine!

nerdoc · 2020-10-24T20:44:25Z

Sorry for the long delay. Looks all good to me. Thank you very much for your input.

JocelynDelalande · 2020-10-26T10:39:25Z

No prob, and thanks :-).

JocelynDelalande marked this pull request as draft September 8, 2020 17:52

JocelynDelalande force-pushed the jd-high-level-containers branch from a5a2e97 to 5047490 Compare September 10, 2020 09:56

nerdoc approved these changes Sep 16, 2020

JocelynDelalande marked this pull request as ready for review September 28, 2020 14:20

JocelynDelalande force-pushed the jd-high-level-containers branch from 5047490 to 934b4e6 Compare September 28, 2020 14:24

JocelynDelalande changed the title ~~WIP: Implement Message and Interchange high-level containers~~ Implement Message and Interchange high-level containers Sep 28, 2020

JocelynDelalande requested a review from nerdoc September 29, 2020 07:43

JocelynDelalande force-pushed the jd-high-level-containers branch from 698e923 to e79acae Compare October 1, 2020 08:57

JocelynDelalande force-pushed the jd-high-level-containers branch from b8acfe6 to 42929d9 Compare October 1, 2020 16:07

Implement Message and Interchange containers

dd61345

Fix nerdocs#19

JocelynDelalande added 11 commits October 1, 2020 18:17

Implement messages handling within interchange

9248a32

Fix nerdocs#19

Refactor interchange/message tests to use fixtures

a724256

Ref nerdocs#19

Support extra elements for header segment of message and interchange

abea42e

Fix nerdocs#19

Deprecate SegmentCollection gracefuly

be3e1d9

- plan its removal for v1.0 - move from_file() method and UNA parsing to Interchange class - create a replacement RawSegmentCollection class for use cases with lousy structured segment bunches. Ref nerdocs#19

Add changelog related to Message / Interchange changes

00ad3f5

Ref nerdocs#19

Document Interchange / Message / SegmentCollection stuff

a4dabee

Fix nerdocs#19

Add shortcuts to get version and type of message

a8183ee

Add loosy syntax check of UNB headers

621c2eb

To detect early some invalid UNB segments (as we rely on the presence of those fields then in the code).

Make EDISyntaxError son of SyntaxError

404a903

Make Message/Interchange use propper EDISyntaxError

87b7ef3

Fix crash on Interchanges with UNA header

eef1cb9

JocelynDelalande force-pushed the jd-high-level-containers branch from 42929d9 to eef1cb9 Compare October 1, 2020 16:18

nerdoc merged commit 0e6847b into nerdocs:master Oct 24, 2020

JocelynDelalande deleted the jd-high-level-containers branch October 26, 2020 10:39
