Hjson: quoteless strings shall not include a comma (#552) #553

cousteaulecommandant · 2024-07-13T10:06:03Z

Some hjson files in X-HEEP use a trailing comma after a "quoteless string", which isn't valid according to the standard (https://hjson.github.io/syntax.html):

> Do not add commas or comments as they would become part of the string.

This includes "barewords" such as `active: low,` but also (surprisingly) hexadecimal values such as `address: 0x10000000,` which are technically also interpreted as strings, not integer values. So if you add a comma there, the Hjson parser treats it as part of the string (this is not a bug in the parser; it's the correct interpretation of the file).
Commas are fine (but superfluous) if the string is quoted, and also after numbers, `true`, and `false`.
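
To make the rule concrete, here is a minimal Python sketch of how an unquoted single-line value gets classified. It is a simplified approximation written for illustration, not the Hjson reference parser: `classify_unquoted_value` is a hypothetical name, and comments, quoted strings and multi-line values are deliberately ignored.

```python
import re

# JSON number grammar. Hex literals such as 0x10000000 do not match it.
_JSON_NUMBER = re.compile(r"-?(0|[1-9]\d*)(\.\d+)?([eE][+-]?\d+)?")


def classify_unquoted_value(raw):
    """Approximate how an unquoted Hjson value is read (simplified sketch)."""
    text = raw.strip()
    # After a number/true/false/null, one trailing comma is just a separator.
    core = text[:-1].rstrip() if text.endswith(",") else text
    if core in ("true", "false", "null"):
        return ("literal", core)
    if _JSON_NUMBER.fullmatch(core):
        return ("number", core)
    # Everything else is a quoteless string that runs to the end of the line,
    # so a trailing comma stays inside the value.
    return ("string", text)


for value in ("low,", "0x10000000,", "42,", "true,"):
    print(value, "->", classify_unquoted_value(value))
```

With this sketch, `low,` and `0x10000000,` come back as strings that still contain the comma, while `42,` and `true,` are recognised as a number and a literal with the comma dropped.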

This was reported as #552, which this PR fixes.

This fix also deprecates certain workarounds in the code that were using .split(',') or .strip(',') to address this issue (but the issue was really that the file format was wrong, not that the parser failed).
I decided not to remove those fixes from mcu_gen.py just in case, but in principle they're no longer necessary if files are kept consistent with the Hjson format.
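
For context, a workaround of the kind mentioned above might look roughly like the following. This is a hypothetical illustration, not the actual code in mcu_gen.py; the helper name `to_int` is an assumption.

```python
def to_int(value):
    """Convert a loaded Hjson field to int, tolerating a stray trailing comma.

    Hypothetical helper: hex values arrive as strings, and with the old file
    style the string could also carry the comma (e.g. "0x10000000,").
    """
    if isinstance(value, int):
        return value
    # Strip whitespace and any trailing comma, then let base 0 detect "0x".
    return int(value.strip().rstrip(","), 0)


print(hex(to_int("0x10000000,")))
print(hex(to_int("0x10000000")))
```

Once the files follow the Hjson format, the `rstrip(",")` step has nothing left to remove, which is why such workarounds become unnecessary in principle.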

It would probably be a good idea to get rid of commas in hjson files entirely (including after numbers, quoted strings, or closing brackets/braces), since they are superfluous and the standard discourages their use, but I decided to leave that out of this PR as well.

davideschiavone · 2024-07-15T16:24:58Z

@davidmallasen , can you pls give it a look?

davidmallasen · 2024-07-16T08:23:48Z

@davideschiavone I completely agree with @cousteaulecommandant. In fact, I would suggest to also remove the commas after quoted strings, numbers, etc. to avoid any confusion and to keep things consistent.

I see that in the config folder this was already taken into account and there are only commas after quoted strings or blocks.

Leaving the .split(',') or .strip(',') in mcu_gen.py depends on how strict you want to be and how backwards compatible you want to be in other projects that end up using the latest version of X-HEEP.

The changes required in hw/vendor/lowrisc_opentitan/hw/ip/i2c/data/i2c_testplan.hjson could be an issue, but it should be easily solvable with a vendor patch. @cousteaulecommandant in this case the CI is failing because you are directly changing a file inside the hw/vendor/* folders, and this should only be done with patches. You can look in hw/vendor/patches for examples and https://opentitan.org/book/util/doc/vendor.html for the documentation.

cousteaulecommandant · 2024-07-17T14:01:11Z

> I would suggest to also remove the commas after quoted strings, numbers, etc. to avoid any confusion and to keep things consistent.

I was tempted to add that as well (on a second commit), but I felt that that could cause a lot of trouble and merge conflicts. But I could try to add that as well if you want.

> Leaving the .split(',') or .strip(',') in mcu_gen.py depends on how strict you want to be and how backwards compatible you want to be in other projects that end up using the latest version of X-HEEP.

Yes; I was afraid that removing that would break every single fork, derived project and ongoing work. It may be a good idea to remove it in the future though.

> The changes required in hw/vendor/lowrisc_opentitan/hw/ip/i2c/data/i2c_testplan.hjson could be an issue, but it should be easily solvable with a vendor patch. @cousteaulecommandant in this case the CI is failing because you are directly changing a file inside the hw/vendor/* folders, and this should only be done with patches.

I can exclude that file from the PR for now. It's already fixed in upstream OpenTitan anyway.

Or should I fix the patch as well?

davidmallasen · 2024-07-18T08:02:36Z

@davideschiavone I'll leave these decisions in your hands. My suggestion would be:

1. Remove all the commas in the hjsons to keep things consistent with the hjson specification and the python package we use.
2. Keep the current functionality in mcu-gen unless you know it's safe to remove it as well.
3. Leave the OpenTitan file as-is or patch/update it if we're removing 2.

davideschiavone · 2024-07-18T09:44:16Z

> @davideschiavone I'll leave these decisions in your hands. My suggestion would be:
>
> 1. Remove all the commas in the hjsons to keep things consistent with the hjson specification and the python package we use.
> 2. Keep the current functionality in mcu-gen unless you know it's safe to remove it as well.
> 3. Leave the OpenTitan file as-is or patch/update it if we're removing 2.

I do not have any personal preference or opinion, so I will leave to the top (still active) contributors the decision to take.

Please, @JoseCalero @JuanSapriza @simone-machetti @christophmuellerorg @davidmallasen @LuigiGiuffrida98 @StMiky @danivz
vote by reacting to this message with 👍 if you want to apply the change or 👎 if you do not want to apply those changes.

@benoitdenkinger , although not active any more, feel free to vote and I will take it into account

davideschiavone · 2024-07-18T10:15:43Z

> > @davideschiavone I'll leave these decisions in your hands. My suggestion would be:
> >
> > 1. Remove all the commas in the hjsons to keep things consistent with the hjson specification and the python package we use.
> > 2. Keep the current functionality in mcu-gen unless you know it's safe to remove it as well.
> > 3. Leave the OpenTitan file as-is or patch/update it if we're removing 2.
>
> I do not have any personal preference or opinion, so I will leave to the top (still active) contributors the decision to take.
>
> Please, @JoseCalero @JuanSapriza @simone-machetti @christophmuellerorg @davidmallasen @LuigiGiuffrida98 @StMiky @danivz vote by reacting to this message with 👍 if you want to apply the change or 👎 if you do not want to apply those changes.
>
> @benoitdenkinger , although not active any more, feel free to vote and I will take it into account

@davidmallasen @cousteaulecommandant , we clearly have a winner :) pls go on

davidmallasen · 2024-07-18T10:18:56Z

Thank you all. @cousteaulecommandant could you:

Remove all the commas in the hjsons to keep things consistent.
Exclude the OpenTitan file from the PR

davideschiavone · 2024-07-18T12:25:03Z

> Thank you all. @cousteaulecommandant could you:
>
> Remove all the commas in the hjsons to keep things consistent.
>
> Exclude the OpenTitan file from the PR

If we leave the commas in the OpenTitan file, I guess we also do not have to change the Python script.

davidmallasen · 2024-07-18T12:28:20Z

Exactly

Some hjson files in X-HEEP use trailing comma after a "quoteless string" which isn't valid in the standard (https://hjson.github.io/syntax.html):

> Do not add commas or comments as they would become part of the string.

This includes "barewords" such as `active: low,` but also (surprisingly) hexadecimal values such as `address: 0x10000000,` which is technically also interpreted as a string and not an integer value. (`42,`, `true,`, `false,` and `"string",` are OK though.)

This fixes issue esl-epfl#552. It also deprecates certain workarounds in the code that were using `.split(',')` or `.strip(',')` to address this issue (but the issue was really that the file format was wrong, not that the parser failed).


The Hjson format specification (https://hjson.github.io/syntax.html) also discourages the use of trailing commas at the end of a line, which are harmless (valid) but superfluous and confusing:

> You should omit optional commas to make your data more readable.

It has been decided that adopting this style will be less confusing and avoid possible repetitions of issue esl-epfl#552 in the future.

This commit removes all the trailing commas from hjson files, after a meticulous process to verify that none of these trailing commas had a specific meaning; such as in this hypothetical scenario:

```
description: '''This is the description's first line,
    which has a trailing comma.'''
```

cousteaulecommandant · 2024-07-20T01:10:40Z

OK, done. I removed all trailing commas from all Hjson files, being careful that none of these commas are actually part of a comment or a '''...''' string or similar.
Also updated the documentation accordingly.
(And excluded the vendorized OpenTitan files as requested.)

I think I did it right. I tried opening all affected hjson files and they at least load without errors using Python's hjson module.

Just in case I messed up some corner case, I left each "logical step" of the substitutions on a separate commit in case you want to check them one by one (before squashing everything together in a single commit).
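
A scan for remaining trailing commas could be sketched along these lines. This is an illustrative sketch under simplifying assumptions, not the procedure actually used for this PR: `find_trailing_commas` is a hypothetical name, only full-line `#` and `//` comments are recognised, and commas followed by a trailing comment or inside `/* ... */` blocks are not handled.

```python
def find_trailing_commas(text):
    """Return 1-based numbers of lines ending in a comma (simplified sketch).

    Lines inside '''...''' strings and full-line comments are skipped, since
    a comma there is part of the text rather than a separator.
    """
    hits = []
    in_multiline = False
    for lineno, line in enumerate(text.splitlines(), start=1):
        rest = line
        if in_multiline:
            if "'''" not in rest:
                continue  # still inside the multi-line string body
            # The string closes here; inspect only what follows the delimiter.
            rest = rest.split("'''", 1)[1]
            in_multiline = False
        if rest.count("'''") % 2 == 1:
            in_multiline = True  # a ''' string opens and continues below
            continue
        stripped = rest.strip()
        if stripped.startswith(("#", "//")):
            continue  # full-line comment
        if stripped.endswith(","):
            hits.append(lineno)
    return hits


sample = "\n".join([
    "{",
    "  active: low,",
    "  address: 0x10000000,",
    "  description: '''first line,",
    "    second line.''',",
    "  # commented: out,",
    "  size: 42",
    "}",
])
print(find_trailing_commas(sample))  # prints [2, 3, 5]
```

Line 4 of the sample ends in a comma that belongs to the `'''` string, so it is not reported, while the comma after the closing `'''` on line 5 is. A scan like this only complements loading each file with the parser; it does not replace that check.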

cousteaulecommandant · 2024-07-20T01:25:12Z

On second thought, I'm still unsure whether modifying this many files is a good idea with regard to other forks. If someone was working on a fork/branch that touched any hjson file, it's going to be merge conflicts everywhere.

At the very least, I think it would probably be a good idea to split this PR into the two main commits ("shall not include" and "should not include") rather than squashing everything into a single commit, so that people can choose not to merge the latter, or revert it before merging. And I think it would make sense, since the first fixes an actual issue whereas the second merely fixes coding style.

davidmallasen · 2024-07-22T09:02:22Z

Thanks @cousteaulecommandant. @davideschiavone I double-checked all the changes and if the CI passes I think it's good to merge.

The merge conflicts in forks of X-HEEP are almost inevitable, that's why Davide asked the main contributors. Since everyone was on board, and I believe only cherry-picking some commits when updating the base X-HEEP is not a maintainable practice, I don't think this should be an issue.

cousteaulecommandant force-pushed the bug552 branch 2 times, most recently from 3e57007 to ea91afc (July 13, 2024 19:38)

cousteaulecommandant mentioned this pull request Jul 13, 2024: Remove trailing commas in hjson files #552 (Open)

cousteaulecommandant force-pushed the bug552 branch from ea91afc to 68da510 (July 20, 2024 00:03)

cousteaulecommandant added 8 commits July 20, 2024 02:34

- Remove trailing commas in documentation (a979122)
- Remove trailing commas followed by comments (4d00531)
- Remove trailing commas in `key: value,` cases (741c441)
- Remove trailing commas in `{ key: value,` (79c9bc2)
- Remove trailing commas in `},` and `],` (3988402)
- Remove trailing commas in `''',` (6ecb034)
- Remove trailing commas in other situations (577e756)
  - Followed by spaces
  - There's a commented-out line that has a trailing comma as well
