eng: fix bug in type generation for nullable/union props BNCH-111776 #219

eli-bl · 2024-11-15T23:22:57Z

This is equivalent to my upstream PR openapi-generators#1121. I described the symptoms in detail in openapi-generators#1120; in short, the problem occurred with certain patterns of using "oneOf" or "nullable" with an enum, or with an anonymous object schema declared inline. The generator would add spurious suffixes like "Type1" to the class name, and/or generate extra duplicates of the class.

Why we need this fix: we have patterns like this in our API. We fixed the bug in a different, no-longer-applicable way in our old fork.

The main difference between this and my upstream PR is that I was able to use the new functional test framework to verify that the right classes are generated in various slightly different permutations of the problem scenarios.

eli-bl · 2024-11-15T23:24:04Z

end_to_end_tests/functional_tests/generated_code_execution/test_enums_and_consts.py

-      type: ["string", "null"]
-      enum: ["a", "b", null]
-    MyNullOnlyEnum:
+    EnumOfNullOnly:


The changes in this file are just to cut out some test cases that I moved into test_unions.py, because ultimately the issue isn't about the enum itself, it's about the union type (created either explicitly with oneOf, or implicitly by adding "null" as an allowable type).

eli-bl · 2024-11-15T23:26:31Z

end_to_end_tests/functional_tests/generated_code_execution/test_enums_and_consts.py

-class TestNullableEnums:
-    def test_nullable_enum_prop(self, MyModel, MyEnum, MyEnumIncludingNullType1):
-        # Note, MyEnumIncludingNullType1 should be named just MyEnumIncludingNull -
-        # known bug: https://github.com/openapi-generators/openapi-python-client/issues/1120


This was one of the cases affected by the bug. In the new version of this test that's in test_unions.py, the model class name is correctly rendered as MyEnumIncludingNull.

eli-bl · 2024-11-15T23:28:41Z

end_to_end_tests/generated_client.py

+            assert False, (
+                f"Couldn't find import \"{name}\" in \"{self.base_module}{module_path}\"."
+                f" Available imports in that module are: {existing}"
+            )


Just some nicer failure output. In a case like the one I mentioned above, where the generator has created an incorrect type name, if you try to import NameOfThing you'll see something like:

AssertionError: Couldn't find import "NameOfThing" in "testapi_client.models". Available imports in that module are: NameOfThingType0, SomeOtherType, [etc.]
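
For readers who want to see the shape of that check, here is a minimal sketch of an import helper that fails with the same kind of message. The function name `import_name` and its parameters are assumptions made for illustration; the real helper in generated_client.py is a method that uses `self.base_module` and `module_path`, and may differ in detail.

```python
import importlib
from typing import Any


def import_name(module_path: str, name: str) -> Any:
    """Return attribute `name` from `module_path` (hypothetical helper).

    On failure, list the names the module does export, so that a
    misnamed generated class (e.g. NameOfThingType0) is easy to spot.
    """
    module = importlib.import_module(module_path)
    try:
        return getattr(module, name)
    except AttributeError:
        # Public names only; dunder and private names would be noise here.
        existing = ", ".join(n for n in dir(module) if not n.startswith("_"))
        raise AssertionError(
            f'Couldn\'t find import "{name}" in "{module_path}".'
            f" Available imports in that module are: {existing}"
        ) from None
```

Listing the available names turns a bare import failure into a direct hint about which suffix the generator added.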

eli-bl · 2024-11-15T23:30:00Z

end_to_end_tests/golden-record/my_test_api_client/models/__init__.py

@@ -52,7 +52,7 @@
 )
 from .model_with_additional_properties_refed import ModelWithAdditionalPropertiesRefed
 from .model_with_any_json_properties import ModelWithAnyJsonProperties
-from .model_with_any_json_properties_additional_property_type_0 import ModelWithAnyJsonPropertiesAdditionalPropertyType0
+from .model_with_any_json_properties_additional_property import ModelWithAnyJsonPropertiesAdditionalProperty


An example of the effect that the bug was having on the reference code in the "golden record"-based tests, causing "Type0" and "_type_0" suffixes to be added for no good reason.
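
The suffix shows up twice in that diff because the module file name is derived from the class name. A rough sketch of such a derivation follows; `snake_case` is a hypothetical stand-in written for this illustration, not the generator's own naming utility.

```python
import re


def snake_case(class_name: str) -> str:
    """Convert a PascalCase class name into a snake_case module name."""
    # Break before each capital that follows a lowercase letter or digit.
    with_word_breaks = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", class_name)
    # Break between a letter and a digit, so "Type0" becomes "Type_0".
    with_digit_breaks = re.sub(r"(?<=[A-Za-z])(?=[0-9])", "_", with_word_breaks)
    return with_digit_breaks.lower()
```

Under this sketch, a spurious "Type0" on the class name also becomes a spurious "_type_0" on the module name, which matches the pair of names in the diff above.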

eli-bl · 2024-11-15T23:36:30Z

openapi_python_client/parser/properties/union.py

@@ -80,22 +80,44 @@ def build(
        sub_properties: list[PropertyProtocol] = []

        type_list_data = []
-        if isinstance(data.type, list):
+        if isinstance(data.type, list) and not (data.anyOf or data.oneOf):


This is for a simple case where the schema is just like:

    MyType:
      type: ["string", "integer"]

That's equivalent in meaning to anyOf: [{type: string}, {type: integer}], so on lines 84-85 we convert it into those explicit variant types. However, if the schema redundantly has both an anyOf or oneOf and a multi-valued type:, we do not want to do that, because the variants in the anyOf/oneOf list already fully describe the possible types. For instance:

    MyType:
      type: ["string", "integer"]
      oneOf:
        - type: string
          format: date-time
        - type: integer
          default: 3

In that example, prior to my fix, the union would've ended up having four variants instead of two.
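
The effect of the added condition can be sketched on plain dictionaries. This is a simplified illustration, not the code in union.py: `union_variants` and its `guard` flag are names invented here, and the real build function works on parsed schema objects rather than dicts.

```python
from typing import Any

Schema = dict[str, Any]


def union_variants(schema: Schema, *, guard: bool = True) -> list[Schema]:
    """Collect the variant schemas of a union (hypothetical helper).

    With guard=True, a multi-valued `type` is expanded into variants only
    when there is no explicit anyOf/oneOf list, mirroring the new condition.
    With guard=False, it is always expanded, mirroring the old behavior.
    """
    explicit = list(schema.get("anyOf", [])) + list(schema.get("oneOf", []))
    variants: list[Schema] = []
    type_value = schema.get("type")
    if isinstance(type_value, list) and not (guard and explicit):
        # type: ["string", "integer"] -> [{type: string}, {type: integer}]
        variants.extend({"type": t} for t in type_value)
    variants.extend(explicit)
    return variants


redundant = {
    "type": ["string", "integer"],
    "oneOf": [
        {"type": "string", "format": "date-time"},
        {"type": "integer", "default": 3},
    ],
}
before_fix = union_variants(redundant, guard=False)  # four variants
after_fix = union_variants(redundant)  # two variants
```

A schema that has only the multi-valued `type` still expands to one variant per listed type in both modes.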

damola-benchling

The fix generally looks good at a high level, but I did struggle a bit to wrap my head around the implementation. If we can simplify it, that would be great; at the least, I think the comment should include an illustrative example.

damola-benchling · 2024-12-06T17:58:13Z

openapi_python_client/parser/properties/union.py

+            if (not use_original_name_for) and len(schemas_with_classes) == 1:
+                # An example of this scenario is a oneOf where one of the variants is an inline enum or
+                # model, and the other is a simple value like null. If the name of the union property is
+                # "foo" then it's desirable for the enum or model class to be named "Foo", not "FooType1".
+                # So, we'll do a second pass where we tell ourselves to use the original property name
+                # for that item instead of "{name}_type_{i}".
+                # This only makes a functional difference if the variant was an inline schema, because
+                # we wouldn't be generating a class otherwise, but even if it wasn't inline this will
+                # save on pointlessly long variable names inside from_dict/to_dict.
+                return process_items(use_original_name_for=schemas_with_classes[0])


It took me a minute to really understand the logic of the two-pass loop in process_items.

In addition to (or in place of) this comment block here, I think an example-based description of what this code is doing might be better.

Also, would it be possible to use some sort of lookup to track the schema, and then update the single named schema at the end of the loop if only one named schema was encountered? 🤔
I'm not sure that would necessarily be easier or shorter to read than the current implementation, but I think the two-pass loop is on the less-trivial end of the scale.
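
As one possible example-based illustration of the two-pass idea, here is a simplified sketch that only computes class names. The helpers `needs_class` and `to_class_name` and the wrapper `name_variants` are assumptions made for this example; the real process_items builds property objects, and appears to pass a schema rather than an index as `use_original_name_for`.

```python
from typing import Any, Optional

Schema = dict[str, Any]


def needs_class(schema: Schema) -> bool:
    """Hypothetical rule: enums and objects get a generated class."""
    return "enum" in schema or schema.get("type") == "object"


def to_class_name(snake: str) -> str:
    """foo_type_1 -> FooType1 (hypothetical naming helper)."""
    return "".join(part.capitalize() for part in snake.split("_"))


def name_variants(prop_name: str, variants: list[Schema]) -> list[Optional[str]]:
    """Return a class name per variant, or None if it needs no class."""

    def process_items(use_original_name_for: Optional[int] = None) -> list[Optional[str]]:
        names: list[Optional[str]] = []
        with_classes: list[int] = []
        for i, variant in enumerate(variants):
            if not needs_class(variant):
                names.append(None)
                continue
            with_classes.append(i)
            raw = prop_name if i == use_original_name_for else f"{prop_name}_type_{i}"
            names.append(to_class_name(raw))
        # Compare against None explicitly: index 0 is falsy, and a plain
        # truthiness test would trigger the second pass forever.
        if use_original_name_for is None and len(with_classes) == 1:
            # Exactly one variant produces a class, so redo the work and
            # let that variant keep the plain property name.
            return process_items(use_original_name_for=with_classes[0])
        return names

    return process_items()


nullable_enum = [{"type": "null"}, {"type": "string", "enum": ["a", "b"]}]
two_classes = [{"type": "string", "enum": ["a"]}, {"type": "object"}]
```

With these inputs, `name_variants("foo", nullable_enum)` names the enum "Foo", while `name_variants("foo", two_classes)` keeps the "FooType0" and "FooType1" suffixes because the names would otherwise collide.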

> Also, would it be possible to use some sort of lookup to track the schema, and then the single name schema gets updated at the end of the loop if only one named schema was encountered

Unfortunately no, that's not possible. I mentioned the reason for this when we talked, but I should clarify it in the comment too: there are maps in schemas that already contain references to the processed classes by name, so we can't just modify one object in place.

ahhh yes, I remember you mentioning it, but I think I hadn't grokked the code enough to fully understand.
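
The constraint discussed in this thread can be sketched with a toy registry keyed by class name. `ClassInfo` and `Registry` are hypothetical stand-ins for the generator's schema maps, which are more involved than this.

```python
from dataclasses import dataclass, field


@dataclass
class ClassInfo:
    name: str


@dataclass
class Registry:
    """Toy stand-in for a map of generated classes keyed by name."""

    classes_by_name: dict[str, ClassInfo] = field(default_factory=dict)

    def register(self, info: ClassInfo) -> None:
        self.classes_by_name[info.name] = info


# First pass: the class is registered under its suffixed name.
registry = Registry()
info = ClassInfo("FooType1")
registry.register(info)

# Renaming the object in place does not update the key it was stored under.
info.name = "Foo"
stale_key_remains = "FooType1" in registry.classes_by_name
new_key_missing = "Foo" not in registry.classes_by_name

# Redoing the work against a fresh registry avoids the stale entry.
rebuilt = Registry()
rebuilt.register(ClassInfo("Foo"))
```

In this toy model, the in-place rename leaves the map and the object disagreeing about the name, which is the kind of inconsistency a second pass from the original state avoids.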

fix bug in type generation for nullable props BNCH-111776

9056e92

eli-bl force-pushed the eli.bishop/BNCH-111776-nullables branch from 40702a8 to 8059d24 Compare November 15, 2024 23:24

functional tests for union type fix

b92fed6

eli-bl force-pushed the eli.bishop/BNCH-111776-nullables branch from 8059d24 to b92fed6 Compare November 15, 2024 23:41

misc fixes

f4d3735

eli-bl force-pushed the eli.bishop/BNCH-111776-nullables branch from c4dda2c to f4d3735 Compare November 18, 2024 19:47

eli-bl marked this pull request as ready for review November 18, 2024 19:54

eli-bl requested a review from damola-benchling December 3, 2024 18:51

damola-benchling reviewed Dec 6, 2024

eli-bl added 3 commits December 6, 2024 10:30

Merge branch 'prod/2.x' into eli.bishop/BNCH-111776-nullables

1d76bca

clarify nullable special-case logic

668ea04

further clarification

a5ed889

eli-bl requested a review from damola-benchling December 6, 2024 20:11

damola-benchling approved these changes Dec 6, 2024

eli-bl changed the base branch from 2.x to prod/2.x December 6, 2024 23:40

eli-bl merged commit 853eb9c into prod/2.x Dec 6, 2024

eli-bl deleted the eli.bishop/BNCH-111776-nullables branch December 6, 2024 23:44
