Use bulk actions #28

dustinblack · 2023-12-19T17:49:36Z

Changes introduced with this PR

Redesigning the main operational construct to use bulk actions instead of discrete indexing operations.

By contributing to this repository, I agree to the contribution guidelines.

Automatic upate of README.md by arcaflow-docsgen arcabot fix http authentication

add tests for conversion functions linting pass through dicts in list conversion function

enable serialization tests schema tweaks implement generator for ingesting the bulk list correct error capture output cleanup update integration tests changing bulkuploadobject operation dict key temporarily to a string linting Automatic upate of README.md by arcaflow-docsgen arcabot only append metadata if it exists enable multi-arch build add plugin step to pre-process a list of data into a bulk_upload_list Automatic upate of README.md by arcaflow-docsgen arcabot add step definition to docker-compose action for build

jaredoconnell

Untested, but other than one minor comment, looks good.

jaredoconnell · 2024-02-09T19:17:30Z

arcaflow_plugin_opensearch/opensearch_schema.py

+        typing.Optional[typing.Dict[str, typing.Any]],
+        schema.name("metadata"),
+        schema.description(
+            "Optional global metadata object that will be added " "to every document."


Looks like the linter removed the code wrap.

Should there be a way to name the metadata first-level key instead of just hardcoding "metadata"? For example, {"@metadata" {...}}? You could do this easily either by merging your value here directly into the document with doc.update(params.metadata) or with a separate metadata_key parameter which could even default to "metadata" and doc[params.metadata_key] = params.metadata.

@dbutenhof I guess I'm not following. The schema.name is part of the self-documenting feature of the plugin. Often the name simply matches the key, but not always.

dbutenhof · 2024-02-09T19:52:19Z

arcaflow_plugin_opensearch/opensearch_schema.py

+        typing.Optional[typing.Dict[str, typing.Any]],
+        schema.name("metadata"),
+        schema.description(
+            "Optional global metadata object that will be added " "to every document."


Should there be a way to name the metadata first-level key instead of just hardcoding "metadata"? For example, {"@metadata" {...}}? You could do this easily either by merging your value here directly into the document with doc.update(params.metadata) or with a separate metadata_key parameter which could even default to "metadata" and doc[params.metadata_key] = params.metadata.

webbnh

Apparently I started this review yesterday and then "lost" it, so I'm posting what I have so far, and perhaps I'll get back to it on Monday (if it is still open, then).

webbnh · 2024-02-08T22:44:56Z

arcaflow_plugin_opensearch/opensearch_plugin.py

+    bulk_upload_list = []
+    for item in params.data_list:
+        bulk_upload_list.append(BulkUploadObject(params.operation, item))


Alternatively,

bulk_upload_list = [BulkUploadObject(params.operation, i) for i in params.data_list]

For me the longer way is a bit more readable. Is there an advantage here beyond tidiness?

There are two:

doing it all in one line (provided that the line does not wrap) means that more lines fit on a screen which makes it easier for the reader to comprehend the code (i.e., the reader should be able to move his/her eyes and see the whole functional unit without having to move the scroll bar); and,

the "list comprehension" is scoped and bounded, such that none of the variables escape the expression and none of the operations have side-effects (unless they are a call to a function whose purpose is to have side-effects...), which means that the reader can safely consider the comprehension to be a "black box" if that suits his/her purposes.

arcaflow_plugin_opensearch/opensearch_plugin.py

webbnh · 2024-02-08T23:14:11Z

arcaflow_plugin_opensearch/opensearch_plugin.py

+        ids = []
+        for i in resp["items"]:
+            ids.append(list(i.values())[0]["_id"])


Alternately,

ids = [next(iter(i.values()))["_id"] for i in resp["items"]]

dustinblack force-pushed the use-bulk-actions branch 2 times, most recently from 25d5dc5 to 70b5963 Compare December 20, 2023 17:47

dustinblack force-pushed the use-bulk-actions branch from 4f567b9 to b9857e4 Compare February 8, 2024 16:34

dustinblack added 3 commits February 8, 2024 17:37

add ability to disable tls verification

598f1aa

Automatic upate of README.md by arcaflow-docsgen arcabot fix http authentication

ensure lists are converted to Opensearch-compatible formats

f135e5d

add tests for conversion functions linting pass through dicts in list conversion function

dustinblack force-pushed the use-bulk-actions branch from b9857e4 to 703b9b0 Compare February 8, 2024 16:37

dustinblack marked this pull request as ready for review February 8, 2024 16:38

dustinblack requested a review from a team February 8, 2024 16:38

jaredoconnell approved these changes Feb 9, 2024

View reviewed changes

dbutenhof approved these changes Feb 9, 2024

View reviewed changes

webbnh reviewed Feb 10, 2024

View reviewed changes

cleanup from PR feedback

c5eedf9

dustinblack merged commit 8500053 into main Feb 13, 2024
3 checks passed

webbnh deleted the use-bulk-actions branch August 26, 2024 20:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use bulk actions #28

Use bulk actions #28

dustinblack commented Dec 19, 2023

jaredoconnell left a comment

jaredoconnell Feb 9, 2024

dbutenhof Feb 9, 2024

dustinblack Feb 13, 2024

dbutenhof Feb 9, 2024

webbnh left a comment

webbnh Feb 8, 2024

dustinblack Feb 13, 2024

webbnh Feb 13, 2024

webbnh Feb 8, 2024

Use bulk actions #28

Use bulk actions #28

Conversation

dustinblack commented Dec 19, 2023

Changes introduced with this PR

jaredoconnell left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

webbnh left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment