New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

#305 getAncestors Database functionality. #311

Open

ABLL526 wants to merge 1 commit into master from 305-Get-Ancestors-Database

ABLL526 commented Jan 21, 2025

This PR is strictly for the getAncestors DataBase Functionality.
Any feedback is welcome.


          Added the getAncestors Database functionality.

bbba4ae

1. Made the necessary changes as mentioned by the team.
3. Made the necessary changes to the getAncestors Database functionality.

ABLL526 requested review from benedeki, lsulak, Zejnilovic, dk1844 and salamonpavel as code owners

January 21, 2025 20:32

ABLL526 added good first issue DB Server labels

github-actions bot commented Jan 21, 2025

JaCoCo model module code coverage report - scala 2.13.11

Overall Project	58.76%	🍏

There is no coverage information present for the Files changed

github-actions bot commented Jan 21, 2025

JaCoCo agent module code coverage report - scala 2.13.11

Overall Project	78.2%	🍏

There is no coverage information present for the Files changed

github-actions bot commented Jan 21, 2025

JaCoCo reader module code coverage report - scala 2.13.11

Overall Project	90.86%	🍏

There is no coverage information present for the Files changed

github-actions bot commented Jan 21, 2025

JaCoCo server module code coverage report - scala 2.13.11

Overall Project	68.39%	🍏

There is no coverage information present for the Files changed

ABLL526 linked an issue

that may be closed by this pull request

GET /partitionings/{partId}/parents -> returns all ancestors, not just direct ones #305

Open

ABLL526 self-assigned this

ABLL526 changed the title ~~Added the getAncestors Database functionality.~~ #305 getAncestors Database functionality.

ABLL526 added the enhancement label

salamonpavel reviewed

View reviewed changes

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              --      has_more            - Flag indicating if there are more partitionings available
+              -- Status codes:
+              --      11 - OK

Collaborator

salamonpavel Jan 28, 2025

Not super important I suppose but still, according to https://github.com/AbsaOSS/fa-db/blob/master/core/src/main/scala/za/co/absa/db/fadb/status/README.md we would maybe want to use status 10 instead of 11.

Author

ABLL526 Jan 28, 2025

Changed as mentioned

Contributor

benedeki Jan 29, 2025

I still see 11. 😉

salamonpavel reviewed

View reviewed changes

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql Show resolved Hide resolved

salamonpavel reviewed

View reviewed changes

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              --
+              -------------------------------------------------------------------------------
+              DECLARE
+                  partitionCreateAt TIMESTAMP;

Collaborator

salamonpavel Jan 28, 2025

partitioning

Author

ABLL526 Jan 28, 2025

Changed as mentioned

Contributor

benedeki Jan 29, 2025

Also the local variables start with _ by convention - avoids confusion with OUT parameters and column names.

salamonpavel reviewed

View reviewed changes

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+                      LIMIT i_limit
+                      OFFSET i_offset;
+                  IF FOUND THEN

Collaborator

salamonpavel Jan 28, 2025 •

edited

Loading

You return status already from the query. And there is no reason to return 42. There are no records returned if ancestors don't exist. Have a look at runs.get_partitioning_checkpoints.

Collaborator

salamonpavel Jan 28, 2025

We then simply process the data as

if (results.nonEmpty && results.head.hasMore) ...

Author

ABLL526 Jan 28, 2025

This makes sense although. From runs.get_paritioning_checkpoint_v2 it has a similar logic to this.
What I will do is comment it out for now and determine if it is necessary.

salamonpavel mentioned this pull request

#305 Get-Ancestors-Server Functionality #312

Open

benedeki requested changes

View reviewed changes

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+               * limitations under the License.
+               */
+              CREATE OR REPLACE FUNCTION runs.get_ancestors(

Contributor

benedeki Jan 29, 2025

Would name it to runs.get_partitioning_ancestors, otherwise the name is little ambiguous.

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+                  -------------------------------------------------------------------------------
+              --
+              -- Function: runs.get_ancestors(3)
+              --      Returns Ancestors' partition ID for the given id

Contributor

benedeki Jan 29, 2025

Suggested change

      
            --      Returns Ancestors' partition ID for the given id
          
            --      Returns the ids and partitionings  of the ancestors of the given partitioin id

I think this explains the content better.

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              --
+              -- Parameters:
+              --      i_id_partitioning   - id that we asking the Ancestors for
+              --      i_limit             - (optional) maximum number of partitionings to return, default is 5

Contributor

benedeki Jan 29, 2025

Not important:
Don't we used 10 as the default limit in our functions?

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              --      has_more            - Flag indicating if there are more partitionings available
+              -- Status codes:
+              --      11 - OK

Contributor

benedeki Jan 29, 2025

I still see 11. 😉

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              --
+              -------------------------------------------------------------------------------
+              DECLARE
+                  partitionCreateAt TIMESTAMP;

Contributor

benedeki Jan 29, 2025

Also the local variables start with _ by convention - avoids confusion with OUT parameters and column names.

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+              -- Status codes:
+              --      11 - OK
+              --      41 - Partitioning not found
+              --      42 - Ancestor Partitioning not found

Contributor

benedeki Jan 29, 2025

I think there is no need for this status (and error one furthermore). If no ancestors found, it's OK, simple an empty list (particularly with paging).

database/src/main/postgres/runs/V0.3.0.2__get_ancestors.sql

+                      WHERE
+                          PF2.fk_partitioning = i_id_partitioning
+                          AND
+                          P.created_at < partitionCreateAt

Contributor

benedeki Jan 30, 2025

Why this condition?
Actually I think the whole query is incorrect, unfortunately.
It should be

FROM
            flows.partitioning_to_flow PF
                INNER JOIN flows.flows F ON F.id_flow = PF.id_flow
                INNER JOIN runs.partitionings P ON P.id_partitioning = F.fk_primary_partitioning
        WHERE
            PF.fk_partitioning = i_id_partitioning AND
            P.id_partitioning IS DISTINCT FROM i_id_partitioning

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Reviewers

salamonpavel salamonpavel left review comments

benedeki benedeki requested changes

lsulak Awaiting requested review from lsulak lsulak is a code owner

Zejnilovic Awaiting requested review from Zejnilovic Zejnilovic is a code owner

dk1844 Awaiting requested review from dk1844 dk1844 is a code owner

Requested changes must be addressed to merge this pull request.

Labels

DB enhancement good first issue Server