Advanced copy API needs PRN support #3853

dralley · 2025-01-09T16:40:02Z

As part of the effort to move Pulp APIs over to "PRNs" for various reasons (see pulp/pulpcore#5766), we need to update the advanced copy API, which unlike the other APIs didn't inherit support from pulpcore because it's custom.

The two places that definitely need updates are:

https://github.com/pulp/pulp_rpm/blob/main/pulp_rpm/app/viewsets/repository.py#L764C1-L765C55
https://github.com/pulp/pulp_rpm/blob/main/pulp_rpm/app/serializers/repository.py#L582-L583

dralley · 2025-01-16T16:03:57Z

https://issues.redhat.com/browse/PULP-276

Fixes: pulp#3853

pedro-psb · 2025-01-23T21:35:29Z

@dralley @ggainey On the copy task we do filter for the domain, but I assume we want a nice error message if there any content is not part of domain.

I'm also assuming doing an extra count query is our cheaper option:

compare the number of units provided by the user with the number of those content in the current domain (the extra count query), which should be equal.
If validation fails, make a new query to display what content does not belong to the domain before raising.

Wdyt?

dralley · 2025-01-24T03:20:07Z

Nice error messages are a bonus but not necessarily a requirement - nobody is complaining about the current state of things after all, and that would be an issue with the current code too yeah?

Of course, it would be nice :)

There's a few different ways you could structure it and I'm not sure which is cheapest. e.g. options might be

content.filter(pk__in=list_of_pks_from_prn).exclude(pulp_domain=domain).count()

or

content.filter(pk__in=list_of_pks_from_prn).values("pulp_domain").distinct()

Sidenote: how much are we just "assuming" that these are too expensive to do in the viewset?

pedro-psb · 2025-01-24T12:48:48Z

and that would be an issue with the current code too yeah?

True. The current code does identify the wrong domain href, but only the first, so it's not really too different in terms of usefulness for the user. But on the task context we can afford the better message for failures.

Sidenote: how much are we just "assuming" that these are too expensive to do in the viewset?

I'm 100% assuming. Do you have some estimates for safe numbers we could test? I can sync up a big repo on-demand and do a request with lots of content.

ggainey · 2025-01-24T13:02:08Z

I'm 100% assuming. Do you have some estimates for safe numbers we could test? I can sync up a big repo on-demand and do a request with lots of content.

I have a (very vague) memory of running into timeout issues in the view when testing with a Large Number of HREFs - but it would have been literally years ago, I may be misremembering. The katello usecase does result in some Very Large lists - but it's def worth getting some actual results.

Being able to make the decision with a single call makes a huge difference - in my head, it was going to cost one db-access per PRN, which would be Bad. Limiting the fields-being-returned from the queries suggested by @dralley will help some as well, both performance but especially memory.

ianballou · 2025-01-24T15:37:03Z

Katello implemented "chunked copying" with our use of the advanced copy API, and that is capped at 10,000 hrefs.

We only use the advanced copy API with dependency solving. Otherwise we use the normal add and remove content APIs.

* Add cross_domain validation for PRNs on copy API serializer. The new validation doesn't capture the first href/prn that failed anymore, as it relies on querying distinct domains in the bulk of references. Fixes: pulp#3853

* Add cross_domain validation for PRNs on copy API serializer. The new validation doesn't capture the first href/prn that failed anymore, as it relies on querying distinct domains in the bulk of references. Fixes: pulp#3853 fixup: review changes fixup: review changes on class type validation

* Add cross_domain validation for PRNs on copy API serializer. The new validation doesn't capture the first href/prn that failed anymore, as it relies on querying distinct domains in the bulk of references. Fixes: #3853 fixup: review changes fixup: review changes on class type validation

dralley added Task Triage-Needed and removed Triage-Needed labels Jan 9, 2025

dralley added this to the 3.28 milestone Jan 16, 2025

dralley added Feature and removed Task labels Jan 16, 2025

pedro-psb self-assigned this Jan 16, 2025

pedro-psb added a commit to pedro-psb/pulp_rpm that referenced this issue Jan 23, 2025

Assert PRN support to advanced copy API

f0697ee

Fixes: pulp#3853

pedro-psb added a commit to pedro-psb/pulp_rpm that referenced this issue Jan 23, 2025

Assert PRN support to advanced copy API

3e948d0

Fixes: pulp#3853

pedro-psb added a commit to pedro-psb/pulp_rpm that referenced this issue Jan 23, 2025

Assert PRN support to advanced copy API

0664071

Fixes: pulp#3853

pedro-psb linked a pull request Jan 24, 2025 that will close this issue

[PULP-276] Add support for prns #3864

Merged

dralley closed this as completed in #3864 Feb 5, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Advanced copy API needs PRN support #3853

Advanced copy API needs PRN support #3853

dralley commented Jan 9, 2025 •

edited

Loading

dralley commented Jan 16, 2025 •

edited

Loading

pedro-psb commented Jan 23, 2025

dralley commented Jan 24, 2025

pedro-psb commented Jan 24, 2025

ggainey commented Jan 24, 2025

ianballou commented Jan 24, 2025

Advanced copy API needs PRN support #3853

Advanced copy API needs PRN support #3853

Comments

dralley commented Jan 9, 2025 • edited Loading

dralley commented Jan 16, 2025 • edited Loading

pedro-psb commented Jan 23, 2025

dralley commented Jan 24, 2025

pedro-psb commented Jan 24, 2025

ggainey commented Jan 24, 2025

ianballou commented Jan 24, 2025

dralley commented Jan 9, 2025 •

edited

Loading

dralley commented Jan 16, 2025 •

edited

Loading