You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Opening an issue from an unresolved discussion. Since then, the payloads have slightly changed. Below [1], there's a more recent example. The bahavior from the initial discussion however still persists:
The dcat:distribution section on a dcat:Dataset still returns all transfer profiles without regard to how the edc:Asset was registered.
dct:accessService still points to a literal. So even in a dcat:Catalog document, the data isn't linked.
The dct:format property does point to a resource, however the @id field isn't namespaced. So it's hard to dereference this to a concept. I think, I remember @jimmarino saying they should be maintained in the dspace context but this isn't imported.
Originally posted by arnoweiss December 11, 2023
The catalog-requests return data in the structure of dcat:DataSets that look as follows. Please note the dcat:distribution section.
dcat:distribution currently always returns the two entries - one with the format "AmazonS3" and one with "HttpProxy" as dcat:format. As I understand the definition of this section in the context of an EDC, this should signify the means a consumer can use to access the data - in this case either the Http- or S3-Data-Planes. In its current behavior, the two distribution methods are returned independent of the èdc:type that was set by the Data Provider in the POST /v3/assets's DataAddress object. So here's a couple of hypotheses to discuss:
The catalog should return only those dcat:distributions that the backend's data is actually available via.
dcat:accessService should contain a resource, not a literal.
The range of edc:type (currently AmazonS3, HttpData) and dct:format (AmazonS3 , HttpProxy) should be the same.
The text was updated successfully, but these errors were encountered:
The dcat:distribution section on a dcat:Dataset still returns all transfer profiles without regard to how the edc:Asset was registered.
this is correct, the way the distribution is built is driven by the DistributorResolver, specifically, by default (DefaultDistributionResolver) the distribution types are obtained from the data flow manager. The way a consumer can get the data is dependent on the data planes registered on the connector.
e.g. if a data plane with support for "AmazonS3" is registered, every asset can get with an "AmazonS3-PUSH", also if the data is located e.g. on Azure Blobstorage.
For different needs, the DistributorResolver can be overridden.
dct:accessService still points to a literal. So even in a dcat:Catalog document, the data isn't linked.
true, I guess it was intended in the wrong way when it was implemented, at the moment the EDC by default has a single DataSource, that's specified by the dcat:service attribute and can be used instead of the accessService for now.
The dct:format property does point to a resource, however the @id field isn't namespaced. So it's hard to dereference this to a concept. I think, I remember @jimmarino saying they should be maintained in the dspace context but this isn't imported.
true, the format has not been namespaced. BTW I don't think the formats should be described in the dspace context but they should be dataspace specific in my opinion.
I think this is related to the work to be done in #4031
Opening an issue from an unresolved discussion. Since then, the payloads have slightly changed. Below [1], there's a more recent example. The bahavior from the initial discussion however still persists:
dcat:distribution
section on adcat:Dataset
still returns all transfer profiles without regard to how theedc:Asset
was registered.dct:accessService
still points to a literal. So even in adcat:Catalog
document, the data isn't linked.dct:format
property does point to a resource, however the@id
field isn't namespaced. So it's hard to dereference this to a concept. I think, I remember @jimmarino saying they should be maintained in thedspace
context but this isn't imported.[1]
Discussed in #3697
Originally posted by arnoweiss December 11, 2023
The catalog-requests return data in the structure of dcat:DataSets that look as follows. Please note the
dcat:distribution
section.dcat:distribution currently always returns the two entries - one with the format "AmazonS3" and one with "HttpProxy" as dcat:format. As I understand the definition of this section in the context of an EDC, this should signify the means a consumer can use to access the data - in this case either the Http- or S3-Data-Planes. In its current behavior, the two distribution methods are returned independent of the
èdc:type
that was set by the Data Provider in thePOST /v3/assets
's DataAddress object. So here's a couple of hypotheses to discuss:edc:type
(currentlyAmazonS3
,HttpData
) anddct:format
(AmazonS3
,HttpProxy
) should be the same.The text was updated successfully, but these errors were encountered: