Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BE] Special use cases - Semantic Asset Harvesting #28

Open
SrinivasanTarget opened this issue Dec 20, 2021 · 2 comments
Open

[BE] Special use cases - Semantic Asset Harvesting #28

SrinivasanTarget opened this issue Dec 20, 2021 · 2 comments

Comments

@SrinivasanTarget
Copy link
Contributor

Acceptance Criteria:
Story is aimed at handling better the special use cases during harvesting listed below:
Case 1: - Handling of multiple CSV or ttl or both files present in a same semantic asset folder.

E.g:
CV - Leaf node CV folder with lot of CSV’s & ttl files at same folder level- https://github.com/italia/daf-ontologie-vocabolari-controllati/tree/master/Vocabol[…]ollati/classifications-for-transparency/transparency-obligation
If we have multiple ttl files and csv files then its hard to go by same filenames for both since we can have multiple abstractions (csvs) for one ttl file.

Ontology - Latest folder containing multiple ttl files at same folder - https://github.com/italia/daf-ontologie-vocabolari-controllati/tree/master/Ontologie/CLV/latest has **main, align and DBGT ttl **files.

Case 2: How do we handle deprecated folders present inside a repository?
E.g: https://github.com/italia/daf-ontologie-vocabolari-controllati/tree/master/VocabolariControllati/vocs-deprecated has several deprecated CV's
- It's agreed with Giorgia that the deprecated folder will be moved to root of repo to better support harvesting.

@ioggstream
Copy link
Contributor

Related issue #41

@Clou-dia Clou-dia changed the title Special use cases - Semantic Asset Harvesting [BE] Special use cases - Semantic Asset Harvesting Nov 2, 2023
@ioggstream
Copy link
Contributor

This is currently addressed using skipwords (e.g., https://github.com/teamdigitale/dati-semantic-backend/blob/a8f666f60604b968bd94c946a43f3b21d6764557/docker-compose.yaml#L59C1-L60C1)

CV - Leaf node CV folder with lot of CSV’s & ttl files at same folder level-

In this case, this is probably a kludge, and a better solution should be found.

Ontology - Latest folder containing multiple ttl files at same folder - https://github.com/italia/daf-ontologie-vocabolari-controllati/tree/master/Ontologie/CLV/latest has **main, align and DBGT ttl **files.

Addressed via skipwords. We need to either standardize skipwords if we want to apply this pattern to all semantic repositories (e.g., not only to Ontopia) or to establish stricter requirements on repositories in the guidelines: these requirements could be relaxed later on.

Case 2: How do we handle deprecated folders present inside a repository?
E.g: https://github.com/italia/daf-ontologie-vocabolari-controllati/tree/master/VocabolariControllati/vocs-deprecated has several deprecated CV's

Deprecated stuff can be moved in specific folders.
The guidelines already address this via the assets/ folder.
Moreover, skipwords can be used (e.g., deprecated-) to skip the folders.

I suggest to close and eventually re-open detailing every specific use case in a different issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: 🆕 New
Development

No branches or pull requests

3 participants