Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug fix: Synonym Sync: Entries without exact mappings #695

Merged
merged 1 commit into from
Nov 22, 2024

Conversation

joeflack4
Copy link
Contributor

@joeflack4 joeflack4 commented Nov 20, 2024

resolves #687

Overview

The synonym sync template TSVs were being built incorrectly, and were sometimes adding synchronization outputs for cases where the Mondo term and source term had no exact mapping in mondo.sssom.tsv. The filtering was not being applied correctly.

The cause of these appearing in the first place is that these pairs of Mondo+source terms have synonyms on them, but they do not have exact mappings.

Pre-merge checklist

Documentation

Was the documentation added/updated under docs/?

  • Yes
  • No, updates to the docs were not necessary after careful consideration

QC

Was the full pipeline run before submitting this PR using sh run.sh make build-mondo-ingest on this branch (after
docker pull obolibrary/odkfull:dev), and no errors occurred?

  • Yes
  • No, there are no functional (code-related) changes to the pipeline in the PR, so no re-run is necessary

Mini-build:
Not a full build yet, but there is a mini-build which compares the only synonym sync outputs before and after.

Build:

New Packages

Were any new Python packages added?

Were any other non-Python packages added?

PR Review and Conversations Resolved

Has the PR been sufficiently reviewed by at least 1 team member of the Mondo Technical team and all threads resolved?

  • Yes

@joeflack4 joeflack4 marked this pull request as draft November 20, 2024 03:01
@joeflack4 joeflack4 self-assigned this Nov 20, 2024
@joeflack4 joeflack4 added the bug Something isn't working label Nov 20, 2024
@joeflack4 joeflack4 linked an issue Nov 20, 2024 that may be closed by this pull request
@joeflack4 joeflack4 marked this pull request as ready for review November 20, 2024 03:14
@joeflack4 joeflack4 changed the base branch from dev-new to sync-synonym2 November 20, 2024 14:29
@joeflack4 joeflack4 changed the base branch from sync-synonym2 to dev-new November 20, 2024 14:30
@joeflack4 joeflack4 force-pushed the bugfix-syn-sync-no-exact-mappings-2-code branch from 54a304c to ed9541e Compare November 20, 2024 22:15
- The synonym sync template TSVs were being built incorrectly, and were sometimes adding synchronization outputs for cases where the Mondo term and source term had no exact mapping in mondo.sssom.tsv.
@joeflack4 joeflack4 changed the base branch from dev-new to develop November 20, 2024 22:16
@joeflack4 joeflack4 marked this pull request as draft November 20, 2024 22:35
@joeflack4 joeflack4 marked this pull request as ready for review November 21, 2024 21:37
Copy link
Member

@matentzn matentzn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Assuming (which I cant see in the diff) that mappings_df is only exact matches, this LGTM!

@joeflack4
Copy link
Contributor Author

Assuming... that mappings_df is only exact matches

Yep! in mondo.sssom.tsv that does seem the case, but it also does a filter for exact just in case.

Copy link
Contributor

@twhetzel twhetzel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Mainly approving on Nico's approval here (and a previous discussion where these code changes look like what was needed) since I have not had enough time to look at this and the results in as much detail as I would prefer.

@twhetzel
Copy link
Contributor

@joeflack4 I'll this for you to merge

@joeflack4
Copy link
Contributor Author

@twhetzel OK, I'll merge it now. I just got done reviewing the build #707 and it is fine, consistent with the preliminary mini-build #696.

@joeflack4 joeflack4 merged commit 98debfc into develop Nov 22, 2024
@joeflack4 joeflack4 deleted the bugfix-syn-sync-no-exact-mappings-2-code branch November 22, 2024 21:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug: Synonym Sync: -confirmed has non-exact entries
3 participants