Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Acquire MARBLE Data #1

Open
7 of 12 tasks
janetzki opened this issue Oct 28, 2022 · 1 comment
Open
7 of 12 tasks

Acquire MARBLE Data #1

janetzki opened this issue Oct 28, 2022 · 1 comment

Comments

@janetzki
Copy link
Owner

janetzki commented Oct 28, 2022

Goal

As a developer, I want to acquire the data from MARBLE to use their mappings from words to lexical domains. These mappings indicate to which semantic domain a verse (or word or phrase) belongs (for Hebrew and Greek).
Motivation: disambiguate words -> reduce FNs -> increase DC recall > 0.30

Tasks

  • get lexical domain mappings for the Old Testament
  • get lexical domain mappings for the New Testament
  • create a list of all data that we already can access
    • --> 2 parquet files
  • Parse mappings from words to lexical domains
  • think about how we can map lexical domains to SDs
  • see A.1.3 in proposal
  • try it out
  • Look at what we already have
    • a) mappings from words in verses to MARBLE domains
    • b) mappings from MARBLE domains to SDs
    • c) mappings from words in verses to SDs
  • refine c) with a) and b) (set intersection)
  • (alternative idea: add lexical domains to the graph (#GNN))
  • (disambiguate example verse with lexical domains)
  • (create more labels by trying out the semi-automatic labeling function from [Refinery](https://github.com/code-kern-ai/refinery) (NLP IDE).)

Notes

@janetzki
Copy link
Owner Author

Closed because fine-tuning ada looks more promising.

@janetzki janetzki reopened this May 15, 2023
janetzki pushed a commit that referenced this issue May 15, 2023
janetzki pushed a commit that referenced this issue May 17, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant