Added md5 checks for 5 models downloaded from outside of the repo #12

Open
wants to merge 11 commits into
base: main

anna-stacey
Collaborator

  • Added MD5 checks for 5 model files: fairseq glossing, fairseq segmentation, and 3 coling models (cls, dumb, smart).
  • All 5 of these files are now downloaded rather than stored in the repository. I removed the 3 coling models from the repo; the other two were already downloaded this way.
  • The MD5-checking materials are stored in a new config directory in the root directory.
  • The MD5 checks run only when a model has just been downloaded.
  • The coling MD5 checks are not well tested yet because this part of the code is not currently running.

…iles from google drive).

	- Also fixed a typo.
… for the fairseq gloss model).

	- Created a .csv file that stores the expected MD5 values for model files downloaded separately from the rest of the repo.
	- Created a Python file that checks each file's actual MD5 against the expected value in the .csv. The entrypoint.sh script calls it right after the relevant model file is downloaded.
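
For readers unfamiliar with this kind of check, here is a minimal sketch of how a CSV-backed MD5 comparison could look. It is not the code from this PR: the column names `model_name` and `md5`, the function names, and the CSV layout are all assumptions made for illustration.

```python
import csv
import hashlib


def file_md5(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks
    so that large model files are never loaded into memory at once."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def load_expected(csv_path):
    """Read a CSV with (hypothetical) columns 'model_name' and 'md5'
    into a dict mapping model name -> lowercase hex digest."""
    with open(csv_path, newline="") as handle:
        return {
            row["model_name"]: row["md5"].strip().lower()
            for row in csv.DictReader(handle)
        }


def check_model(model_name, model_path, csv_path):
    """Return True if the file's actual MD5 equals the expected value.
    Raises KeyError if the model has no entry in the CSV."""
    expected = load_expected(csv_path)[model_name]
    return file_md5(model_path) == expected
```

A download script could call something like `check_model` immediately after fetching a file and abort on `False`, so a corrupted or wrong download is caught before the model is loaded.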
…gle drive account.

	- This means the files will no longer be stored in the repo.
	- Switched to more descriptive model-identifying names.
	- Added more descriptive error messages to the MD5-checking code: it now reports when the actual MD5 matches a different MD5 value in the .csv (i.e. not the expected one).
	- Issue: had to create two copies of the MD5-checking files until I can figure out how to keep them both in a common config directory in the root directory.
	- Ideally this should be tested further once we can run the coling models from the GUI.
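
The more descriptive error messages mentioned in the commits above could be produced along these lines. This is only a sketch under assumptions: the function name, the message wording, and the model names in the usage example are hypothetical, and the expected values are assumed to be already loaded into a dict.

```python
def describe_md5_result(model_name, actual_md5, expected_by_model):
    """Compare an actual MD5 with the expected one and explain the outcome.

    expected_by_model maps model name -> expected hex digest.
    Returns a (passed, message) tuple.
    """
    actual = actual_md5.strip().lower()
    expected = expected_by_model.get(model_name)
    if expected is None:
        return False, f"No expected MD5 is recorded for '{model_name}'."
    if actual == expected.strip().lower():
        return True, f"MD5 check passed for '{model_name}'."

    # On a mismatch, look for other entries with the same digest:
    # a hit suggests the wrong file was downloaded, not a corrupted one.
    others = [
        name
        for name, md5 in expected_by_model.items()
        if name != model_name and md5.strip().lower() == actual
    ]
    if others:
        return False, (
            f"MD5 mismatch for '{model_name}': the file's MD5 matches the "
            f"entry for {', '.join(others)} instead."
        )
    return False, (
        f"MD5 mismatch for '{model_name}': expected {expected}, got {actual}."
    )
```

For example, with a table such as `{"fairseq_gloss": "aaa...", "coling_cls": "bbb..."}` (placeholder names and digests), passing the `coling_cls` digest while checking `fairseq_gloss` yields a message naming `coling_cls`, which points at a mixed-up download link rather than a damaged file.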