Project Limit Request: bioregistry - 20 GiB #5904

Open
3 tasks done
cthoyt opened this issue Mar 14, 2025 · 0 comments

cthoyt commented Mar 14, 2025

Project URL

https://pypi.org/project/bioregistry/

Does this project already exist?

  • Yes

New limit

20GB

Update issue title

  • I have updated the title.

Which indexes

PyPI

About the project

The Bioregistry is a small database that supports data standardization and integration in biomedical research, built on semantic web principles. We package the database (a set of JSON files) together with Python code for accessing and using it in the bioregistry Python package, which is published to PyPI.

The project is about 5 years old and we have made nearly 800 releases to PyPI.

The database aligns with other existing databases and incorporates manual curation, so it grows slowly over time. It is currently a bit larger than 5MiB, and we expect it to grow by about 1MiB every few years.

How large is each release?

~13MiB total.

This includes a source tarball (~6MiB) and a universal wheel (~6MiB).
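As a rough check on how far a given quota goes at this release size, here is a minimal sketch of the arithmetic. It assumes a constant ~13MiB per release, treats the limits as binary units (GiB), and ignores database growth; the function names and the cadence values are illustrative, not part of the bioregistry tooling.

```python
MIB_PER_GIB = 1024


def releases_that_fit(limit_gib: float, release_mib: float) -> int:
    """Count whole releases of a fixed size that fit under a storage limit."""
    return int(limit_gib * MIB_PER_GIB // release_mib)


def years_of_headroom(
    limit_gib: float,
    used_gib: float,
    release_mib: float,
    releases_per_year: int,
) -> float:
    """Estimate years until the limit is reached at a fixed release cadence."""
    remaining = releases_that_fit(limit_gib - used_gib, release_mib)
    return remaining / releases_per_year


# Assumption: roughly 10 GiB is already used and the new limit is 20 GiB.
print(releases_that_fit(10, 13))  # releases that fit in 10 GiB
print(round(years_of_headroom(20, 10, 13, releases_per_year=52), 1))   # weekly
print(round(years_of_headroom(20, 10, 13, releases_per_year=365), 1))  # nightly
```

Under these assumptions, about 787 releases of 13MiB fit in 10GiB, which is in line with the "nearly 800 releases" mentioned above. The remaining 10GiB would last roughly 15 years at a weekly cadence and roughly 2 years at a nightly one.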

How frequently do you make a release?

We used to release nightly until we hit the 10GB limit, at which point we yanked some old releases that only represented small database changes, and scaled back to releasing weekly.

However, our community would appreciate increasing the cadence, so they can always have the latest data in their code. We are working towards improving the business logic of releases so we don't make a new PyPI upload when there are no "interesting" changes to the data.
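One possible shape for that release gate is sketched below. It is an assumption about how such a check could work, not the project's actual logic: it treats any change to the canonicalized JSON content as "interesting", and the names `content_digest` and `should_release` are hypothetical.

```python
import hashlib
import json
from typing import Optional


def content_digest(records: dict) -> str:
    """Hash the data in a canonical form so key order and whitespace don't matter."""
    canonical = json.dumps(
        records, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


def should_release(current: dict, last_released_digest: Optional[str]) -> bool:
    """Release only if nothing was released before or the data content changed."""
    if last_released_digest is None:
        return True
    return content_digest(current) != last_released_digest


# Hypothetical records; the real database schema is not assumed here.
previous = {"chebi": {"name": "ChEBI"}, "go": {"name": "Gene Ontology"}}
reordered = {"go": {"name": "Gene Ontology"}, "chebi": {"name": "ChEBI"}}
updated = {**previous, "doid": {"name": "Human Disease Ontology"}}

digest = content_digest(previous)
print(should_release(reordered, digest))  # same content, different key order
print(should_release(updated, digest))    # a new record was added
```

A stricter definition of "interesting" (for example, ignoring changes to specific fields) could be applied by filtering the records before hashing.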

Code of Conduct

  • I agree to follow the PSF Code of Conduct