The Bioregistry is a small database that supports data standardization and integration in biomedical research, built on semantic web principles. We ship the database (as a set of JSON files) together with Python code for accessing and using it in our bioregistry Python package, which is published to PyPI.
The project is about 5 years old and we have made nearly 800 releases to PyPI.
The database is aligned with other existing databases and also incorporates manual curation, so it grows slowly over time. It is currently a bit larger than 5MiB, and we expect it to grow by about 1MiB every few years.
How large is each release?
~13MiB total.
This includes a source tarball (~6MiB) and a universal wheel (~6MiB).
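To put that size in context, here is a rough sketch of how many releases fit under a project size limit and how long the remaining headroom lasts at a given cadence. It assumes a constant ~13MiB per release (ignoring the slow database growth) and treats 1GB as 10^9 bytes; both are simplifications, and the function names are illustrative only.

```python
MIB = 1024 ** 2
GB = 10 ** 9  # assumption: the limit is counted in decimal gigabytes


def releases_that_fit(limit_bytes: int, release_bytes: int) -> int:
    """Number of whole releases of a fixed size that fit under a limit."""
    return limit_bytes // release_bytes


def years_of_headroom(
    limit_bytes: int,
    used_bytes: int,
    release_bytes: int,
    releases_per_year: int,
) -> float:
    """Years until the limit is reached at a fixed release size and cadence."""
    remaining = max(limit_bytes - used_bytes, 0)
    return remaining / release_bytes / releases_per_year


release_size = 13 * MIB  # sdist + wheel, per the figures above

print("releases under 10GB:", releases_that_fit(10 * GB, release_size))
print("releases under 20GB:", releases_that_fit(20 * GB, release_size))
print(
    "years of weekly releases from 10GB to 20GB:",
    round(years_of_headroom(20 * GB, 10 * GB, release_size, 52), 1),
)
print(
    "years of nightly releases from 10GB to 20GB:",
    round(years_of_headroom(20 * GB, 10 * GB, release_size, 365), 1),
)
```

Under these assumptions, the cadence dominates the estimate: the same headroom lasts roughly seven times longer with weekly releases than with nightly ones.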
How frequently do you make a release?
We used to release nightly until we hit the 10GB limit, at which point we yanked some old releases that only represented small database changes, and scaled back to releasing weekly.
However, our community would appreciate a faster cadence so that their code always has the latest data. We are also working on improving our release logic so that we don't make a new PyPI upload when there are no "interesting" changes to the data.
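As a minimal sketch of what such a release gate could look like (this is not the Bioregistry's actual logic), one option is to compare the previously released data with the current data and skip the upload when fewer than a threshold number of records changed. The record layout, the function names, and the `min_changes` threshold are all hypothetical.

```python
_MISSING = object()  # sentinel so a stored None differs from an absent key


def count_changed_records(old: dict, new: dict) -> int:
    """Count records that were added, removed, or modified between snapshots."""
    return sum(
        1
        for key in old.keys() | new.keys()
        if old.get(key, _MISSING) != new.get(key, _MISSING)
    )


def should_release(old: dict, new: dict, min_changes: int = 1) -> bool:
    """Hypothetical gate: release only when enough records have changed."""
    return count_changed_records(old, new) >= min_changes


# Toy snapshots standing in for the JSON files shipped in the package.
previous = {"chebi": {"name": "ChEBI"}, "go": {"name": "Gene Ontology"}}
current = {
    "chebi": {"name": "ChEBI"},
    "go": {"name": "Gene Ontology"},
    "doid": {"name": "Disease Ontology"},
}

print(should_release(previous, previous))  # no changes, so no upload
print(should_release(previous, current))  # one new record, so upload
```

A real definition of "interesting" would likely weigh the kind of change rather than just counting records, but a count-based threshold is the simplest place to start.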
Code of Conduct
I agree to follow the PSF Code of Conduct
Project URL
https://pypi.org/project/bioregistry/
Does this project already exist?
New limit
20GB
Update issue title
Which indexes
PyPI