Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

prune old large files from git history #256

Open
matthewhanson opened this issue Mar 5, 2022 · 1 comment
Open

prune old large files from git history #256

matthewhanson opened this issue Mar 5, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@matthewhanson
Copy link
Member

Is your feature request related to a problem? Please describe.
I noticed today when cloning the stactools repo it was getting sizable. It's 32MB, of which 8MB is the main branch (large files are in the test directory)

Describe the solution you'd like
Prune out old large files from the git history, I suspect at least some of these files are old test files

Describe alternatives you've considered
Live with it, which is fine - this is not critical, you can always do shallow clones. However I think that with the codebase largely stabilized now the old files can be pruned from the tree safely and will result in a leaner repo when cloning

@matthewhanson matthewhanson added the enhancement New feature or request label Mar 5, 2022
@gadomski
Copy link
Member

gadomski commented Mar 7, 2022

I'm a little wary of modifying public git history. 32MB doesn't seem that big to me in the grand scheme of things for a geospatial repo, and since we are maturing, most users will be pip-installing from a tagged release anyways?

If we were going to, it looks like this culprit is obvious (I used this):

56539fcf32b1  7.1MiB tests/data-files/sentinel2/S2B_MSIL2A_20191228T210519_N0212_R071_T01CCV_20201003T104658.SAFE/DATASTRIP/DS_ESRI_20201003T104659_S20191228T210521/MTD_DS.xml
343fb87cec25   13MiB tests/data-files/sentinel2/S2A_MSIL2A_20190212T192651_N0212_R013_T07HFE_20201007T160857.SAFE/DATASTRIP/DS_ESRI_20201007T160858_S20190212T192646/MTD_DS.xml
446fb4bf5ec8   18MiB tests/data-files/sentinel2/esa_S2B_MSIL2A_20210122T133229_N0214_R081_T22HBD_20210122T155500.SAFE/DATASTRIP/DS_VGS2_20210122T155500_S20210122T133224/MTD_DS.xml

These files were removed when we split out the packages: 1fb527f.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants