Instead of downloading the entire 8.9 TB dataset, download only files you need!

MarcusLoppe/Objaverse-downloader

Objaverse Downloader

Reduce the hassle of downloading the 8.9 TB dataset by fetching only the necessary files!

This script downloads only the mesh models that match your file size criteria.

Features

  • Download meshes based on minimum and maximum file size requirements.
  • Filter the metadata files and export a JSON file containing only the metadata relevant to the downloaded files.
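
As a rough illustration of the metadata step, here is a minimal sketch. It assumes the metadata is a dictionary keyed by model UID and that each downloaded file is named `<uid>.glb`. The function names `filter_metadata` and `export_metadata` are hypothetical and are not taken from the notebook.

```python
import json
from pathlib import Path


def filter_metadata(metadata, downloaded_paths):
    """Keep only the metadata entries whose UID matches a downloaded file.

    Assumption: the file stem (name without extension) is the model UID.
    """
    wanted = {Path(p).stem for p in downloaded_paths}
    return {uid: entry for uid, entry in metadata.items() if uid in wanted}


def export_metadata(filtered, out_file):
    """Write the filtered metadata to a single JSON file."""
    with open(out_file, "w", encoding="utf-8") as fh:
        json.dump(filtered, fh, indent=2)
```

Matching on the file stem keeps the filter independent of the folder layout, so it works however the downloaded files are organized on disk.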

Usage

  1. Set your desired file size criteria in the provided notebook.
  2. Execute the notebook to download meshes and filter metadata accordingly.

Use case

While training MeshGPT, I needed more mesh models and was interested in using Objaverse, which contains 800k+ models.

However, I could not clone the entire repo: the dataset is about 8.9 TB, and I have neither the disk space nor the time to download it. I only wanted the small mesh models for training MeshGPT, e.g., those of at most 80 KB. Git offers little support for filtering downloads by file size, so I created this script to download only the mesh files within a given file size range.
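
The size-range selection can be sketched as follows. This is not the notebook's code: `select_by_size` and `download_selected` are hypothetical names, the `sizes` mapping (repo-relative path to size in bytes) is assumed to exist already, and `base_url` is whatever prefix serves the raw files. For a Hugging Face dataset repo, that would typically be its `resolve/main` URL, which is an assumption here.

```python
import os
import urllib.request


def select_by_size(sizes, min_bytes, max_bytes):
    """Return the paths whose size lies in [min_bytes, max_bytes], sorted."""
    return sorted(p for p, s in sizes.items() if min_bytes <= s <= max_bytes)


def download_selected(paths, base_url, out_dir):
    """Fetch each selected file individually, skipping files already on disk."""
    for path in paths:
        target = os.path.join(out_dir, path)
        os.makedirs(os.path.dirname(target) or ".", exist_ok=True)
        if not os.path.exists(target):
            urllib.request.urlretrieve(f"{base_url}/{path}", target)


# Example: keep meshes of at most 80 KB (taking 1 KB = 1024 bytes, an assumption).
example_sizes = {"glbs/a.glb": 50_000, "glbs/b.glb": 90_000, "glbs/c.glb": 81_920}
small = select_by_size(example_sizes, 0, 80 * 1024)
```

Fetching files one at a time avoids cloning the repository, and the existence check lets an interrupted run resume without downloading the same files again.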

To get the file sizes, I dumped the entire commit history of the Objaverse repo and parsed the output to extract the size of each model. The code and instructions for doing this are in the notebook.
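
The notebook's own parsing code is not reproduced here. As one possible sketch, suppose the dump is patch-style output (such as `git log -p` produces) and the mesh files are stored as Git LFS pointers. Each added file then appears as a `+++ b/<path>` line followed by a `+size <bytes>` line from the pointer. Both the dump format and the name `parse_sizes` are assumptions.

```python
import re

# "+++ b/<path>" marks the file a diff hunk belongs to.
PATH_RE = re.compile(r"^\+\+\+ b/(?P<path>.+)$")
# "+size <bytes>" is the size line of an added Git LFS pointer.
SIZE_RE = re.compile(r"^\+size (?P<size>\d+)$")


def parse_sizes(dump_text):
    """Map each file path in the dump to the size recorded in its LFS pointer."""
    sizes = {}
    current_path = None
    for line in dump_text.splitlines():
        match = PATH_RE.match(line)
        if match:
            current_path = match.group("path")
            continue
        match = SIZE_RE.match(line)
        if match and current_path is not None:
            # The first occurrence wins: git log lists the newest commit first.
            sizes.setdefault(current_path, int(match.group("size")))
    return sizes
```

Because the pointer records the real file size, the sizes can be collected without downloading any mesh data.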
