Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset Filename Mismatch with data_reorganize.csv #5

Open
Nech-C opened this issue Jan 15, 2025 · 0 comments
Open

Dataset Filename Mismatch with data_reorganize.csv #5

Nech-C opened this issue Jan 15, 2025 · 0 comments

Comments

@Nech-C
Copy link

Nech-C commented Jan 15, 2025

Hi Dr. Ji,

Thank you for this valuable project. I am trying to build the ColonINST dataset, but I encountered an issue where the filenames in the dataset do not align with the filenames referenced in the data_reorganize.csv for the polyp section of the Kvasir(Kvasir-V2).

Specifically, the CSV expects files with cju-prefixed names (e.g., cju0qkwl35piu0993l0dewei2.png) for images under kvasir-dataset-v2/polyps, but the dataset contains filenames in a UUID format (e.g., fd49618e-96fc-45ec-a917-9d04421faa3f.jpg). All other categories from Kvasir-V2 have valid paths in the data_reorganize.csv file. As a result, about 1000 polyp images are missing.

I downloaded the dataset through the link provided by this repo and also tried other sources, but none of them have polyp images with the expected filenames.

Could you clarify how the cju-prefixed filenames are derived? I appreciate any help you can give!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant