-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Missing QA tiles create empty labels at country boundaries #106
Comments
@joshwapiano can you use a tool like |
Hi @drewbo , appreciate the fast response! I'm assuming based on your response that the Many thanks |
@joshwapiano the class syntax looks correct (and is supported). The overall classification looks generally correct except near the country boundary. green showing It looks like what's happening is that because of the country boundary, it skips including certain tiles (belonging to Malaysia) in the OSM QA tiles. Then the underlying data isn't present so I'm going to retitle this issue to reflect this and will inquire with the maintainers of the upstream data to see if there's a good workaround. For now, I'd leave out boundary tiles from training. |
@drewbo Thanks for looking in to this. I should have time to make changes based on your findings before my project deadline. My previous approach was to first run: $ label-maker download --dest <example> --config <example>.json
$ label-maker labels --dest <example> --config <example>.json
$ label-maker images --dest <example> --config <example>.json I would then manually inspect the images that had been downloaded - deleting those which were either cloud covered, or poor quality (a high proportion when requesting data at 14 zoom level!) to create a cleansed folder of images. $ label-maker package --dest <example> --config <example>.json This has taken me quite some time, so could you confirm whether it would be possible to copy the cleansed image folders already created in to a different directory and from there, make changes to the config file to reduce risk of boundary overlap and simply run the following: $ label-maker download --dest <example_2> --config <example_2>.json
$ label-maker labels --dest <example_2> --config <example_2>.json
$ label-maker package --dest <example_2> --config <example_2>.json Or would I need to re-download and re-cleanse the images for each country? Many thanks P.s. One other thought - could the issue you've identified also impact on land/sea borders as well? The full set of bounding boxes I've used are as follows: |
@drewbo If I can find time I'll look in to this in more detail too over the next few days - would you mind sharing the code you used to produce the black/purple map above, was this using Mapnik? |
|
After seeing really poor results in my ResNet50 I have manually inspected the data and labels and it appears that the background images label_maker is producing are labelled almost randomly.
Here's the config that I'm using:
Essentially what I've attempted to do in the above 'classes' is to identify tiles with any sign of human activity.
At that zoom level around 300 images are packaged in to data.npz.
Once we have the data.npz file a quick inspection of some of the imagery with the following code (I used a Jupyter Notebook) shows a high proportion of the images are labelled as not in the class above despite very clearly containing man made structures.
I also tried at zoom level 15 and the same issue exists.
I'm wondering a few things:
Is the class I'm using too complex to use label-maker for this purpose?
Is the OSM tiling simply unreliable?
Is there a bug in label maker?
Am I using background_ratio incorrectly?
Am I making a stupid mistake somewhere?
Any advice greatly appreciated!
Thanks
Josh
The text was updated successfully, but these errors were encountered: