How to prepare the dataset of AitW #4

Open
durianer-D opened this issue Jul 9, 2024 · 1 comment

Comments

@durianer-D

I am very interested in your work and tried to reproduce your experiments. While preparing the dataset, I found that the downloaded AitW dataset is stored in TensorFlow (TFRecord) format, whereas the JSON file you provided references .png images. I don't know how to map the encoded images in the TensorFlow records to the .png files named in your JSON file.

For example:
{"conversations": [{"from": "human", "value": "Please identify the labels of each marked region in the image. "}, {"from": "gpt", "value": "Region 1: Screenshot Text ('31')\nRegion 2: Screenshot Text ('Google Calendar')\nRegion 3: Screenshot Text ('Make the most of every day.')\nRegion 4: Screenshot ICON (V_FORWARD)\nRegion 5: Screenshot ICON (NAV_BAR_RECT)\nRegion 6: Screenshot ICON (V_BACKWARD)"}], "bbox": [[176, 248, 51, 43], [125, 446, 161, 21], [118, 490, 168, 14], [374, 636, 7, 29], [315, 695, 14, 21], [81, 695, 7, 21]], "image": "aitw/google_apps/images/google_apps_3619507004205175709_3.png"}

I want to know how to derive the image name google_apps_3619507004205175709_3.png from the AitW dataset.

Could you help? I look forward to your reply.

@SiyuanHuang95

Hi, thanks for your interest in our project.

You can refer to this official demo: https://github.com/google-research/google-research/blob/master/android_in_the_wild/demo.ipynb
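
For readers following along, here is a rough sketch of how the records could be turned into .png files, based on how the demo reads the data. Several things here are assumptions, not confirmed in this thread: the naming scheme `<subset>_<episode_id>_<step_id>.png` is only inferred from the JSON example above, `aitw_image_name` and `export_images` are hypothetical helpers, and the feature keys (`episode_id`, `step_id`, `image/encoded`, `image/height`, `image/width`, `image/channels`) should be checked against the demo notebook before relying on them.

```python
import os


def aitw_image_name(subset: str, episode_id: str, step_id: int) -> str:
    """Build a file name as <subset>_<episode_id>_<step_id>.png.

    ASSUMPTION: this scheme is inferred from the JSON example in this issue,
    it is not confirmed by the maintainers.
    """
    return f"{subset}_{episode_id}_{step_id}.png"


def export_images(tfrecord_glob: str, subset: str, out_dir: str, limit: int = 10) -> None:
    """Decode up to `limit` AitW screenshots and write them as PNG files."""
    # Imported lazily so the naming helper works without TensorFlow installed.
    import tensorflow as tf

    os.makedirs(out_dir, exist_ok=True)
    filenames = tf.io.gfile.glob(tfrecord_glob)
    # The demo reads the shards as GZIP-compressed TFRecord files.
    dataset = tf.data.TFRecordDataset(filenames, compression_type="GZIP")

    for raw in dataset.take(limit).as_numpy_iterator():
        example = tf.train.Example.FromString(raw)
        feat = example.features.feature

        # ASSUMPTION: feature keys and types as used in the official demo.
        episode_id = feat["episode_id"].bytes_list.value[0].decode("utf-8")
        step_id = feat["step_id"].int64_list.value[0]
        height = feat["image/height"].int64_list.value[0]
        width = feat["image/width"].int64_list.value[0]
        channels = feat["image/channels"].int64_list.value[0]

        # 'image/encoded' is treated as raw uint8 pixels (as the demo does),
        # so it is reshaped first and only then encoded as PNG.
        pixels = tf.io.decode_raw(feat["image/encoded"].bytes_list.value[0], tf.uint8)
        image = tf.reshape(pixels, (height, width, channels))

        path = os.path.join(out_dir, aitw_image_name(subset, episode_id, step_id))
        tf.io.write_file(path, tf.io.encode_png(image))
```

A call might look like `export_images("<path to google_apps shards>/*", "google_apps", "aitw/google_apps/images")`, where the shard path is whatever location you downloaded the dataset to. If the exported names do not match the ones in the JSON file, the naming assumption above is the first thing to revisit.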
