Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing text2sql pairs in train folder #147

Open
HelenGuohx opened this issue May 5, 2024 · 3 comments
Open

Missing text2sql pairs in train folder #147

HelenGuohx opened this issue May 5, 2024 · 3 comments

Comments

@HelenGuohx
Copy link

Hi,
I want to express my appreciation for your outstanding work on text2sql research.

I recently downloaded the BIRD-bench dataset from https://bird-bench.github.io/ and noticed that the train folder seemed to be missing the text2sql pairs. I found the database descriptions and sqlite files, but not the actual text prompts and corresponding SQL queries.

However, I was glad to see that the dev.json file in the dev folder contains the text2sql pairs I was looking for.

Could you please clarify if the text2sql pairs are intentionally excluded from the train folder, or if there might be a missing file I should download?

@superctj
Copy link

Any update on this issue? I am also looking for SQL queries of the training set.

@bird-bench
Copy link
Contributor

@HelenGuohx @superctj Thanks for interests in our work. Could you check whether you met connection errors or somehow? We re-downloaded again, and it seems train.json exists. For your convenience, I also attach this train.json here. Thanks.
train.json

@superctj
Copy link

Thank you for your quick response! I appreciate it. btw, I didn't run into connection errors and somehow train.json is not in the decompressed directory.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants