Stephen yin01/spider analysis #11

Open
wants to merge 49 commits into
base: main
Changes from 1 commit
Commits
3e0e32a
Update sql2pandas with better error checking + cleanup
troyfeng116 Mar 3, 2022
89e8511
Sandbox scripts + testing
troyfeng116 Mar 9, 2022
39fb73e
Create json file with converted SQL queries
troyfeng116 Mar 15, 2022
60cdba2
Initial preprocessing: cleanup + nested SELECT framework
troyfeng116 Mar 15, 2022
26b46ac
Support for complex non-convertible queries: ProcessedSQLQueryNode fr…
troyfeng116 Mar 20, 2022
383078f
Cleanup
troyfeng116 Mar 20, 2022
d3cc6e3
Refine ProcessedSQLQueryTree structure
troyfeng116 Mar 20, 2022
5057c7e
gitignore
troyfeng116 Mar 20, 2022
7b7b8b7
Merge main (rebase)
troyfeng116 Mar 20, 2022
5b18a95
Fix l_to_r_keys
troyfeng116 Mar 20, 2022
cb1cc6f
Cleanup + internal/external symbol keys
troyfeng116 Mar 20, 2022
cdab109
Docstrings + cleanup
troyfeng116 Mar 20, 2022
e0fd20f
Update TODO
troyfeng116 Mar 20, 2022
6ea91e4
Remove extra commits for pre-untracked files
troyfeng116 Mar 20, 2022
895263a
Clean up gitignore conflicts
troyfeng116 Mar 20, 2022
e184279
Rm ast_sandbox from cache
troyfeng116 Mar 21, 2022
6e1ea8f
Rm sandbox_bad_queries.txt from git cache
troyfeng116 Mar 21, 2022
c54c725
Expand processed tree functionality to UNION/INTERSECT/EXCEPT
troyfeng116 Mar 21, 2022
cd40d80
Add output txt test file to gitignore
troyfeng116 Mar 21, 2022
473d315
Naive function to extract entire table from SELECT query
troyfeng116 Mar 21, 2022
82b9dcc
Clean up table extraction + separate table alias parsing
troyfeng116 Mar 21, 2022
4acfd59
Add table alias + substitute symbol to leaf nodes
troyfeng116 Mar 21, 2022
e607727
DFS to extract code snippets
troyfeng116 Mar 21, 2022
daba852
Cleanup some naming
troyfeng116 Mar 21, 2022
3ff0c81
Cleanup + indexing nit
troyfeng116 Mar 21, 2022
9e5a4ba
Add script to assert validity of SQL tree
troyfeng116 Mar 22, 2022
80cbfb4
Update gitignore for output files
troyfeng116 Mar 22, 2022
3ddc969
Clean up file structure + add table_expr class + debug symbol generation
troyfeng116 Mar 22, 2022
83d2c40
Docstrings + cleanup DFS
troyfeng116 Mar 22, 2022
17328bc
Refine pandas generation: initial setup to handle INTERSECT/UNION/EXCEPT
troyfeng116 Mar 22, 2022
af7570c
Move single node to pandas helpers to separate file, clean up node init
troyfeng116 Mar 22, 2022
9146a03
Docstrings + handle multiple JOIN ONs in table expression
troyfeng116 Mar 22, 2022
def0726
Update TODO
troyfeng116 Mar 22, 2022
b988ad6
Table alias removal + aliased_table_expr field
troyfeng116 Apr 1, 2022
624015f
Cleanup
troyfeng116 Apr 1, 2022
823340f
Gitignore
troyfeng116 Apr 1, 2022
51798d8
JOIN tables -> pandas
troyfeng116 Apr 2, 2022
3f79033
Cleanup + docstrings
troyfeng116 Apr 2, 2022
cdb359d
Add quick dirty fixes for spider execution
chenyx512 Apr 4, 2022
394c115
try to fix the intersect and union
niansong1996 Apr 6, 2022
f4739ad
Improve to 75% correct
chenyx512 Apr 11, 2022
98b5af6
unofficial files, playground progress for squall dataset
StephenYin01 Apr 11, 2022
4a295dd
Merge branch 'main' into yuxuan/sql2pandas
niansong1996 Apr 15, 2022
87c3760
Merge branch 'main' of github.com:Yale-LILY/NLP4Code into StephenYin0…
StephenYin01 Apr 18, 2022
3496afb
Merge branch 'yuxuan/sql2pandas' of github.com:Yale-LILY/NLP4Code int…
StephenYin01 Apr 18, 2022
b1d4cea
put WIP on remote github
StephenYin01 Jul 7, 2022
8d67d34
finished ten correction of failed conversions for codex
StephenYin01 Jul 11, 2022
2bcb5da
Merge branch 'main' of github.com:Yale-LILY/NLP4Code into StephenYin0…
StephenYin01 Jul 11, 2022
7ff994e
annotated 10 more examples
StephenYin01 Aug 3, 2022
Update gitignore for output files
troyfeng116 committed Mar 22, 2022
commit 80cbfb45fb58cbf1bbfa95d02c605635079a4237
2 changes: 1 addition & 1 deletion .gitignore
@@ -130,7 +130,7 @@ dmypy.json

 # sandboxes
 **sandbox.py
-parsing/sandbox_bad_queries.txt
+parsing/sandbox*

 # defined by Ansong
 .venv