EMR Data Exploration #61

Open
3 of 6 tasks
gatechke opened this issue Sep 1, 2018 · 9 comments

@gatechke
Collaborator

gatechke commented Sep 1, 2018

  • Obtain access to Neptune
  • Contact Patrick to help with getting login
  • Install SQL Developer to connect to Neptune
  • Understanding Neptune
  • Ask Patrick to show you the data summarization tool that he has built
  • Focus on the diagnosis and medication tables in Neptune and understand the columns
  • Use SQL to conduct exploratory data analysis (EDA) to understand the type, amount, and distribution of data in the diagnosis and medication tables
  • Generate plots, summaries, and reports for the EDA
@shyamvis

shyamvis commented Sep 3, 2018

Give Patrick (https://github.com/pohalloran) access here.

@kayhan-batmanghelich
Collaborator

@gatechke what is the latest status on this?

@gatechke
Collaborator Author

gatechke commented Sep 9, 2018

@kayhan-batmanghelich

I had a meeting with Patrick last Thursday. He was still working with another person to get me access to Neptune. I also asked Patrick to send me the existing data dictionary.

Currently, the Diagnosis table, the Procedures table, and a few reference tables are available on Neptune. The Medication table hasn't been loaded yet.

Shyam and I will have a one-on-one meeting on Monday morning.

@gatechke
Collaborator Author

gatechke commented Sep 27, 2018

@shyamvis
Table below: counts of distinct procedure codes by procedure coding system. ~1.2 million records across 806 procedure codes have a missing value in the procedure coding system field.

image
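For reference, a minimal sketch of how a summary like the one in the table could be produced. It runs against an in-memory SQLite table with made-up rows rather than Neptune; the table stands in for SAM_ETL.PROCEDURES, and the column name PROC_CODE_SYSTEM is an assumption (only PROC_CODE is confirmed in this thread).

```python
import sqlite3

# Stand-in for SAM_ETL.PROCEDURES. PROC_CODE_SYSTEM is a hypothetical
# column name, and the rows below are invented sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE PROCEDURES (PROC_CODE TEXT, PROC_CODE_SYSTEM TEXT)")
conn.executemany(
    "INSERT INTO PROCEDURES VALUES (?, ?)",
    [
        ("99213", "CPT4"),
        ("99213", "CPT4"),
        ("99214", "CPT4"),
        ("G0008", "HCPCS"),
        ("X123", None),
        ("X123", None),
        ("X456", None),
    ],
)

# Distinct codes and record counts per coding system; NULL coding systems
# form their own group and are labelled '(missing)'.
query = """
SELECT COALESCE(PROC_CODE_SYSTEM, '(missing)') AS coding_system,
       COUNT(DISTINCT PROC_CODE)               AS distinct_codes,
       COUNT(*)                                AS record_count
FROM PROCEDURES
GROUP BY PROC_CODE_SYSTEM
ORDER BY record_count DESC, coding_system
"""
summary = conn.execute(query).fetchall()
for coding_system, distinct_codes, record_count in summary:
    print(coding_system, distinct_codes, record_count)
```

The query itself uses only standard SQL, so it should carry over to Oracle once the real table and column names are substituted.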

@gatechke
Collaborator Author

gatechke commented Sep 27, 2018

@shyamvis
Top 10 procedure codes that don't have the procedure coding system populated:
image
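A rough sketch of a top-10 query of this kind, again against an in-memory SQLite stand-in with invented rows. PROC_CODE_SYSTEM is an assumed column name. SQLite uses LIMIT; on Oracle 12c or later the equivalent row limit would be FETCH FIRST 10 ROWS ONLY.

```python
import sqlite3

# Stand-in for SAM_ETL.PROCEDURES with invented rows.
# PROC_CODE_SYSTEM is a hypothetical column name.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE PROCEDURES (PROC_CODE TEXT, PROC_CODE_SYSTEM TEXT)")
sample_rows = (
    [("A1", None)] * 3
    + [("B2", None)] * 2
    + [("C3", "CPT4")] * 5
    + [("D4", None)]
)
conn.executemany("INSERT INTO PROCEDURES VALUES (?, ?)", sample_rows)

# Most frequent procedure codes among records with no coding system.
query = """
SELECT PROC_CODE, COUNT(*) AS record_count
FROM PROCEDURES
WHERE PROC_CODE_SYSTEM IS NULL
GROUP BY PROC_CODE
ORDER BY record_count DESC, PROC_CODE
LIMIT 10
"""
top_codes = conn.execute(query).fetchall()
for proc_code, record_count in top_codes:
    print(proc_code, record_count)
```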

@gatechke
Collaborator Author

@shyamvis
The good news is that no missing values were found in the procedure code field.

select count(*) from SAM_ETL.PROCEDURES
where PROC_CODE is null;

image

@shyamvis

Ke, create a document that gives a brief description of the columns in the Diagnosis table for future reference. You can include the counts table in it. Do the same for the Procedures table. Upload them to Dropbox.

@shyamvis

Ke, talk to Patrick and Brian McLay (whom Patrick can introduce) to get a description of each of the 11 procedure coding systems.

@gatechke
Collaborator Author

gatechke commented Oct 4, 2018

@shyamvis
I contacted Brian McLay to get a description of the 11 procedure coding systems. Based on his reply:

  • CPT(R) is the same as CPT4, which is Level I of HCPCS
  • HCPCS, ICD10PC, ICD9CM, CPT(R), and CPT4 are standard code types
  • The other, smaller procedure code types are custom values in the source systems, so we don't know their exact mapping to standard code types.
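One possible way to apply these findings during EDA is to fold CPT(R) into CPT4 and tag each coding system value as standard, custom, or missing. The sketch below is illustrative only: the function name and labels are made up, and it assumes the values are stored with exactly the spellings listed above.

```python
# Standard code types named in Brian's reply, with CPT(R) folded into CPT4.
STANDARD_SYSTEMS = {"HCPCS", "ICD10PC", "ICD9CM", "CPT4"}

# CPT(R) and CPT4 denote the same coding system.
ALIASES = {"CPT(R)": "CPT4"}


def normalize_coding_system(raw):
    """Return (normalized_name, kind), where kind is 'standard',
    'custom', or 'missing'. Hypothetical helper for illustration."""
    if raw is None or not raw.strip():
        return None, "missing"
    name = raw.strip()
    name = ALIASES.get(name, name)
    kind = "standard" if name in STANDARD_SYSTEMS else "custom"
    return name, kind


# "LOCAL_LAB" is an invented example of a source-system custom value.
for value in ["CPT(R)", "CPT4", "ICD9CM", "LOCAL_LAB", None]:
    print(value, "->", normalize_coding_system(value))
```

Custom values are left unmapped on purpose, since their correspondence to the standard code types is not known.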


3 participants