Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make consistent variable names in all dbs #58

Open
kellymccain28 opened this issue Jan 15, 2025 · 0 comments
Open

Make consistent variable names in all dbs #58

kellymccain28 opened this issue Jan 15, 2025 · 0 comments

Comments

@kellymccain28
Copy link
Collaborator

The first few pathogens (I believe up to SARS) used a database with a couple of mistakes in variable name spellings or definitions. First, the parameter_uncertainty_singe_type variable was misspelled in the parameter form, and second, the first and last names of authors were flipped, so the first name was stored in FirstAauthor_Surname and the surname was stored in FirstAuthor_FirstName.

In more recent versions of the database (from Zika, I'm not sure about SARS and Lassa), parameter_uncertainty_singe_type was updated to be spelled correctly: parameter_uncertainty_single_type. The author variable names were switched so the first name was stored in the first name variable and vice versa for surname. The spelling error in FirstAauthor_Surname was not fixed.

The cleaning code (cleaning.R in db_compilation) at the moment changes the correctly spelled variable to be incorrectly spelled to allow any code from previous pathogens to work. However, it would be good to update this for all pathogens so it is the other way around -- the incorrectly spelled variable is changed to the correctly spelled one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant