You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Function to add "filtering columns" for duplicates. Function should ask for dataset_id, list of traits to filter, and (maybe) list of studies to filter. Output would be added columns in data.csv file specifying the dataset & observation_id from the dataset that is the suspected duplicates. A second output would be R code that can be copied and pasted into custom R code. The step of looking at the duplicates table and identifying traits/datasets to filter should remain manual.
Various bits of code to convert between tables and yml format:
Code/function to read a .csv file into the substitutions section of metadata file
Code/function to read a .csv file into the taxonomic updates section of metadata file
Code to generate a table that compiles all the "additional_traits" listed. As in, under questions: the subheading additional_traits: provides a list of trait data in a study that isn't yet merged into AusTraits (i.e. traits not yet supported by AusTraits). I've been wondering how to create a table, where one column is the dataset_id and a second column is the information listed under additional_traits:.
The text was updated successfully, but these errors were encountered:
ehwenk
changed the title
Functions / code for faster metadata entry & error checking
Improvements for adding new studies: Possible functions / code
Jul 7, 2021
ehwenk
changed the title
Improvements for adding new studies: Possible functions / code
[traits.build adding studies functions] Improvements for adding new studies: Possible functions / code
Jul 31, 2024
dataset_id
, list of traits to filter, and (maybe) list of studies to filter. Output would be added columns in data.csv file specifying the dataset & observation_id from the dataset that is the suspected duplicates. A second output would be R code that can be copied and pasted into custom R code. The step of looking at the duplicates table and identifying traits/datasets to filter should remain manual.Various bits of code to convert between tables and yml format:
substitutions
section of metadata filetaxonomic updates
section of metadata filequestions:
the subheadingadditional_traits:
provides a list of trait data in a study that isn't yet merged into AusTraits (i.e. traits not yet supported by AusTraits). I've been wondering how to create a table, where one column is the dataset_id and a second column is the information listed underadditional_traits:
.The text was updated successfully, but these errors were encountered: