Creating semantic concept-feature norms using STRUDEL
Below is the process you need to run in order to contribute a language to the collection.
- Open up
process_strudels.Rmd
in the root (~/
) folder. It is the same folder that contains thisREADME.md
. - Select a language not currently completed.
See
~/data/udpipe_languages.csv
'sCompleted
column. - Select a number of sub processes. Each sub process takes one core and ~2GB of RAM.
- Run, NOT KNIT, everything.
This will:
- Download a new language file into the
~/data
folder. - Download a new language udpipe control file into the
~/
folder. - Splits the langage file into smaller files for parallel processing.
This makes more files in the
~/data
folder. - Runs each smaller file in its own process.
This generates files in the
~/concept-feature
folder. - Combines the files into a single file.
- Download a new language file into the
- Upload the combined file to the releases in GitHub.
- Update the releases and
~/data/udpipe_languages.csv
noting the progress.