Skip to content

Copying field data

cjshawMIT edited this page Feb 14, 2018 · 8 revisions

When collecting data from the field, you'll want to grab the qbank assessment data as well as any student uploaded files.

On Windows unplatform installations, there is a script provided in unplatform, called DataExtractionScript.bat. Just run that script and copy the resulting zip file. That script takes care of assessment data export as well as student uploaded files, and bundles both sets of data into a single zip file.

MongoDB

For a gstudio / MongoDB installation, just do a mongodump and you'll have all the qbank data! Depending on your configuration, it will most likely be stored in the following databases

  • assessment
  • assessment_authoring
  • hierarchy
  • id
  • logging
  • relationship
  • repository
  • resource

Specific collections

In the field, student-responses is only saved in certain database collections -- the remainder is data used to serve the assessments, and does not change in the field. Student-related data is in:

  • assessment/AssessmentSection
  • assessment/AssessmentTaken
  • logging/Log
  • logging/LogEntry
  • repository/Asset
  • repository/Repository

Also, any files uploaded as part of an assessment are put into the webapps/CLIx/studentResponseFiles directory (see below for more details).

Make sure that all of the above data is saved by the field service provides and transported back to the central data store, for analytics and research purposes.

NOTE If you expect that teachers in the field will be authoring new content, then you MUST save / export the entire MongoDB datastore (not just the student-specific collections listed above), so that you capture the new questions and assessments.

Merging multiple MongoDBs

Once you have collected the data from multiple schools, you can merge all the data into a single MongoDB instance with mongorestore. If you collected the raw MongoDB data (i.e. *.wt files), then first you'll need to run mongodump on each of the data sets.

Converting raw MongoDB file into a database dump

To convert from the raw WiredTiger *.wt files to a proper MongoDB database dump, you have to run mongodump against a running MongoDB instance. For example, if your *.wt files are located in a directory called rj1:

$ mongod --dbpath rj1 &
$ mongodump -o rj1-dump

Merging data dumps

Once you have a set of dump directories, you can now restore then into a single database. For the first set, you'll use the --drop flag to clean out any existing data:

$ mongorestore --drop rj1-dump

For all subsequent directories, do not use the --drop flag, to preserve the previously merged data sets.

$ mongorestore rj2-dump

Filesystem storage

For an unplatform installation, the JSON data files will be located in the webapps/ directory under the qbank installation (the path can be found by visiting https://localhost:8080/datastore_path). You'll want to copy the entire webapps/CLIx/ directory (will include the student-uploaded files).

Student-uploaded files

In either of the two above situations (gstudio or unplatform), you still need to copy the student-uploaded files, which are not stored in GridFS, but instead are stored on the filesystem. These files will be located in the webapps/ directory under the qbank installation (the full path can be found by visiting https://localhost:8080/datastore_path). You'll want to copy the entire webapps/CLIx/studentResponseFiles/ directory.