-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add either a function or a data.frame in the package to get the list of respondent metadata columns #272
Comments
I could see that being useful! I lean toward adding it as a dataframe? Could there be any automated way for us to regularly check (like with CI) that the info is not outdated? |
Maybe from this? And perhaps it could be instead a way to get all of the "dataTypes" for a survey rather than just having the list of metadata. It even has the display order stuff. But the browser meta info isn't included |
Also response quality columns: https://www.qualtrics.com/support/survey-platform/survey-module/survey-checker/response-quality/ eg, |
Hi @chrisumphlett, just saw your video. You got one thing wrong--you said it was "okay" that requests to exclude the metadata and questions didn't actually exclude them. It wasn't okay--it was my mistake. Sorry about that! But I think you had the right idea about an approach, and I think it should work now. (FYI @juliasilge requests for "exclude all types of this variable" specifically were being stripped out of the request body due to a subtlety I didn't catch, and didn't properly write a test for. Fixed now, plus added a suitable test.) So, @chrisumphlett, if you want just the metadata & embedded data for one table, you should be able to do it this way: fetch_survey(
"[surveyid]",
include_questions = NA
) Similarly, for just the questions (keeping response ID to pivot/merge on): fetch_survey(
"[surveyid]",
include_metadata = "ResponseId",
include_embedded = NA
) Does that look right for what you need? I still tend to agree with your thought that having something that more automatically stays up-to-date with the API schema would be better, but I hopefully this can serve well enough for now. |
Thanks Jim. This is definitely an improvement, not perfect (through not fault of yours). Qualtrics data model is not well-aligned with the categories available. When I run the call to exclude questions, I get some things I don't want, and don't get things I do want. Things that are included
Things not included
This stuff can be programmed around. I'm doing this with the browser fields now, looking for questions that end in It's a strange, arbitrary division that Qualtrics makes. |
Here's a 2.5m video that shows what I ultimately settled on with several hardcoded lists and the |
Let's reopen this and still consider if we should add something like a dataframe of columns. |
@chrisumphlett, I didn't notice the music in the last video until I realized I was grooving a bit by the end. Thanks for that. You're describing things that closely overlap with what I dealt with in my own big project, so I can sympathize. The trouble with the browser meta items and the timing items, both, is that they are types of questions in the Qualtrics sense--an item that goes in a block on the main page (using the web interface). So I don't think we can do anything about that via tweaks to the API requests. Same for display ordering items, which despite being metadata are closely linked with their questions. I actually worry about We should think about this more, but like you I ended up setting up a programmatic solution. But I will say that was why I added Below is one from my API testing survey. The key tool for me was the "ImportId" column, which is non-user-editable and thus far more standardized. I added in browser meta, timing, topics, quality checks & scoring to see:
Here's how they break out via API params: Metadata (same as always, looks pretty fixed): Questions (all QID, FL_, BL_, almost, see next): Embedded (embeddeds, quality, scores, & topics. Topics breaks the pattern for QID's): |
So, anyway, one option I'm thinking about now that I've written this is that we could apply this logic to the code that creates column mapping and make another column that labels variables by type based on "ImportId", which would make filtering like this a lot more straightforward. Wouldn't be too hard--basically something I did before for myself. I could see edge cases we'd need to address (e.g., I've encountered embedded data fields labeled as "status" so you'd be facing duplicates). But I think it's something we could work around (we already know what metadata columns the user requested for any given download). Still might be good if there's a way to ensure this stays up-to-date with whatever Qualtrics does for the API, but that seems like a separate question. |
Caveat - we don't know what the user requested if they just call |
re: the music. Using my company's video editor, Camtasia, you can add an "emphasize audio" effect to your screen recording and it automatically suppresses other audio to a good background level. Glad you enjoyed :) I asked our customer researcher to not use I agree w/you that Qualtrics treats the browser fields as questions because they are blocks in the survey flow. My philsophical objection I guess would be that it shouldn't be added to the survey flow in the first place. Perhaps there's some reason in their view that it needs to be done that way. I don't think I have a good perspective on balancing usability for the broad user base of the package and these types of power user functionality. For me, the ability to get the embedded data dynamically has made a significant improvement in my process in combination with the static lists for respondent metadata, browser, and timing. |
Thanks. Generally agree; separating out variable types was a really important part of my processes as well. We might eventually want to think about something better for handling duplicated names, including things like |
I would like to be able to have the package provide a global list of the 17 standard metadata columns (plus browser meta info which comes through like response data but should be treated like metadata IMHO), rather than me having them hardcoded. Perhaps others have them hardcoded too; if/when Qualtrics adds another, we'll all need to update our list. If the package holds it then we could automagically have it update whatever we're doing in our code.
Why, you might ask? Here's a 3:42 video with context on why I have a hardcoded list today, and why you might want to have one as well.
re: the missing metadata I refer to in the video, I was wrong. Qualtrics labels it as "Response Type" but the column is "Status" and that is included.
I can work on this but am curious to hear how those of you who have managed the package long-term feel about including this at all, and if so, whether you'd want it as a function, data.frame, or something else.
Relevant links from Qualtrics' docs:
The text was updated successfully, but these errors were encountered: