-
Notifications
You must be signed in to change notification settings - Fork 71
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Consider making convert = FALSE
the default in fetch_survey()
#281
Comments
We can consider whether this change is a good idea, or whether we should solve these types of problems in a better way such as outlined in #267. I am concerned about changing a very old default (predates my involvement with this package), especially when we don't have nice ways to communicate this kind of change to users and we think the situation will change moving away from the old v2 endpoint. |
Thanks. I feel the same way, in that it's something I'm reluctant to touch given its history. I also personally don't care for the feature and always use it turned off, but for that reason I may also lack perspective for those that do. As noted elsewhere, theres this new So, we could theoretically do something that would link up recodes and labels, without calls to any external endpoints. That said, the more I think about it it's not quite the same, and I think there are some important considerations for how to get this right:
|
@juliasilge re: your wise point in #278 about not annoying users with warnings, one thing I'm curious about. In the tidyverse packages I've seen a "warn on first instance in a session" behavior for certain actions, e.g.: require(tidyverse)
vbls <-
c("mpg", "drat")
mtcars |>
select(test) If we did end up changing our minds and decided to change the default, how hard would it be to do something like that? |
@juliasilge thanks, that's useful to have in the toolbox. FYI, all the above is essentially why the PR for It was hacky in part because the response schema weren't yet published at the time. Now they're out so they could be a great resource, but I'm still a little unclear where/how to get the specific endpoint response schema that describes everything that might come back from |
I think the documentation here is pretty good for that endpoint. Does it have what you were thinking of? |
Sort of--yes, the schema they're now publishing here are what I'm interested in. But using the web display for that isn't great. Main issue is similar the one before when I was mining the survey description downloads themselves--the trees describing the survey/schema are of arbitrary depth, so it's not easy to map out where everything important is/would be. A smaller challenge here is that the schema references other sub-schema, e.g. the Questions object that embeds what comes from here. (and is something we'd need if we want to properly update Here's what I know so far. The schema for each endpoint docs page aren't a part of the webpage source, but are imported via JS-based calls to some more programmatic API reference files served from company Stoplight, e.g. these things I dug out: (These are JSON, Firefox has a nice tool for formatting these but apologies if it's a mess on your screen) That first link looks like the raw data we would want, just without the organizing/labeling the web page offers. I (or someone) could put the time in to manually build that framework from that raw data, sure--but I'm wondering if someone who knew things like web scraping and/or conventions of API publishing would see how this could be done faster + in a more maintainable way. Both are way outside my expertise, though. |
Side note--these same Stoplight files are probably where we'd get any standardized list of metadata vars such as @chrisumphlett suggested in #272. |
@juliasilge what would you think about making
convert = FALSE
the default, at least until we can get things off the v2 endpoint? I know conversion has been the default behavior for a long time, but I'm concerned we might be steering users, maybe esp. new ones, towards potentially problematic behavior.As @context-dependent noted, that change would just mean that users by default get the same data they get from a web download with default settings (still with some bonuses like cleaning up the metadata row & a column map)
Originally posted by @jmobrien in #278 (comment)
The text was updated successfully, but these errors were encountered: