-
Notifications
You must be signed in to change notification settings - Fork 991
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Help draft the data.table community survey (open until September 26) #5686
Comments
I had a brief look... I would completely drop 5) |
Hi this is great thanks so much @sluga |
Thanks @jangorecki, could you elaborate why? My thinking was that the input from the broader community could inform some decisions. As an example, take the helper function for using data.table with the pipe. To me it would seem useful to know if, say, 95/100 users say "We need this" vs. if only 5/100 do. Note that I'm not suggesting that any survey result should be taken as binding in any way. This isn't a popularity contest, so the core team should follow their best judgment. But couldn't that judgment potentially be informed by survey results? |
Thanks @tdhock, I've added the extra options to the context & contribution questions. But I don't know about the Likert->Yes/No suggestion. Likert scales are very common in survey research & not particularly difficult to analyze, and we'd lose a lot of information by dichotomizing. |
I believe all governance processes we are trying to establish should be completely independent from what issue users/devs are liking/disliking/prioritizing/etc. We have, and promote, using upvote, so we have a way to measure users interest. ... OK. as for DT community survey (that meant not to be related to setting up governance) that make some sense, but assuming we want to focus on governance, and survey is meant to help on that, then I would skip that part. Or at least made that kind of DT community survey after we set up governance. We could promote survey in package startup message. But really, now I would completely focus on governance and use survey towards that, and then when that is set, make another survey "what you would like? etc. Maybe we simply don't need a survey now, to move forward with governance, then ti's fine, and point 5) could of course stay. My bad, I kind of get idea that survey will help to push governance setup, which doesn't have to be true. |
haha that is fine with me, I guess I just have the tendency to always respond either 1 or 5 on those kind of questions. |
Thanks for the elaboration @jangorecki. Yes, I wasn't thinking of the survey as purely about governance - should've made that clearer. Having two separate surveys - one now, one later - makes sense as well, but that way we'd probably be asking people to complete another survey in a couple of months, and it would likely repeat a number of questions (as it would be interesting to correlate some of the responses from sections 1/2 and 4/5). So I'm leaning toward the one-bigger-survey option. |
@sluga Sorry if this seems pedantic, but in the first two questions the options seem a bit untidy. For example, there is no option that includes 4.5 years. Maybe "2 - 4 years" is meant to include 4.5 years, 4.9 years, etc., but if that's the intended interpretation then it doesn't seem right that "13 - 24 months" and "2 - 4 years" have no gap between them. Also, the second question doesn't have the "Less than 3 months" option, but I'm guessing it was just left out by mistake. From a statistical point of view, it would be preferable to collect these as continuous variables to avoid loss of information by categorising. But I understand that collecting continuous variables might have disadvantages in other ways. |
For (1), could include a question like
This would give context for (2) re possible gaps between desire to contribute and (current) skills among respondents. (4) looks like a mix of quality-of-life and functionality points (the latter in the final bullets starting from "Programming") -- maybe these should be treated separately in some way? Eg,
I think this provides value above the 1-5 "Scope/Breadth of functionality" question by generating asks to consider when defining the package scope. (I doubt folks would bring up scope gaps re the "biggest challenges" question, which I suspect invites more QOL answers.) On the items in (4):
Maybe "Filtering and join functionality" since joins aren't covered by another bullet, and filtering doesn't seem like a major feature area on its own.
Maybe "Minimal dependencies" which also refers to the package working for R versions "as old as possible for as long as possible" (from the readme).
I think these sound like the same thing to most folks (like thumbs up or down for the syntax in general); and I'm not sure what expressive means in this context (?). Personally, I value the generality of the syntax (eg, so I can write
I'm wondering what a low score on "scope" would tell us -- everything is documented (as far as I have seen), so it could either mean the person would like to see more vignette coverage; could not find what they needed in the docs; or (unlikely) is trolling. Maybe switch to
Personally, I find the verbose output very useful and use it as a debugging tool even after learning do's and don'ts from errors and docs (eg, checking GForce to diagnose slower-than-expected operations; or # rows modified when I expect a join to be 1:1 instead of 1:n etc). |
Thanks for putting this together! On 1) How would I answer the last two questions ("In what context are you using data.table?" and "What are you currently using data.table for?") if my answer to "How often do you use data.table these days?" was "I don't use data.table anymore"? Similar consideration applies to later questions, where an earlier question precludes any answer to a later question. On 2)
On 5) I see Jan's point. It does seem a bit out of place, and with so many open-ended questions earlier in the survey, it feels like it's getting long. Maybe best is to remind users in the survey introduction to check out & interact with the issue tracker to express support/feedback for pending bugs/FRs. |
I know I'm late to the party, but this survey looks annoyingly long. Personally, I would see this and immediately close it. My two cents: boil it down to two or three questions including an open-ended What are your suggestion(s) for data.table moving forward? <--That's the question I want to answer :) |
Thanks everyone, I've begun incorporating your suggestions & will finalize the draft & address your comments sometime in the next few days. |
Hi @sluga, I guess the survey format is either finished or about to be finished. Thus, I was wondering how is going to be distributed to the data.table users? |
Hi @phisanti, yes, it's taken me longer than anticipated because of other obligations, but I'll post the link to the final draft today or tomorrow, asking @tdhock for a final check. To answer your question, I'm hoping to get some help on this front. Including an invitation to take part in the survey on the data.table website and in this repo seem like the obvious moves, and I think @jangorecki mentioned the possibility of including an invitation in the start-up message. I'm not much of a social media user, so I'm hoping to get some help there. Do you have any suggestions? |
I will be traveling to Montevideo to talk about data.table at the LatinR conference on Weds Oct 18, so it would be great if you could create the survey before then, so I could mention the survey during the talk. BTW slides are here if anyone wants to comment / leave constructive criticism: https://docs.google.com/presentation/d/1ypW1LUMmcrUTMF6B9h9s8qbvW5BSbN1IW6CEgqX01Co/edit?usp=sharing |
That'd be great @tdhock! Alright, the draft of the survey is here: https://forms.office.com/e/d7gLkySP3n?origin=lprLink A couple of notes:
|
In Q12, should the options say "important/unimportant" instead of "satisfied/dissatisfied"? |
Nice Q2 without seeing remaining questions is confusing. Better to put it at the end? Q7 could have options like data preparation/transformation, not everyone does statistical analysis,. Sometimes it is just a piece in big ETL pipeline. Good to mention at start how much time it can take, like 2 minutes. Good to create some short URL to be easily distributed on slides or pkg startup message. |
Thanks @markseeto, @jangorecki. I fixed Q12, moved the email & response-sharing questions to the end, and added data preparation to Q7. Not sure how to estimate time: the survey has 20 questions, including several open-ended ones, but respondents can skip questions/sections. URL: https://forms.office.com/e/d7gLkySP3n?origin=lprLink |
great thanks for the qr and short link, I added them to slides |
If the survey looks OK now, I suggest the following:
|
@sluga I would also open a new issue and pin it. |
I'm definitely can't change the README on master branch (only Matt can do that, you may ask him), but I may be able to pin an issue. |
I've opened a new issue (#5704) with the invitation and a PR (#5705) with the README update, hopefully @mattdowle sees it in time. |
Hi all,
following the discussion in #5676, here's the initial draft of a data.table community survey.
The plan is to collect suggestions/feedback in this issue until September 26 and then run the survey for a month.
Please give your suggestions below. I'll make a final draft after September 26 & then give @tdhock (or another senior data.table contributor) the chance to have a final say on survey content.
Draft
About you
Contributing to data.table
Evaluation & Priorities
Feedback on candidate/upcoming features
Haven't drafted the exact questions for this section yet, but I was thinking of covering:
:=
Project governance
The text was updated successfully, but these errors were encountered: