Analyzes data from surveys re: personal data
We surveyed two populations:
- Qualtrics panel with minimum quotas for Black folks and non-users of social media (N = 586)
- Mechanical Turkers (N = 389)
I did some of the data prep manually:
- removing columns that are for Qualtrics or MTurk but not data
- removing headers that are not reasonable variable names
- replacing spaces in variable names with underscores
- replace ’ with apostrophe
- removing display order variables
- lower casing variable names
- removed preview responses
It was faster to do by hand in Excel than to do programmatically.
- notebooks: contains Jupyter notebooks with code for prepping and analyzing data; the prep_survey_responses notebook does most of the data type and recoding work. Individual analysis notebooks focus on a particular research question and often start with some serious
mutation
work to get data into the right format. - reports: contains output like tables and figures
Data is available through Deep Blue Data with the identifier https://doi.org/10.7302/6vjf-av59.
The data is also provided in the data folder of this repo.
Here's the original mapping of variable name to question from Qualtrics. The prep notebook changes some of these variable names, relevels, and recodes dummies.
Q43 | Are you 18 years old or over? |
---|---|
social media use | Do you ever use social media sites like Facebook, Twitter, or Instagram? |
how often_1 | Thinking about the social media sites you use. About how often do you visit or use - Facebook |
how often_2 | Thinking about the social media sites you use. About how often do you visit or use - Instagram |
how often_3 | Thinking about the social media sites you use. About how often do you visit or use - LinkedIn |
how often_4 | Thinking about the social media sites you use. About how often do you visit or use - Nextdoor |
how often_5 | Thinking about the social media sites you use. About how often do you visit or use - Pinterest |
how often_6 | Thinking about the social media sites you use. About how often do you visit or use - Reddit |
how often_7 | Thinking about the social media sites you use. About how often do you visit or use - Snapchat |
how often_8 | Thinking about the social media sites you use. About how often do you visit or use - TikTok |
how often_9 | Thinking about the social media sites you use. About how often do you visit or use - Twitter |
how often_10 | Thinking about the social media sites you use. About how often do you visit or use - WhatsApp |
how often_11 | Thinking about the social media sites you use. About how often do you visit or use - YouTube |
tweets public | Are your tweets publicly viewable? |
insta public | Are your Instagram posts publicly viewable? |
researchers_1 | Is it ok for academic researchers to use - texts from your social media posts to study how language changes over time? |
researchers_2 | Is it ok for academic researchers to use - texts from your social media posts to train bots to help reduce racist messages? |
researchers_3 | Is it ok for academic researchers to use - texts from your social media posts to figure out how to show you relevant ads? |
researchers_4 | Is it ok for academic researchers to use - texts or images from your social media posts to provide you with interventions on social media that may help you gain support and feel better? |
researchers_5 | Is it ok for academic researchers to use - texts or images from your social media posts to predict whether you are at risk of harming yourself? |
researchers_6 | Is it ok for academic researchers to use - texts or images from your social media posts to predict whether you are at risk of harming others? |
researchers_7 | Is it ok for academic researchers to use - texts or images from your social media posts to study how misinformation spreads? |
researchers_8 | Is it ok for academic researchers to use - text of your social media posts and millions of others’ posts to predict election outcomes? |
researchers_9 | Is it ok for academic researchers to use - posts you’ve deleted from social media to study how information gets updated during crises like natural disasters? |
researchers_10 | Is it ok for academic researchers to use - images you’ve posted to social media to study variability in vegetation in National Parks? |
researchers_11 | Is it ok for academic researchers to use - images you’ve uploaded to social media to train facial recognition software? |
researchers_12 | Is it ok for academic researchers to use - images from security camera footage to train facial recognition software? |
researchers_13 | Is it ok for academic researchers to use - social media posts you made at a protest in a virtual exhibit of related posts about the protest? |
researchers_14 | Is it ok for academic researchers to use - metadata from your photos in social media to create a public map of peony gardens in your area? |
researchers_15 | Is it ok for academic researchers to use - location data from your social media posts to understand the spread of COVID-19? |
researchers_16 | Is it ok for academic researchers to use - timestamps of your social media posts to understand the spread of COVID-19? |
researchers_17 | Is it ok for academic researchers to use - your cell phone location data to study commuting patterns? |
researchers_18 | Is it ok for academic researchers to use - your voter file to study voter turnout over time and between groups? |
researchers_19 | Is it ok for academic researchers to use - your grocery store purchases to understand the influence of other shoppers? |
researchers_20 | Is it ok for academic researchers to use - your grocery store purchases to figure out how to send you coupons? |
sm companies_1 | Is it ok for social media companies to use - texts from your social media posts to study how language changes over time? |
sm companies_2 | Is it ok for social media companies to use - texts from your social media posts to train bots to help reduce racist messages? |
sm companies_3 | Is it ok for social media companies to use - texts from your social media posts to show you relevant ads? |
sm companies_4 | Is it ok for social media companies to use - texts or images from your social media posts to provide you with interventions on social media that may help you gain support and feel better? |
sm companies_5 | Is it ok for social media companies to use - texts or images from your social media posts to predict whether you are at risk of harming yourself? |
sm companies_6 | Is it ok for social media companies to use - texts or images from your social media posts to predict whether you are at risk of harming others? |
sm companies_7 | Is it ok for social media companies to use - texts or images from your social media posts to study how misinformation spreads? |
sm companies_8 | Is it ok for social media companies to use - text of your social media posts and millions of others’ posts to predict election outcomes? |
sm companies_9 | Is it ok for social media companies to use - posts you’ve deleted from social media to study how information gets updated during crises like natural disasters? |
sm companies_10 | Is it ok for social media companies to use - images you’ve posted to social media to study variability in vegetation in National Parks? |
sm companies_11 | Is it ok for social media companies to use - images you’ve uploaded to social media to train facial recognition software? |
sm companies_12 | Is it ok for social media companies to use - images from security camera footage to train facial recognition software? |
sm companies_13 | Is it ok for social media companies to use - social media posts you made at a protest in a virtual collection of related posts about the protest? |
sm companies_14 | Is it ok for social media companies to use - metadata from your photos in social media to create a public map of peony gardens in your area? |
sm companies_15 | Is it ok for social media companies to use - location data from your social media posts to understand the spread of COVID-19? |
sm companies_16 | Is it ok for social media companies to use - timestamps of your social media posts to understand the spread of COVID-19? |
sm companies_17 | Is it ok for social media companies to use - your cell phone location data to study commuting patterns? |
sm companies_18 | Is it ok for social media companies to use - your voter file to study voter turnout over time and between groups? |
sm companies_19 | Is it ok for social media companies to use - your grocery store purchases to understand the influence of other shoppers? |
sm companies_20 | Is it ok for social media companies to use - your grocery store purchases to send you coupons? |
journalists_1 | Is it ok for journalists to use - texts from your social media posts in a story about how language changes over time? |
journalists_2 | Is it ok for journalists to use - texts from your social media posts in a story about bots trained to reduce racist messages? |
journalists_3 | Is it ok for journalists to use - texts from your social media posts in a story about algorithms that recognize emotions? |
journalists_4 | Is it ok for journalists to use - texts or images from your social media posts in a story about misinformation? |
journalists_5 | Is it ok for journalists to use - text of your social media posts in a story about elections? |
journalists_6 | Is it ok for journalists to use - posts you’ve deleted from social media in a story about natural disasters? |
journalists_7 | Is it ok for journalists to use - images you’ve posted to social media in a story about vegetation in National Parks? |
journalists_8 | Is it ok for journalists to use - images you’ve uploaded to social media in a story about facial recognition software? |
journalists_9 | Is it ok for journalists to use - images from security camera footage in a story about facial recognition software? |
journalists_10 | Is it ok for journalists to use - social media posts you made at a protest in a story about the protest? |
journalists_11 | Is it ok for journalists to use - metadata from your photos in social media to create a public map of peony gardens in your area? |
journalists_12 | Is it ok for journalists to use - location data from your social media posts in a story about the spread of COVID-19? |
journalists_13 | Is it ok for journalists to use - timestamps of your social media posts in a story about the spread of COVID-19? |
journalists_14 | Is it ok for journalists to use - your cell phone location data in a story about commuting patterns? |
journalists_15 | Is it ok for journalists to use - your voter file in a study about voter turnout? |
journalists_16 | Is it ok for journalists to use - your grocery store purchases in a story about how others influence what we buy? |
sometimes | Did you find yourself wanting to answer "sometimes" or "it depends" to any of the questions about academic researchers, journalists, and social media companies using data? |
sometimes text | Why did you want to answer "sometimes" or "it depends"? For instance, on what does it depend or when would you say "sometimes"? |
digital privacy_1 | Please respond how much you agree or disagree with the following statements. - When companies or government agencies try to collect my personal information, I sometimes hesitate to provide it. |
digital privacy_2 | Please respond how much you agree or disagree with the following statements. - I am concerned that companies or government agencies do not allow me to delete information I've given them. |
digital privacy_3 | Please respond how much you agree or disagree with the following statements. - It usually bothers me that companies or government agencies don't offer a process for me to request deletion of information I've given them. |
digital privacy_4 | Please respond how much you agree or disagree with the following statements. - It usually bothers me that companies or government agencies do not give me the option to have my information deleted. |
social science | Researchers should be able to get my social media data without my permission if it will help them to do research that will advance knowledge about society and social relationships. |
data archive_1 | How important is it that researchers ask you for your permission to add the following information to a data archive? - your public social media posts? |
data archive_2 | How important is it that researchers ask you for your permission to add the following information to a data archive? - your grocery purchases? |
data archive_3 | How important is it that researchers ask you for your permission to add the following information to a data archive? - your cell phone location data? |
data archive_4 | How important is it that researchers ask you for your permission to add the following information to a data archive? - your voting records (only whether you voted in a particular election, not how you voted)? |
data archive_5 | How important is it that researchers ask you for your permission to add the following information to a data archive? - your anonymized health records? |
secure archive | Do you think a secure social media archive is a good idea or a bad idea? |
anonymous archive | Do you think an anonymous social media archive is a good idea or a bad idea? |
concern-misuse | How concerned are you about the possibility that your data might be misused if it's in a data archive? |
concern-harm | How concerned are you that you could be harmed through misuse of your data? |
age | Please select your age range |
gender identity_1 | Which of the following best describes your current gender identity? (select all that apply) - Woman |
gender identity_2 | Which of the following best describes your current gender identity? (select all that apply) - Man |
gender identity_3 | Which of the following best describes your current gender identity? (select all that apply) - Transgender |
gender identity_4 | Which of the following best describes your current gender identity? (select all that apply) - Nonbinary/genderqueer |
gender identity_5 | Which of the following best describes your current gender identity? (select all that apply) - Prefer not to answer |
gender identity_6 | Which of the following best describes your current gender identity? (select all that apply) - Something else |
sexual orientation_1 | Which of the following best describes your current sexual orientation? (select all that apply) - Gay or lesbian |
sexual orientation_2 | Which of the following best describes your current sexual orientation? (select all that apply) - Heterosexual (straight) |
sexual orientation_3 | Which of the following best describes your current sexual orientation? (select all that apply) - Bisexual |
sexual orientation_4 | Which of the following best describes your current sexual orientation? (select all that apply) - Prefer not to answer |
sexual orientation_5 | Which of the following best describes your current sexual orientation? (select all that apply) - Something else |
race ethnicity_1 | Which of the following best describes your race and/or ethnicity? (select all that apply) - American Indian or Alaskan Native |
race ethnicity_2 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Asian |
race ethnicity_3 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Black or African American |
race ethnicity_4 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Hispanic or Latino |
race ethnicity_5 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Middle Eastern or North Africa |
race ethnicity_6 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Native Hawaiian or other Pacific Islander |
race ethnicity_7 | Which of the following best describes your race and/or ethnicity? (select all that apply) - White |
race ethnicity_8 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Prefer not to answer |
race ethnicity_9 | Which of the following best describes your race and/or ethnicity? (select all that apply) - Something else |
education | What is the highest level of education you have completed or the highest degree you received? |
income | What was your total household income during the past 12 months? |
sensitivity_1 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Mother's maiden name |
sensitivity_2 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Handwriting Sample |
sensitivity_3 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Voice Print |
sensitivity_4 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Fingerprint |
sensitivity_5 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Driver's License Number |
sensitivity_6 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Vehicle Registration Number |
sensitivity_7 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Online Screen Name |
sensitivity_8 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Race |
sensitivity_9 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Religion |
sensitivity_10 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Weight |
sensitivity_11 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Height |
sensitivity_12 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - License Plate Number |
sensitivity_13 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Medical History |
sensitivity_14 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Family / Friends contact information |
sensitivity_15 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Mental health |
sensitivity_16 | If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Emotions |
trust_1 | How much trust do you personally have in each of the following? - People in general? |
trust_2 | How much trust do you personally have in each of the following? - Journalists? |
trust_3 | How much trust do you personally have in each of the following? - Academic researchers? |
trust_4 | How much trust do you personally have in each of the following? - America's courts and legal system? |
trust_5 | How much trust do you personally have in each of the following? - Major private companies in America? |
trust_6 | How much trust do you personally have in each of the following? - Social media companies? |
privacy behaviors_1 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you read a website’s privacy policy before you register your information? |
privacy behaviors_2 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you look for a privacy certification on a website before you register your information? |
privacy behaviors_3 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you read license agreements fully before you agree to them? |
privacy behaviors_4 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you watch for ways to control what people send you online (such as check boxes that allow you to opt-in or opt-out of certain offers)? |
privacy behaviors_5 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you remove cookies? |
privacy behaviors_6 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you clear your browser history regularly? |
privacy behaviors_7 | For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you block messages/emails from someone you do not want to hear from? |
R version 4.1.0 (2021-05-18)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: CentOS Linux 7 (Core)
Matrix products: default
BLAS: /sw/arcts/centos7/stacks/gcc/8.2.0/R/4.1.0/lib64/R/lib/libRblas.so
LAPACK: /sw/arcts/centos7/stacks/gcc/8.2.0/R/4.1.0/lib64/R/lib/libRlapack.so
locale:
[1] LC_CTYPE=en_US.UTF-8 LC_NUMERIC=C
[3] LC_TIME=en_US.UTF-8 LC_COLLATE=en_US.UTF-8
[5] LC_MONETARY=en_US.UTF-8 LC_MESSAGES=en_US.UTF-8
[7] LC_PAPER=en_US.UTF-8 LC_NAME=C
[9] LC_ADDRESS=C LC_TELEPHONE=C
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] margins_0.3.26 ggeffects_1.1.1 lme4_1.1-27.1 Matrix_1.3-3
[5] tidyr_1.1.4 purrr_0.3.4 sjPlot_2.8.9 arsenal_3.6.3
[9] stargazer_5.2.2 ggplot2_3.3.5 readr_2.0.2 stringr_1.4.0
[13] reshape2_1.4.4 forcats_0.5.1 dplyr_1.0.7