Code for analyzing data from the personal and social media data survey

casmlab/personal-data-survey

personal-data-survey

This repo contains code for analyzing data from surveys about personal data.

We surveyed two populations:

  1. Qualtrics panel with minimum quotas for Black folks and non-users of social media (N = 586)
  2. Mechanical Turkers (N = 389)

Manual data prep

I did some of the data prep manually:

  • removing columns that are for Qualtrics or MTurk but not data
  • removing headers that are not reasonable variable names
  • replacing spaces in variable names with underscores
  • replacing ‚Äô with an apostrophe
  • removing display order variables
  • lower-casing variable names
  • removing preview responses

It was faster to do by hand in Excel than to do programmatically.
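
For anyone who would rather script the header cleanup, the renaming steps above can be sketched roughly as follows. This is a minimal Python illustration, not code from this repo. It assumes the garbled apostrophe came from UTF-8 bytes read as Mac OS Roman, which appears to match the sequence quoted in the list, and that display order columns contain a `_do_` marker once lower-cased. The names `clean_name`, `clean_headers`, and `DISPLAY_ORDER_MARKER` are hypothetical.

```python
import re

# Assumption: the garbled apostrophe is what the UTF-8 bytes of U+2019
# look like when they are decoded as Mac OS Roman.
MOJIBAKE_APOSTROPHE = "\u2019".encode("utf-8").decode("mac_roman")

# Assumption: display order columns carry a "_do_" marker once lower-cased.
DISPLAY_ORDER_MARKER = "_do_"


def clean_name(name: str) -> str:
    """Normalize one raw column header into a variable name."""
    name = name.replace(MOJIBAKE_APOSTROPHE, "'")
    name = name.strip().lower()
    # Collapse each run of whitespace into a single underscore.
    return re.sub(r"\s+", "_", name)


def clean_headers(headers):
    """Clean every header, then drop the display order columns."""
    cleaned = [clean_name(h) for h in headers]
    return [c for c in cleaned if DISPLAY_ORDER_MARKER not in c]


if __name__ == "__main__":
    # Hypothetical raw headers, for illustration only.
    print(clean_headers(["How Often_1", "Tweets Public", "Q5_DO_1"]))
    # prints ['how_often_1', 'tweets_public']
```

Removing platform-only columns and preview responses depends on the export layout, so those steps are left out of the sketch.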

Repo structure

  • notebooks: contains Jupyter notebooks with code for prepping and analyzing data; the prep_survey_responses notebook does most of the data type and recoding work. Individual analysis notebooks focus on a particular research question and often start with some serious mutation work to get data into the right format.
  • reports: contains output like tables and figures

Data

Data is available through Deep Blue Data with the identifier https://doi.org/10.7302/6vjf-av59.

The data is also provided in the data folder of this repo.

Codebook

Here's the original mapping of variable name to question from Qualtrics. Each line below gives the variable name followed by the question text. The prep notebook changes some of these variable names, relevels factors, and recodes dummy variables.

Q43 Are you 18 years old or over?
social media use Do you ever use social media sites like Facebook, Twitter, or Instagram?
how often_1 Thinking about the social media sites you use. About how often do you visit or use - Facebook
how often_2 Thinking about the social media sites you use. About how often do you visit or use - Instagram
how often_3 Thinking about the social media sites you use. About how often do you visit or use - LinkedIn
how often_4 Thinking about the social media sites you use. About how often do you visit or use - Nextdoor
how often_5 Thinking about the social media sites you use. About how often do you visit or use - Pinterest
how often_6 Thinking about the social media sites you use. About how often do you visit or use - Reddit
how often_7 Thinking about the social media sites you use. About how often do you visit or use - Snapchat
how often_8 Thinking about the social media sites you use. About how often do you visit or use - TikTok
how often_9 Thinking about the social media sites you use. About how often do you visit or use - Twitter
how often_10 Thinking about the social media sites you use. About how often do you visit or use - WhatsApp
how often_11 Thinking about the social media sites you use. About how often do you visit or use - YouTube
tweets public Are your tweets publicly viewable?
insta public Are your Instagram posts publicly viewable?
researchers_1 Is it ok for academic researchers to use - texts from your social media posts to study how language changes over time?
researchers_2 Is it ok for academic researchers to use - texts from your social media posts to train bots to help reduce racist messages?
researchers_3 Is it ok for academic researchers to use - texts from your social media posts to figure out how to show you relevant ads?
researchers_4 Is it ok for academic researchers to use - texts or images from your social media posts to provide you with interventions on social media that may help you gain support and feel better?
researchers_5 Is it ok for academic researchers to use - texts or images from your social media posts to predict whether you are at risk of harming yourself?
researchers_6 Is it ok for academic researchers to use - texts or images from your social media posts to predict whether you are at risk of harming others?
researchers_7 Is it ok for academic researchers to use - texts or images from your social media posts to study how misinformation spreads?
researchers_8 Is it ok for academic researchers to use - text of your social media posts and millions of others’ posts to predict election outcomes?
researchers_9 Is it ok for academic researchers to use - posts you’ve deleted from social media to study how information gets updated during crises like natural disasters?
researchers_10 Is it ok for academic researchers to use - images you’ve posted to social media to study variability in vegetation in National Parks?
researchers_11 Is it ok for academic researchers to use - images you’ve uploaded to social media to train facial recognition software?
researchers_12 Is it ok for academic researchers to use - images from security camera footage to train facial recognition software?
researchers_13 Is it ok for academic researchers to use - social media posts you made at a protest in a virtual exhibit of related posts about the protest?
researchers_14 Is it ok for academic researchers to use - metadata from your photos in social media to create a public map of peony gardens in your area?
researchers_15 Is it ok for academic researchers to use - location data from your social media posts to understand the spread of COVID-19?
researchers_16 Is it ok for academic researchers to use - timestamps of your social media posts to understand the spread of COVID-19?
researchers_17 Is it ok for academic researchers to use - your cell phone location data to study commuting patterns?
researchers_18 Is it ok for academic researchers to use - your voter file to study voter turnout over time and between groups?
researchers_19 Is it ok for academic researchers to use - your grocery store purchases to understand the influence of other shoppers?
researchers_20 Is it ok for academic researchers to use - your grocery store purchases to figure out how to send you coupons?
sm companies_1 Is it ok for social media companies to use - texts from your social media posts to study how language changes over time?
sm companies_2 Is it ok for social media companies to use - texts from your social media posts to train bots to help reduce racist messages?
sm companies_3 Is it ok for social media companies to use - texts from your social media posts to show you relevant ads?
sm companies_4 Is it ok for social media companies to use - texts or images from your social media posts to provide you with interventions on social media that may help you gain support and feel better?
sm companies_5 Is it ok for social media companies to use - texts or images from your social media posts to predict whether you are at risk of harming yourself?
sm companies_6 Is it ok for social media companies to use - texts or images from your social media posts to predict whether you are at risk of harming others?
sm companies_7 Is it ok for social media companies to use - texts or images from your social media posts to study how misinformation spreads?
sm companies_8 Is it ok for social media companies to use - text of your social media posts and millions of others’ posts to predict election outcomes?
sm companies_9 Is it ok for social media companies to use - posts you’ve deleted from social media to study how information gets updated during crises like natural disasters?
sm companies_10 Is it ok for social media companies to use - images you’ve posted to social media to study variability in vegetation in National Parks?
sm companies_11 Is it ok for social media companies to use - images you’ve uploaded to social media to train facial recognition software?
sm companies_12 Is it ok for social media companies to use - images from security camera footage to train facial recognition software?
sm companies_13 Is it ok for social media companies to use - social media posts you made at a protest in a virtual collection of related posts about the protest?
sm companies_14 Is it ok for social media companies to use - metadata from your photos in social media to create a public map of peony gardens in your area?
sm companies_15 Is it ok for social media companies to use - location data from your social media posts to understand the spread of COVID-19?
sm companies_16 Is it ok for social media companies to use - timestamps of your social media posts to understand the spread of COVID-19?
sm companies_17 Is it ok for social media companies to use - your cell phone location data to study commuting patterns?
sm companies_18 Is it ok for social media companies to use - your voter file to study voter turnout over time and between groups?
sm companies_19 Is it ok for social media companies to use - your grocery store purchases to understand the influence of other shoppers?
sm companies_20 Is it ok for social media companies to use - your grocery store purchases to send you coupons?
journalists_1 Is it ok for journalists to use - texts from your social media posts in a story about how language changes over time?
journalists_2 Is it ok for journalists to use - texts from your social media posts in a story about bots trained to reduce racist messages?
journalists_3 Is it ok for journalists to use - texts from your social media posts in a story about algorithms that recognize emotions?
journalists_4 Is it ok for journalists to use - texts or images from your social media posts in a story about misinformation?
journalists_5 Is it ok for journalists to use - text of your social media posts in a story about elections?
journalists_6 Is it ok for journalists to use - posts you’ve deleted from social media in a story about natural disasters?
journalists_7 Is it ok for journalists to use - images you’ve posted to social media in a story about vegetation in National Parks?
journalists_8 Is it ok for journalists to use - images you’ve uploaded to social media in a story about facial recognition software?
journalists_9 Is it ok for journalists to use - images from security camera footage in a story about facial recognition software?
journalists_10 Is it ok for journalists to use - social media posts you made at a protest in a story about the protest?
journalists_11 Is it ok for journalists to use - metadata from your photos in social media to create a public map of peony gardens in your area?
journalists_12 Is it ok for journalists to use - location data from your social media posts in a story about the spread of COVID-19?
journalists_13 Is it ok for journalists to use - timestamps of your social media posts in a story about the spread of COVID-19?
journalists_14 Is it ok for journalists to use - your cell phone location data in a story about commuting patterns?
journalists_15 Is it ok for journalists to use - your voter file in a study about voter turnout?
journalists_16 Is it ok for journalists to use - your grocery store purchases in a story about how others influence what we buy?
sometimes Did you find yourself wanting to answer "sometimes" or "it depends" to any of the questions about academic researchers, journalists, and social media companies using data?
sometimes text Why did you want to answer "sometimes" or "it depends"? For instance, on what does it depend or when would you say "sometimes"?
digital privacy_1 Please respond how much you agree or disagree with the following statements. - When companies or government agencies try to collect my personal information, I sometimes hesitate to provide it.
digital privacy_2 Please respond how much you agree or disagree with the following statements. - I am concerned that companies or government agencies do not allow me to delete information I've given them.
digital privacy_3 Please respond how much you agree or disagree with the following statements. - It usually bothers me that companies or government agencies don't offer a process for me to request deletion of information I've given them.
digital privacy_4 Please respond how much you agree or disagree with the following statements. - It usually bothers me that companies or government agencies do not give me the option to have my information deleted.
social science Researchers should be able to get my social media data without my permission if it will help them to do research that will advance knowledge about society and social relationships.
data archive_1 How important is it that researchers ask you for your permission to add the following information to a data archive? - your public social media posts?
data archive_2 How important is it that researchers ask you for your permission to add the following information to a data archive? - your grocery purchases?
data archive_3 How important is it that researchers ask you for your permission to add the following information to a data archive? - your cell phone location data?
data archive_4 How important is it that researchers ask you for your permission to add the following information to a data archive? - your voting records (only whether you voted in a particular election, not how you voted)?
data archive_5 How important is it that researchers ask you for your permission to add the following information to a data archive? - your anonymized health records?
secure archive Do you think a secure social media archive is a good idea or a bad idea?
anonymous archive Do you think an anonymous social media archive is a good idea or a bad idea?
concern-misuse How concerned are you about the possibility that your data might be misused if it's in a data archive?
concern-harm How concerned are you that you could be harmed through misuse of your data?
age Please select your age range
gender identity_1 Which of the following best describes your current gender identity? (select all that apply) - Woman
gender identity_2 Which of the following best describes your current gender identity? (select all that apply) - Man
gender identity_3 Which of the following best describes your current gender identity? (select all that apply) - Transgender
gender identity_4 Which of the following best describes your current gender identity? (select all that apply) - Nonbinary/genderqueer
gender identity_5 Which of the following best describes your current gender identity? (select all that apply) - Prefer not to answer
gender identity_6 Which of the following best describes your current gender identity? (select all that apply) - Something else
sexual orientation_1 Which of the following best describes your current sexual orientation? (select all that apply) - Gay or lesbian
sexual orientation_2 Which of the following best describes your current sexual orientation? (select all that apply) - Heterosexual (straight)
sexual orientation_3 Which of the following best describes your current sexual orientation? (select all that apply) - Bisexual
sexual orientation_4 Which of the following best describes your current sexual orientation? (select all that apply) - Prefer not to answer
sexual orientation_5 Which of the following best describes your current sexual orientation? (select all that apply) - Something else
race ethnicity_1 Which of the following best describes your race and/or ethnicity? (select all that apply) - American Indian or Alaskan Native
race ethnicity_2 Which of the following best describes your race and/or ethnicity? (select all that apply) - Asian
race ethnicity_3 Which of the following best describes your race and/or ethnicity? (select all that apply) - Black or African American
race ethnicity_4 Which of the following best describes your race and/or ethnicity? (select all that apply) - Hispanic or Latino
race ethnicity_5 Which of the following best describes your race and/or ethnicity? (select all that apply) - Middle Eastern or North Africa
race ethnicity_6 Which of the following best describes your race and/or ethnicity? (select all that apply) - Native Hawaiian or other Pacific Islander
race ethnicity_7 Which of the following best describes your race and/or ethnicity? (select all that apply) - White
race ethnicity_8 Which of the following best describes your race and/or ethnicity? (select all that apply) - Prefer not to answer
race ethnicity_9 Which of the following best describes your race and/or ethnicity? (select all that apply) - Something else
education What is the highest level of education you have completed or the highest degree you received?
income What was your total household income during the past 12 months?
sensitivity_1 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Mother's maiden name
sensitivity_2 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Handwriting Sample
sensitivity_3 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Voice Print
sensitivity_4 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Fingerprint
sensitivity_5 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Driver's License Number
sensitivity_6 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Vehicle Registration Number
sensitivity_7 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Online Screen Name
sensitivity_8 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Race
sensitivity_9 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Religion
sensitivity_10 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Weight
sensitivity_11 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Height
sensitivity_12 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - License Plate Number
sensitivity_13 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Medical History
sensitivity_14 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Family / Friends contact information
sensitivity_15 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Mental health
sensitivity_16 If a marketer has access to information about the following items, how SENSITIVE would you consider this information? - Emotions
trust_1 How much trust do you personally have in each of the following? - People in general?
trust_2 How much trust do you personally have in each of the following? - Journalists?
trust_3 How much trust do you personally have in each of the following? - Academic researchers?
trust_4 How much trust do you personally have in each of the following? - America's courts and legal system?
trust_5 How much trust do you personally have in each of the following? - Major private companies in America?
trust_6 How much trust do you personally have in each of the following? - Social media companies?
privacy behaviors_1 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you read a website’s privacy policy before you register your information?
privacy behaviors_2 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you look for a privacy certification on a website before you register your information?
privacy behaviors_3 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you read license agreements fully before you agree to them?
privacy behaviors_4 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you watch for ways to control what people send you online (such as check boxes that allow you to opt-in or opt-out of certain offers)?
privacy behaviors_5 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you remove cookies?
privacy behaviors_6 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you clear your browser history regularly?
privacy behaviors_7 For this part of the survey, we are interested in your privacy-related behavior in general and when online. Please answer every question using the full scale provided - Do you block messages/emails from someone you do not want to hear from?
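
As a rough illustration of what recoding dummies can look like for the "select all that apply" items above (gender identity, sexual orientation, race ethnicity), here is a minimal Python sketch. It is not the prep notebook's code. It assumes a checked option is exported as a non-empty cell and an unchecked option as an empty cell, which may not match the actual files. The function names and the example row are hypothetical.

```python
def recode_dummy(value) -> int:
    """Return 1 if a select-all-that-apply cell was checked, else 0.

    Assumption: checked options are non-empty cells; unchecked are empty or None.
    """
    if value is None:
        return 0
    return 0 if str(value).strip() == "" else 1


def recode_row(row: dict, prefix: str) -> dict:
    """Recode every column whose name starts with `prefix` to 0/1; copy the rest."""
    return {
        key: recode_dummy(val) if key.startswith(prefix) else val
        for key, val in row.items()
    }


if __name__ == "__main__":
    # Hypothetical response row, using variable names after the manual prep.
    row = {"race_ethnicity_2": "Asian", "race_ethnicity_7": "", "age": "25-34"}
    print(recode_row(row, "race_ethnicity_"))
    # prints {'race_ethnicity_2': 1, 'race_ethnicity_7': 0, 'age': '25-34'}
```

Matching on a prefix keeps each item family together, so one call handles all nine race ethnicity columns without listing them individually.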

Requirements

R version 4.1.0 (2021-05-18)
Platform: x86_64-pc-linux-gnu (64-bit)
Running under: CentOS Linux 7 (Core)

Matrix products: default
BLAS:   /sw/arcts/centos7/stacks/gcc/8.2.0/R/4.1.0/lib64/R/lib/libRblas.so
LAPACK: /sw/arcts/centos7/stacks/gcc/8.2.0/R/4.1.0/lib64/R/lib/libRlapack.so

locale:
 [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
 [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
 [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
 [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
 [9] LC_ADDRESS=C               LC_TELEPHONE=C            
[11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] margins_0.3.26  ggeffects_1.1.1 lme4_1.1-27.1   Matrix_1.3-3   
 [5] tidyr_1.1.4     purrr_0.3.4     sjPlot_2.8.9    arsenal_3.6.3  
 [9] stargazer_5.2.2 ggplot2_3.3.5   readr_2.0.2     stringr_1.4.0  
[13] reshape2_1.4.4  forcats_0.5.1   dplyr_1.0.7    
