Designed Big Data

Introduction

The increasing volume of Big Data produced by sensors and smart devices can transform the social and behavioral sciences. We will discuss how the true power of these data for the social sciences that lies in integrating Big Data with surveys. Using examples of successful existing studies that used digital data to provide new insights into social reality, we will focus on challenges and opportunities of integrating sensor- and app-based data collection into surveys. The Total Survey Error framework learned in Week 2 will serve as a basis for our discussion of introducing design to Big Data to gauge the inherent challenges of representativeness and measurement. Students will develop a scenario of combing Big Data and survey data, focusing on theoretical and practical aspects of such data integration. Students will have a chance to make decisions on preparing the raw data for analysis and obtaining inference.

##Literature:

  • Chapter 3 - Big Data: A Survey Research Perspective and
  • Chapter 2 - Total Twitter Error in Biemer Paul B., Edith de Leeuw et al. (eds.) (2017). Total Survey Error in Practice. John Wiley & Sons, Available through UU library (DOI:10.1002/9781119041702)

##Lecture: In the lecture, we will discuss the types of Big Data, how types of errors from the Total Survey Error relate to Big Data, and how to combine survey data with Big Data sources. Slides part 1

Class Exercise:

You will design a scenario that combines survey data with one or several Big Data sources.

##Take home exercise: You will have a chance to analyze smartphone sensor data that you yourself have produced (e.g., from Apple Health). You can either use your own data by downloading a Data Download Package, or you can work with data produced by Apple Health. exercise data exercise script

Send your insights into your behavior and a short discussion on construct validity to b.struminskaya@uu.nl before the lecture of week 13.

Previous
Next