Seminar: Dan Hedlin, Department of Statistics, Stockholm University

EVENT

Date: 25 September 2019, 1.00 PM - 25 September 2019, 2.00 PM
Venue: B705

"Some comments on Xiao-Li Meng’s paper ‘Statistical paradises and paradoxes in big data"

Which one should I trust more: a survey with a random sample that covers 1% of the population but with 60% response rate, or a dataset taken from social media, if this datasets covers 80% of the population? This is a motivating question for Xiao-Li Meng, and he is able to give a general answer to it (spoiler: the social media dataset is more trustworthy). This amazing paper is worth a discussion. A key formula relates the error of an estimate of the population mean to the product of three and only three factors: the “data quality”, the “data quantity” and the “problem difficulty”. A further result is that the design effect for a large non-probability sample is N-1 times the square of the expected “data quality”, where N is the population size. That is, the design effects grows with N.

Reference: Meng, X.-L. (2018). Statistical paradises and paradoxes in big data (I): Law of large populations, big data paradox, and the 2016 US presidential election. The Annals of Applied Statistics, 12(2), 685-726.

When: September 25, at 1-2 pm
Where: Room B705, Department of Statistics, Stockholm University

Last updated: January 27, 2020
Page editor: Robert Standar
Source: Statistiska institutionen

Tell a friend

Contacts

Visiting adress
Statistiska institutionen
Universitetsvägen 10B, plan 7
Stockholm

Postal adress
Statistiska institutionen
Stockholms universitet
SE - 106 91 STOCKHOLM

Fax 08 - 16 7511

Opening hours

The Department is open Monday - Friday, 07:50 AM – 5:05 PM throughout academic year.

More contact details

We belong to the Faculty of Social Sciences

Faculty of Social Sciences

Seminars

Seminar: Dan Hedlin, Department of Statistics, Stockholm University

EVENT

Contacts

We belong to the Faculty of Social Sciences