Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Statistical Inference: Determining the Reliability of Surveys, Study notes of Mathematics

The concept of statistical inference and how it applies to the reliability of survey results. The text focuses on the importance of sample size in determining the accuracy of survey results, specifically when the proportion of yes responses (p) in the population is unknown. The industry standard for determining an appropriate sample size based on the desired confidence level and the expected proportion of yes responses. It also discusses the relationship between sample size and margin of error, and provides examples using a presidential poll and a computer simulation.

Typology: Study notes

Pre 2010

Uploaded on 08/30/2009

koofers-user-97v
koofers-user-97v 🇺🇸

10 documents

1 / 2

Toggle sidebar

Related documents


Partial preview of the text

Download Statistical Inference: Determining the Reliability of Surveys and more Study notes Mathematics in PDF only on Docsity! Math 203, Fall 2008: Statistical Inference: How reliable is a survey? Consider a survey with a single question, to which respondents are asked to give an answer of yes or no. Suppose you pick a random sample of n people, and you find that the proportion that answered yes is p̂. Question: How close is p̂ to the actual proportion p of people in the whole population who would have answered yes? In order for there to be a reliable answer to this question, the sample size, n, must be big enough so that the sample distribution is close to a bell shaped curve (i.e., close to a normal distribution). But even if n is big enough that the distribution is close to a normal distribution, usually you need to make n even bigger in order to make sure your margin of error is reasonably small. Thus the first thing to do is to be sure n is big enough for the sample distribution to be close to normal. The industry standard for being close enough is for n to be big enough so that n > 9 1− p p and n > 9 1− p p both hold. When p is about 50%, n can be as small as 10, but when p gets close to 0 or close to 1, the sample size n needs to get bigger. If p is 1% or 99%, then n must be at least 892, for example. (Note also that n here depends on p but not on the size of the whole population.) See Figures 1 and 2 showing frequency histograms for the number of yes respondents if p = 1% when the sample size n is 10 versus 1000 (this data was obtained by running a computer simulation taking 10000 samples). When n = 10 the distribution does not look very close to being a normal distribution, but you get a nice bell shaped curve when n = 1000. Consider for example a presidential poll. If you’re trying to measure support for one of the major candi- dates, p is likely reasonably close to 50% (as is true for both McCain and Obama right now, a week before the election), so even fairly small samples are more or less normally distributed. But if you want to be able to accurately analyze the support for a minor candidate, say Nader, you’ll need a much bigger sample. Of course, you don’t know p or p̂ before you do your survey, but usually you have some idea of about what the value of p is, so you can use that to get an idea of how big n will have to be. If it turns out that your expectation as to what the value of p was going to be was way off, you might have to run your survey over with a bigger sample size. Once you know how big n must be for the distribution to be normal, you usually have to make it even bigger in order to achieve whatever margin of error you want. The industry standard is to use a 95% confidence level; i.e., for the margin of error to be big enough so that 95% of randomly chosen samples will have p̂ fall within the margin of error. This means that the margin of error must be ±2σ. (A 68% confidence level would use a ±σ margin of error and a 99.7% confidence level would use a ±3σ margin of error. This makes sense; the bigger you set your range to be, the more confident you can be that p̂ will end up in that range, since the bigger a target is the easier it is to hit it.) Since the industry standard is ±2σ, that’s what you’re stuck with, but what you can control is how big σ is. To make σ smaller, and thus to have a smaller margin of error, you need to make n bigger. This is because in the formula for σ,
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved