Docsity
Docsity

Prepare for your exams
Prepare for your exams

Study with the several resources on Docsity


Earn points to download
Earn points to download

Earn points by helping other students or get them with a premium plan


Guidelines and tips
Guidelines and tips

Understanding Population Parameters: Z-scores, Confidence Intervals & Hypothesis Testing -, Study notes of Data Analysis & Statistical Methods

An in-depth exploration of confidence intervals and hypothesis testing using z-scores. Students will learn how to calculate confidence intervals for population means and hypotheses, understand the concept of sampling variability, and determine the appropriate technique for a given scenario. The document also covers the central limit theorem and ways to reduce the margin of error. Hypothesis testing, including null and alternative hypotheses, test statistics, and p-values, is also discussed.

Typology: Study notes

2009/2010

Uploaded on 04/26/2010

cgilles
cgilles 🇺🇸

3

(1)

9 documents

1 / 10

Toggle sidebar

Related documents


Partial preview of the text

Download Understanding Population Parameters: Z-scores, Confidence Intervals & Hypothesis Testing - and more Study notes Data Analysis & Statistical Methods in PDF only on Docsity! 1 Chapter 6: Confidence Intervals and Hypothesis Testing Using Z for the CI and test of the population mean Learning goals for this chapter: Understand what inference is and why it is needed. Know that all inference techniques give us information about the population parameter. Explain what a confidence interval is and when it is needed. Calculate a confidence interval for the population mean when the population standard deviation is known. Know the assumptions that must be met for doing inference for the population mean. Calculate the needed sample size if you have a pre-determined margin of error. Know how to write hypotheses, calculate a test statistic and P-value, and write conclusions in terms of the story. Draw Normal curve pictures to match the hypothesis test. Understand the logic of hypothesis testing and when a hypothesis test is needed. Use the confidence interval to perform a two-sided hypothesis test. Explain sampling variability and the difference between the population mean and the sample mean. Explain the difference between the population standard deviation and the sample standard deviation. Know which technique is most appropriate for a story: confidence interval, hypothesis test, or simple summary statistics. When we collect data from our sample, we can calculate sample statistics. However, usually we are interested in what is true for the whole population, not just for the sample. (Remember that a census is very hard and expensive to do well.) Why can’t we just accept our sample mean or sample proportion as the official mean or proportion for the population? Every time we estimate the statistics ˆ,x p (sample mean and sample proportion), we get a different answer due to sampling variability. Two most common types of formal statistical inference: Confidence Intervals: when we want to estimate a population parameter Significance Tests: when we want to assess the evidence provided by the data in favor of some claim about the population (yes/no question about the population) Confidence Intervals allow us to estimate the population mean or population proportion. 2 The true mean or proportion for the population exists and is a fixed number, but we just don’t know what it is. Using our sample statistic, we can create a “net” to give us an estimate of where to expect the population parameter to be. Confidence interval = net Population parameter = invisible, stationary butterfly We don’t know exactly where the butterfly is, but from our sample, we have a pretty good estimate of the location. We don’t need to take a lot of random samples to recreate the sampling distribution with the population mean μ at its center. All we need is one Simple Random Sample of size n. Because of what we know about the sample mean distribution, we can use that one sample mean’s confidence interval to infer what the population mean really is. If you increase the sample size (n), you decrease the size of your “net” (or your margin of error). If we just take a single sample, our single confidence interval “net” may or may not include the population parameter. However if we take many samples of the same size and create a confidence interval from each sample statistic, over the long run 95% of our confidence intervals will contain the true population parameter (if we are using a 95% confidence level). 5 Be careful!!!! You can only use the formula * xx z n under certain circumstances: Data must be an SRS from the population. Do not use if the sampling is anything more complicated than an SRS. Data must be collected correctly (no bias). The margin of error covers only random sampling errors. Undercoverage and nonresponse are not covered. Outliers can have a big effect on the confidence interval. (This makes sense because we use the mean and standard deviation to get a CI.) You must know the standard deviation of the population, x . Example: A questionnaire of drinking habits was given to a random sample of fraternity members, and each student was asked to report the # of beers he had drunk in the past month. The sample of 30 students resulted in an average of 22 beers with a population standard deviation of 9 beers. a) Give a 90% confidence interval for the mean number of beers drunk by fraternity members in the past month. b) Is it true that 90% of the fraternity members each month drink the number of beers that lie in the interval you found in part (a)? Explain your answer. No, this is the confidence interval for the population mean, not for individual population members. If we take many 30-frat member samples and make a confidence interval from each sample, 90% of these confidence intervals will contain the true population mean # of beers drunk in a month by fraternity members. c) What is the margin of error for the 90% confidence interval? d) How many students should you sample if you want a margin of error of 1 for a 90% confidence interval? 6 Hypothesis Testing To do a significance test, you need 2 hypotheses: H0, Null Hypothesis: the statement being tested, usually phrased as “no effect” or “no difference”. Ha, Alternative Hypothesis: the statement we hope or suspect is true instead of H0. Hypotheses always refer to some population or model. Not to a particular outcome. Hypotheses can be one-sided or two-sided. One-sided hypothesis: covers just part of the range for your parameter H0: = 10 OR H0: = 10 Ha: > 10 Ha: < 10 Two-sided hypothesis: covers the whole possible range for your parameter H0: = 10 Ha: 10 Even though Ha is what we hope or believe to be true, our test gives evidence for or against H0 only. We never prove H0 true, we can only state whether we have enough evidence to reject H0 (which is evidence in favor of Ha, but not proof that Ha is true) or that we don’t have enough evidence to reject H0. A test statistic measures compatibility between the H0 and the data. P-value: the probability, computed assuming that H0 is true, that the test statistic would take a value as extreme or more extreme than that actually observed due to random fluctuation. It is a measure of how unusual your sample results are. The smaller the P-value, the stronger the evidence against H0 provided by the data. Calculate the P-value by using the sampling distribution of the test statistic (only the normal distribution for Chapter 6). 7 Compare P-value to a significance level, . If the P-value , we can reject H0. If you can reject H0, your results are significant. If you do not reject H0, your results are not significant. The 4 steps common to all tests of significance: 1. State the null hypothesis H0 and the alternative hypothesis Ha. 2. Calculate the value of the test statistic (z-score in Chapter 6). 3. Draw a picture of what Ha looks like, and find the P-value (use the normal table in Chapter 6). 4. State your conclusion about the data in a sentence, using the P-value and/or comparing the P-value to a significance level for your evidence. Z-Test for a Population Mean To test the hypothesis H0: = 0 based on an SRS of size n from a population with unknown mean and known standard deviation , compute the test statistic: 0 / x z n the P-values for a test of H0 against: Ha: > 0 is P( Z z) Ha: < 0 is P( Z z ) Ha: 0 is 2* P( Z | z| )
Docsity logo



Copyright © 2024 Ladybird Srl - Via Leonardo da Vinci 16, 10126, Torino, Italy - VAT 10816460017 - All rights reserved