H : p ≠ p
Step 2 : Decide on a level of significance, α , depending on the seriousness of making a Type I error. ( α will often be given as part of a test or homework question, but this will not be the case in the outside world.)
Step 4 : Determine the P -value.
Step 5 : Reject the null hypothesis if the P -value is less than the level of significance, α.
Step 6 : State the conclusion.
Right-tailed tests.
In a two-tailed test, the P -value = 2P(Z > |z o |).
It may seem odd to multiply the probability by two, since "or more extreme" seems to imply the area in the tail only. The reason why we do multiply by two is that even though the result was on one side, we didn't know before collecting the data , on which side it would be.
Since the P -value represents the probability of observing our result or more extreme, the smaller the P -value, the more unusual our observation was. Another way to look at it is this:
The smaller the P -value, the stronger the evidence supporting the alternative hypothesis. We can use the following guideline:
These values are not hard lines, of course, but they can give us a general idea of the strength of the evidence.
But wait! There is an important caveat here, which was mentioned earlier in the section about The Controversy Regarding Hypothesis Testing . The problem is that it's relatively easy to get a large p-value - just get a really large sample size! So the chart above is really with the caveat " assuming equal sample sizes in comparable studies , ... "
This isn't something every statistics text will mention, nor will every instructor mention, but it's important.
According to the Elgin Community College website , approximately 56% of ECC students are female. Suppose we wonder if the same proportion is true for math courses. If we collect a sample of 200 ECC students enrolled in math courses and find that 105 of them are female, do we have enough evidence at the 10% level of significance to say that the proportion of math students who are female is different from the general population?
Note: Be sure to check that the conditions for performing the hypothesis test are met.
[ reveal answer ]
Before we begin, we need to make sure that our sample is less than 5% of the population, and that np 0 (1-p 0 )≥10.
Since there are roughly 16,000 students at ECC (source: www.elgin.edu ), our sample of 200 is clearly less than 5% of the population. Also, np 0 (1-p 0 ) = 200(0.56)(1-0.56) = 49.28 > 10
Step 1 : H 0 : p = 0.56 H 1 : p ≠ 0.56
Step 2 : α = 0.1
Step 4 : P -value = 2•P(Z < -1.00) ≈ 0.3187 (Note that this is a 2-tailed test.)
Step 5 : Since P -value > α , we do not reject H 0 .
Step 6 : There is not enough evidence at the 5% level of significance to support the claim that the proportion of students in math courses who are female is different from the general population.
> > > and H , then click . > > > and H , then click . * To get the counts, first create a frequency table. If you have a grouping variable, use a contingency table. |
Consider the excerpt shown below (also used in Example 1 , in Section 9.3) from a poll conducted by Pew Research:
Stem cell, marijuana proposals lead in Mich. poll A recent poll shows voter support leading opposition for ballot proposals to loosen Michigan's restrictions on embryonic stem cell research and allow medical use of marijuana. The EPIC-MRA poll conducted for The Detroit News and television stations WXYZ, WILX, WOOD and WJRT found 50 percent of likely Michigan voters support the stem cell proposal , 32 percent against and 18 percent undecided. The telephone poll of 602 likely Michigan voters was conducted Sept. 22 through Wednesday. It has a margin of sampling error of plus or minus 4 percentage points. (Source: Associated Press )
Suppose we wonder if the percent of Elgin Community College students who support stem cell research is different from this. If 61 of 100 randomly selected ECC students support stem cell research, is there enough evidence at the 5% level of signficance to support our claim?
Since there are roughly 16,000 students at ECC (source: www.elgin.edu ), our sample of 100 is clearly less than 5% of the population. Also, np 0 (1-p 0 ) = 100(0.50)(1-0.50) = 25 > 10
Step 1 : H 0 : p = 0.5 H 1 : p ≠ 0.5
Step 2 : α = 0.05
Step 3 : We'll use StatCrunch.
Step 4 : Using StatCrunch:
Step 5 : Since P -value < α , we reject H 0 .
Step 6 : Based on this sample, there is enough evidence at the 5% level of significance to support the claim that the proportion of ECC students who support stem cell research is different from the Michigan poll.
One question you might have is, "What do we do if the conditions for the hypothesis test about p aren't met?" Great question!
The binomial probability distribution function.
The probability of obtaining x successes in n independent trials of a binomial experiment, where the probability of success is p, is given by
Where x = 0, 1, 2, ... , n
Here's a quick overview of the formulas for finding binomial probabilities in StatCrunch.
Click on > > Enter n, p, the appropriate equality/inequality, and x. The figure below shows P(X≥3) if n=4 and p=0.25.
|
Traditionally, about 70% of students in a particular Statistics course at ECC are successful. If only 15 students in a class of 28 randomly selected students are successful, is there enough evidence at the 5% level of significance to say that students of that particular instructor are successful at a rate less than 70%?
Step 1 : H 0 : p = 0.7 H 1 : p < 0.7
Step 4 : If we let X = the number of students who were successful, X follows the binomial distribution. For this example, n=28 and p=0.70, and we want P(X≤15). Using StatCrunch:
Step 5 : Since P -value < α (though it's very close), we reject H 0 .
Step 6 : Based on this sample, there is enough evidence at the 5% level of significance to support the claim that the proportion of students who are successful in this professor's classes are less than 70%. (Keep in mind that this assumes the students were randomly assigned to that class, which is never the case in reality!)
<< previous section | next section >>
Hypothesis test for a population proportion (3 of 3), learning objectives.
The P-value is a probability that describes the likelihood of the data if the null hypothesis is true. More specifically, the P-value is the probability that sample results are as extreme as or more extreme than the data if the null hypothesis is true. The phrase “as extreme as or more extreme than” means farther from the center of the sampling distribution in the direction of the alternative hypothesis.
More generally, we view the P-value a description of the strength of the evidence against the null hypothesis and in support of the alternative hypothesis. But the P-value is a probability about sample results, not about the null or alternative hypothesis.
You may wonder why 5% is often selected as the significance level in hypothesis testing and why 1% is also a commonly used level. It is largely due to just convenience and tradition. When Ronald Fisher (one of the founders of modern statistics) published one of his tables, he used a mathematically convenient scale that included 5% and 1%. Later, these same 5% and 1% levels were used by other people, in part just because Fisher was so highly esteemed. But mostly, these are arbitrary levels.
The idea of selecting some sort of relatively small cutoff was historically important in the development of statistics. But it’s important to remember that there is really a continuous range of increasing confidence toward the alternative hypothesis, not a single all-or-nothing value. There isn’t much meaningful difference, for instance, between the P-values 0.049 and 0.051, and it would be foolish to declare one case definitely a “real” effect and the other case definitely a “random” effect. In either case, the study results are roughly 5% likely by chance if there’s no actual effect.
Whether such a P-value is sufficient for us to reject a particular null hypothesis ultimately depends on the risk of making the wrong decision and the extent to which the hypothesized effect might contradict our prior experience or previous studies.
Consider our earlier example about teenagers and Internet access. According to the Kaiser Family Foundation, 84% of U.S. children ages 8 to 18 had Internet access at home as of August 2009. Researchers wonder if this number has changed since then. The hypotheses we tested were:
The original sample consisted of 500 children, and 86% of them had Internet access at home. The P-value was about 0.22, which was not strong enough to reject the null hypothesis. There was not enough evidence to show that the proportion of all U.S. children ages 8 to 18 have Internet access at home.
Suppose we sampled 2,000 children and the sample proportion was still 86%. Our test statistic would be Z ≈ 2.44, and our P-value would be about 0.015. The larger sample size would allow us to reject the null hypothesis even though the sample proportion was the same.
Why does this happen? Larger samples vary less, so a sample proportion of 0.86 is more unusual with larger samples than with smaller samples if the population proportion is really 0.84. This means that if the alternative hypothesis is true, a larger sample size will make it more likely that we reject the null. Therefore, we generally prefer a larger sample as we have seen previously.
It is tempting to get involved in the details of a hypothesis test without thinking about how the data was collected. Whether we are calculating a confidence interval or performing a hypothesis test, the results are meaningless without a properly designed study. Consider the following exercises about how data collection can affect the results of a study.
Let’s summarize.
In this section, we looked at the four steps of a hypothesis test as they relate to a claim about a population proportion.
Step 1: Determine the hypotheses.
Step 2: Collect the data.
Since the hypothesis test is based on probability, random selection or assignment is essential in data production. Additionally, we need to check whether the sample proportion can be np ≥ 10 and n (1 − p ) ≥ 10.
Step 3: Assess the evidence.
Step 4: Give the conclusion.
Remember that the P-value is the probability of seeing a sample proportion as extreme as the one observed from the data if the null hypothesis is true. The probability is about the random sample, not about the null or alternative hypothesis.
A larger sample size makes it more likely that we will reject the null hypothesis if the alternative is true. Another way of thinking about this is that increasing the sample size will decrease the likelihood of a type II error. Recall that a type II error is failing to reject the null hypothesis when the alternative is true.
Increasing the sample size can have the unintended effect of making the test sensitive to differences so small they don’t matter. A statistically significant difference is one large enough that it is unlikely to be due to sampling variability alone. Even a difference so small that it is not important can be statistically significant if the sample size is big enough.
Finally, remember the phrase “garbage in, garbage out.” If the data collection methods are poor, then the results of a hypothesis test are meaningless. No statistical methods can create useful information if our data comes from convenience or voluntary response samples. Additionally, the results of a hypothesis test apply only to the population from whom the sample was chosen.
Privacy Policy
selected template will load here
This action is not available.
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
Step 1: State your hypotheses about the population proportion. Step 2: Summarize the data. State a significance level. State and check conditions required for the procedure
Step 3: Perform the procedure
Step 4: Make a decision about \(H_{0}\) and \(H_{a}\)
Step 5 : Make a conclusion.
Example \(\pageindex{1}\).
Joon believes that 50% of first-time brides in the United States are younger than their grooms. She performs a hypothesis test to determine if the percentage is the same or different from 50% . Joon samples 100 first-time brides and 53 reply that they are younger than their grooms. For the hypothesis test, she uses a 1% level of significance.
Set up the hypothesis test:
The 1% level of significance means that α = 0.01. This is a test of a single population proportion .
\(H_{0}: p = 0.50\) \(H_{a}: p \neq 0.50\)
The words "is the same or different from" tell you this is a two-tailed test.
Calculate the distribution needed:
Random variable: \(\hat{P} =\) the percent of of first-time brides who are younger than their grooms.
Distribution for the test: The problem contains no mention of a mean. The information is given in terms of percentages. Use the Normal distribution for \hat{P} , the estimated proportion.
\[ \hat{P} - N\left(p, \sqrt{\frac{p(1-p)}{n}}\right)\nonumber \]
\[ \hat{P} - N\left(0.5, \sqrt{\frac{0.5(0.5)}{100}}\right)\nonumber \]
where \(p = 0.50, q = 1−p = 0.50\), and \(n = 100\)
Calculate the p -value using the normal distribution for proportions:
\[p\text{-value} = P( \hat{P} < 0.47 \space or \space \hat{P} > 0.53) = 0.5485\nonumber \]
where \[x = 53, \hat{P} = \frac{x}{n} = \frac{53}{100} = 0.53\nonumber \].
Interpretation of the p-value: If the null hypothesis is true, there is 0.5485 probability (54.85%) that the sample (estimated) proportion \( \hat{P} \) is 0.53 or more OR 0.47 or less (see the graph in Figure).
\(\mu = p = 0.50\) comes from \(H_{0}\), the null hypothesis.
\( \hat{P} = 0.53\). Since the curve is symmetrical and the test is two-tailed, the \( \hat{P} \) for the left tail is equal to \(0.50 – 0.03 = 0.47\) where \(\mu = p = 0.50\). (0.03 is the difference between 0.53 and 0.50.)
Compare \(\alpha\) and the \(p\text{-value}\):
Since \(\alpha = 0.01\) and \(p\text{-value} = 0.5485\). \(\alpha < p\text{-value}\).
Make a decision: Since \(\alpha < p\text{-value}\), you cannot reject \(H_{0}\).
Conclusion: At the 1% level of significance, the sample data do not show sufficient evidence that the percentage of first-time brides who are younger than their grooms is different from 50%.
The \(p\text{-value}\) can easily be calculated.
Press STAT and arrow over to TESTS . Press 5:1-PropZTest . Enter .5 for \(p_{0}\), 53 for \(x\) and 100 for \(n\). Arrow down to Prop and arrow to not equals \(p_{0}\). Press ENTER . Arrow down to Calculate and press ENTER . The calculator calculates the \(p\text{-value}\) (\(p = 0.5485\)) and the test statistic (\(z\)-score). Prop not equals .5 is the alternate hypothesis. Do this set of instructions again except arrow to Draw (instead of Calculate ). Press ENTER . A shaded graph appears with \(z = 0.6\) (test statistic) and \(p = 0.5485\) (\(p\text{-value}\)). Make sure when you use Draw that no other equations are highlighted in \(Y =\) and the plots are turned off.
The Type I and Type II errors are as follows:
The Type I error is to conclude that the proportion of first-time brides who are younger than their grooms is different from 50% when, in fact, the proportion is actually 50%. (Reject the null hypothesis when the null hypothesis is true).
The Type II error is there is not enough evidence to conclude that the proportion of first time brides who are younger than their grooms differs from 50% when, in fact, the proportion does differ from 50%. (Do not reject the null hypothesis when the null hypothesis is false.)
A teacher believes that 85% of students in the class will want to go on a field trip to the local zoo. She performs a hypothesis test to determine if the percentage is the same or different from 85%. The teacher samples 50 students and 39 reply that they would want to go to the zoo. For the hypothesis test, use a 1% level of significance.
First, determine what type of test this is, set up the hypothesis test, find the \(p\text{-value}\), sketch the graph, and state your conclusion.
Since the problem is about percentages, this is a test of single population proportions.
Because \(p > \alpha\), we fail to reject the null hypothesis. There is not sufficient evidence to suggest that the proportion of students that want to go to the zoo is not 85%.
Suppose a consumer group suspects that the proportion of households that have three cell phones is 30%. A cell phone company has reason to believe that the proportion is not 30%. Before they start a big advertising campaign, they conduct a hypothesis test. Their marketing people survey 150 households with the result that 43 of the households have three cell phones.
Set up the Hypothesis Test:
\(H_{0}: p = 0.30, H_{a}: p \neq 0.30\)
Determine the distribution needed:
The random variable is \( \hat{P} =\) proportion of households that have three cell phones.
The distribution for the hypothesis test is \( \hat{P} - N\left(0.30, \sqrt{\frac{0.30 \cdot (0.70)}{150}}\right)\)
a. The value that helps determine the \(p\text{-value}\) is \( \hat{P} \). Calculate \( \hat{P} \).
a. \( \hat{P} = \frac{x}{n}\) where \(x\) is the number of successes and \(n\) is the total number in the sample.
\(x = 43, n = 150\)
\( \hat{P} = 43/150=0.2867\)
b. What is a success for this problem?
b. A success is having three cell phones in a household.
c. What is the level of significance?
c. The level of significance is the preset \(\alpha\). Since \(\alpha\) is not given, assume that \(\alpha = 0.05\).
d. Draw the graph for this problem. Draw the horizontal axis. Label and shade appropriately.
Calculate the \(p\text{-value}\).
d. \(p\text{-value} = 0.7216\)
e. Make a decision. _____________(Reject/Do not reject) \(H_{0}\) because____________.
e. Assuming that \(\alpha = 0.05, \alpha < p\text{-value}\). The decision is do not reject \(H_{0}\) because there is not sufficient evidence to conclude that the proportion of households that have three cell phones is not 30%.
Marketers believe that 92% of adults in the United States own a cell phone. A cell phone manufacturer believes that number is actually lower. 200 American adults are surveyed, of which, 174 report having cell phones. Use a 5% level of significance. State the null and alternative hypothesis, find the p -value, state your conclusion, and identify the Type I and Type II errors.
Because \(p < 0.05\), we reject the null hypothesis. There is sufficient evidence to conclude that fewer than 92% of American adults own cell phones.
The next example is a poem written by a statistics student named Nicole Hart. The solution to the problem follows the poem. Notice that the hypothesis test is for a single population proportion. This means that the null and alternate hypotheses use the parameter \(p\). The distribution for the test is normal. The estimated proportion \(\hat{p}\) is the proportion of fleas killed to the total fleas found on Fido. This is sample information. The problem gives a preconceived \(\alpha = 0.01\), for comparison, and a 95% confidence interval computation. The poem is clever and humorous, so please enjoy it!
My dog has so many fleas,
They do not come off with ease. As for shampoo, I have tried many types Even one called Bubble Hype, Which only killed 25% of the fleas, Unfortunately I was not pleased.
I've used all kinds of soap, Until I had given up hope Until one day I saw An ad that put me in awe.
A shampoo used for dogs Called GOOD ENOUGH to Clean a Hog Guaranteed to kill more fleas.
I gave Fido a bath And after doing the math His number of fleas Started dropping by 3's! Before his shampoo I counted 42.
At the end of his bath, I redid the math And the new shampoo had killed 17 fleas. So now I was pleased.
Now it is time for you to have some fun With the level of significance being .01, You must help me figure out
Use the new shampoo or go without?
\(H_{0}: p \leq 0.25\) \(H_{a}: p > 0.25\)
In words, CLEARLY state what your random variable \(\bar{X}\) or \( \hat{P} \) represents.
\( \hat{P} =\) The proportion of fleas that are killed by the new shampoo
State the distribution to use for the test.
\[N\left(0.25, \sqrt{\frac{0.25 \cdot (1-0.25)}{42}}\right)\nonumber \]
Test Statistic: \(z_{obs} = 2.3163\)
Calculate the \(p\text{-value}\) using the normal distribution for proportions:
\[p\text{-value} = 0.0103\nonumber \]
In one to two complete sentences, explain what the p -value means for this problem.
If the null hypothesis is true (the proportion is 0.25), then there is a 0.0103 probability that the sample (estimated) proportion is 0.4048 \(\left(\frac{17}{42}\right)\) or more.
Use the previous information to sketch a picture of this situation. CLEARLY, label and scale the horizontal axis and shade the region(s) corresponding to the \(p\text{-value}\).
Indicate the correct decision (“reject” or “do not reject” the null hypothesis), the reason for it, and write an appropriate conclusion, using complete sentences.
0.01 | Do not reject \(H_{0}\) | \(\alpha < p\text{-value}\) |
Conclusion: At the 1% level of significance, the sample data do not show sufficient evidence that the percentage of fleas that are killed by the new shampoo is more than 25%.
Construct a 95% confidence interval for the true mean or proportion. Include a sketch of the graph of the situation. Label the point estimate and the lower and upper bounds of the confidence interval.
Confidence Interval: (0.26,0.55) We are 95% confident that the true population proportion p of fleas that are killed by the new shampoo is between 26% and 55%.
This test result is not very definitive since the \(p\text{-value}\) is very close to alpha. In reality, one would probably do more tests by giving the dog another bath after the fleas have had a chance to return.
In a study of 420,019 cell phone users, 172 of the subjects developed brain cancer. Test the claim that cell phone users developed brain cancer at a greater rate than that for non-cell phone users (the rate of brain cancer for non-cell phone users is 0.0340%). Since this is a critical issue, use a 0.005 significance level. Explain why the significance level should be so low in terms of a Type I error.
We will follow the four-step process.
If we commit a Type I error, we are essentially accepting a false claim. Since the claim describes cancer-causing environments, we want to minimize the chances of incorrectly identifying causes of cancer.
According to the US Census there are approximately 268,608,618 residents aged 12 and older. Statistics from the Rape, Abuse, and Incest National Network indicate that, on average, 207,754 rapes occur each year (male and female) for persons aged 12 and older. This translates into a percentage of sexual assaults of 0.078%. In Daviess County, KY, there were reported 11 rapes for a population of 37,937. Conduct an appropriate hypothesis test to determine if there is a statistically significant difference between the local sexual assault percentage and the national sexual assault percentage. Use a significance level of 0.01.
We will follow the five-step plan.
The hypothesis test itself has an established process. This can be summarized as follows:
Notice that in performing the hypothesis test, you use \(\alpha\) and not \(\beta\). \(\beta\) is needed to help determine the sample size of the data that is used in calculating the \(p\text{-value}\). Remember that the quantity \(1 – \beta\) is called the Power of the Test . A high power is desirable. If the power is too low, statisticians typically increase the sample size while keeping α the same.If the power is low, the null hypothesis might not be rejected when it should be.
Run a free plagiarism check in 10 minutes, generate accurate citations for free.
Published on November 8, 2019 by Rebecca Bevans . Revised on June 22, 2023.
Hypothesis testing is a formal procedure for investigating our ideas about the world using statistics . It is most often used by scientists to test specific predictions, called hypotheses, that arise from theories.
There are 5 main steps in hypothesis testing:
Though the specific details might vary, the procedure you will use when testing a hypothesis will always follow some version of these steps.
Step 1: state your null and alternate hypothesis, step 2: collect data, step 3: perform a statistical test, step 4: decide whether to reject or fail to reject your null hypothesis, step 5: present your findings, other interesting articles, frequently asked questions about hypothesis testing.
After developing your initial research hypothesis (the prediction that you want to investigate), it is important to restate it as a null (H o ) and alternate (H a ) hypothesis so that you can test it mathematically.
The alternate hypothesis is usually your initial hypothesis that predicts a relationship between variables. The null hypothesis is a prediction of no relationship between the variables you are interested in.
Discover proofreading & editing
For a statistical test to be valid , it is important to perform sampling and collect data in a way that is designed to test your hypothesis. If your data are not representative, then you cannot make statistical inferences about the population you are interested in.
There are a variety of statistical tests available, but they are all based on the comparison of within-group variance (how spread out the data is within a category) versus between-group variance (how different the categories are from one another).
If the between-group variance is large enough that there is little or no overlap between groups, then your statistical test will reflect that by showing a low p -value . This means it is unlikely that the differences between these groups came about by chance.
Alternatively, if there is high within-group variance and low between-group variance, then your statistical test will reflect that with a high p -value. This means it is likely that any difference you measure between groups is due to chance.
Your choice of statistical test will be based on the type of variables and the level of measurement of your collected data .
Based on the outcome of your statistical test, you will have to decide whether to reject or fail to reject your null hypothesis.
In most cases you will use the p -value generated by your statistical test to guide your decision. And in most cases, your predetermined level of significance for rejecting the null hypothesis will be 0.05 – that is, when there is a less than 5% chance that you would see these results if the null hypothesis were true.
In some cases, researchers choose a more conservative level of significance, such as 0.01 (1%). This minimizes the risk of incorrectly rejecting the null hypothesis ( Type I error ).
The results of hypothesis testing will be presented in the results and discussion sections of your research paper , dissertation or thesis .
In the results section you should give a brief summary of the data and a summary of the results of your statistical test (for example, the estimated difference between group means and associated p -value). In the discussion , you can discuss whether your initial hypothesis was supported by your results or not.
In the formal language of hypothesis testing, we talk about rejecting or failing to reject the null hypothesis. You will probably be asked to do this in your statistics assignments.
However, when presenting research results in academic papers we rarely talk this way. Instead, we go back to our alternate hypothesis (in this case, the hypothesis that men are on average taller than women) and state whether the result of our test did or did not support the alternate hypothesis.
If your null hypothesis was rejected, this result is interpreted as “supported the alternate hypothesis.”
These are superficial differences; you can see that they mean the same thing.
You might notice that we don’t say that we reject or fail to reject the alternate hypothesis . This is because hypothesis testing is not designed to prove or disprove anything. It is only designed to test whether a pattern we measure could have arisen spuriously, or by chance.
If we reject the null hypothesis based on our research (i.e., we find that it is unlikely that the pattern arose by chance), then we can say our test lends support to our hypothesis . But if the pattern does not pass our decision rule, meaning that it could have arisen by chance, then we say the test is inconsistent with our hypothesis .
If you want to know more about statistics , methodology , or research bias , make sure to check out some of our other articles with explanations and examples.
Methodology
Research bias
Hypothesis testing is a formal procedure for investigating our ideas about the world using statistics. It is used by scientists to test specific predictions, called hypotheses , by calculating how likely it is that a pattern or relationship between variables could have arisen by chance.
A hypothesis states your predictions about what your research will find. It is a tentative answer to your research question that has not yet been tested. For some research projects, you might have to write several hypotheses that address different aspects of your research question.
A hypothesis is not just a guess — it should be based on existing theories and knowledge. It also has to be testable, which means you can support or refute it through scientific research methods (such as experiments, observations and statistical analysis of data).
Null and alternative hypotheses are used in statistical hypothesis testing . The null hypothesis of a test always predicts no effect or no relationship between variables, while the alternative hypothesis states your research prediction of an effect or relationship.
If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.
Bevans, R. (2023, June 22). Hypothesis Testing | A Step-by-Step Guide with Easy Examples. Scribbr. Retrieved June 24, 2024, from https://www.scribbr.com/statistics/hypothesis-testing/
Other students also liked, choosing the right statistical test | types & examples, understanding p values | definition and examples, what is your plagiarism score.
If you're seeing this message, it means we're having trouble loading external resources on our website.
If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.
To log in and use all the features of Khan Academy, please enable JavaScript in your browser.
Course: statistics and probability > unit 13.
Last Updated: July 31, 2023
This article was co-authored by Joseph Quinones . Joseph Quinones is a High School Physics Teacher working at South Bronx Community Charter High School. Joseph specializes in astronomy and astrophysics and is interested in science education and science outreach, currently practicing ways to make physics accessible to more students with the goal of bringing more students of color into the STEM fields. He has experience working on Astrophysics research projects at the Museum of Natural History (AMNH). Joseph recieved his Bachelor's degree in Physics from Lehman College and his Masters in Physics Education from City College of New York (CCNY). He is also a member of a network called New York City Men Teach. This article has been viewed 35,001 times.
Hypothesis testing for a proportion is used to determine if a sampled proportion is significantly different from a specified population proportion. For example, if you expect the proportion of male births to be 50 percent, but the actual proportion of male births is 53 percent in a sample of 1000 births. Is this significantly different from the hypothesized population parameter? To find out, follow these steps.
Thanks for reading our article! If you’d like to learn more about teaching, check out our in-depth interview with Joseph Quinones .
Shanaya Malik
Jun 29, 2019
Don’t miss out! Sign up for
wikiHow’s newsletter
Content preview.
Arcu felis bibendum ut tristique et egestas quis:
5.5 - hypothesis testing for two-sample proportions.
We are now going to develop the hypothesis test for the difference of two proportions for independent samples. The hypothesis test follows the same steps as one group.
These notes are going to go into a little bit of math and formulas to help demonstrate the logic behind hypothesis testing for two groups. If this starts to get a little confusion, just skim over it for a general understanding! Remember we can rely on the software to do the calculations for us, but it is good to have a basic understanding of the logic!
We will use the sampling distribution of \(\hat{p}_1-\hat{p}_2\) as we did for the confidence interval.
For a test for two proportions, we are interested in the difference between two groups. If the difference is zero, then they are not different (i.e., they are equal). Therefore, the null hypothesis will always be:
\(H_0\colon p_1-p_2=0\)
Another way to look at it is \(H_0\colon p_1=p_2\). This is worth stopping to think about. Remember, in hypothesis testing, we assume the null hypothesis is true. In this case, it means that \(p_1\) and \(p_2\) are equal. Under this assumption, then \(\hat{p}_1\) and \(\hat{p}_2\) are both estimating the same proportion. Think of this proportion as \(p^*\).
Therefore, the sampling distribution of both proportions, \(\hat{p}_1\) and \(\hat{p}_2\), will, under certain conditions, be approximately normal centered around \(p^*\), with standard error \(\sqrt{\dfrac{p^*(1-p^*)}{n_i}}\), for \(i=1, 2\).
We take this into account by finding an estimate for this \(p^*\) using the two-sample proportions. We can calculate an estimate of \(p^*\) using the following formula:
\(\hat{p}^*=\dfrac{x_1+x_2}{n_1+n_2}\)
This value is the total number in the desired categories \((x_1+x_2)\) from both samples over the total number of sampling units in the combined sample \((n_1+n_2)\).
Putting everything together, if we assume \(p_1=p_2\), then the sampling distribution of \(\hat{p}_1-\hat{p}_2\) will be approximately normal with mean 0 and standard error of \(\sqrt{p^*(1-p^*)\left(\frac{1}{n_1}+\frac{1}{n_2}\right)}\), under certain conditions.
\(z^*=\dfrac{(\hat{p}_1-\hat{p}_2)-0}{\sqrt{\hat{p}^*(1-\hat{p}^*)\left(\dfrac{1}{n_1}+\dfrac{1}{n_2}\right)}}\)
...will follow a standard normal distribution.
Finally, we can develop our hypothesis test for \(p_1-p_2\).
Hypothesis Testing for Two-Sample Proportions
Conditions :
\(n_1\hat{p}_1\), \(n_1(1-\hat{p}_1)\), \(n_2\hat{p}_2\), and \(n_2(1-\hat{p}_2)\) are all greater than five
Test Statistic:
\(z^*=\dfrac{\hat{p}_1-\hat{p}_2-0}{\sqrt{\hat{p}^*(1-\hat{p}^*)\left(\dfrac{1}{n_1}+\dfrac{1}{n_2}\right)}}\)
...where \(\hat{p}^*=\dfrac{x_1+x_2}{n_1+n_2}\).
The critical values, p-values, and decisions will all follow the same steps as those from a hypothesis test for a one-sample proportion.
Hypothesis testing is a crucial statistical method used to determine whether there is enough evidence to reject a null hypothesis. It’s a fundamental tool in research and decision-making, helping us make informed conclusions based on data.
In this article, we’ll delve into hypothesis testing with a specific example involving proportions. We’ll explore the steps involved and how to interpret the results.
Before diving into the example, let’s define some key concepts:
Let’s consider a simple example. We want to test whether a coin is fair. Our null hypothesis is that the coin is fair, meaning the probability of getting heads (p) is 0.5. Our alternative hypothesis is that the coin is not fair, meaning p ≠ 0.5.
We toss the coin 100 times and observe 60 heads. Our sample proportion (p̂) is 60/100 = 0.6. Now, we need to determine if this sample proportion provides enough evidence to reject the null hypothesis.
The hypothesis testing process involves the following steps:
In this example, we successfully performed a hypothesis test for a proportion. We found enough evidence to reject the null hypothesis that the coin is fair. This process demonstrates the power of hypothesis testing in drawing conclusions from data and making informed decisions.
Remember, hypothesis testing is a crucial tool in various fields, from scientific research to business analysis. By understanding the concepts and steps involved, you can effectively analyze data and make informed judgments.
Research Methods
© 2024 by SchoolTube
Username or Email Address
Remember Me
Enter your account data and we will send you a link to reset your password.
Privacy policy.
To use social login you have to agree with the storage and handling of your data by this website. Privacy Policy
Public collection title
Private collection title
Here you'll find all collections you've created before.
Scientific Calculator
selected template will load here
This action is not available.
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
hypothesis test for a population Proportion calculator
Fill in the sample size, n, the number of successes, x, the hypothesized population proportion \(p_0\), and indicate if the test is left tailed, <, right tailed, >, or two tailed, \(\neq\). Then hit "Calculate" and the test statistic and p-Value will be calculated for you.
n: | x: | \(p_0\): |
|
IMAGES
VIDEO
COMMENTS
The p -value for a hypothesis test on a population proportion is the area in the tail (s) of distribution of the sample proportion. If both n× p ≥ 5 n × p ≥ 5 and n ×(1− p) ≥ 5 n × ( 1 − p) ≥ 5, use the normal distribution to find the p -value. If at least one of n× p < 5 n × p < 5 or n×(1 −p) < 5 n × ( 1 − p) < 5, use ...
Step 3) Compute the test statistic. The test statistic is the number of standard deviations the sample proportion is from the known proportion. It is also a Z-score, just like the critical value. z = ˆp − p √p(1 − p) n. For this problem, the test statistic is: z = 0.403 − 0.32 √0.32 ( 1 − 0.32) 196 = 2.49.
For the hypothesis test, she uses a 1% level of significance. Answer. Set up the hypothesis test: The 1% level of significance means that α = 0.01. This is a test of a single population proportion. \(H_{0}: p = 0.50\) \(H_{a}: p \neq 0.50\) The words "is the same or different from" tell you this is a two-tailed test. Calculate the distribution ...
test hypotheses about a population proportion; test hypotheses about a population proportion using the binomial probability distribution; ... Step 5: Reject the null hypothesis if the P-value is less than the level of significance, α. Step 6: State the conclusion. Calculating P-Values Right-Tailed Tests.
Step 1: Check assumptions and write hypotheses. When conducting a chi-square goodness-of-fit test, it makes the most sense to write the hypotheses first. The hypotheses will depend on the research question. The null hypothesis will always contain the equalities and the alternative hypothesis will be that at least one population proportion is ...
The examples on the following pages use the five step hypothesis testing procedure outlined below. This is the same procedure that we used to conduct a hypothesis test for a single mean, single proportion, difference in two means, and difference in two proportions. ... then there is not enough evidence that any of the population proportions are ...
Our sample proportion was 0.02 above the population proportion from the null hypothesis. In a sample of size 500, we would observe a sample proportion 0.02 or more away from 0.84 about 22% of the time by chance alone. Step 4: State a conclusion. Again we compare the P-value to the level of significance, α = 0.05.
At this point you should be more comfortable with the steps of a hypothesis test and not have to number each step, but know what each step means. Critical Value Method . Step 1: State the hypotheses: The key words in this example, "proportion" and "differs," give the hypotheses: H 0: p = 0.856. H 1: p ≠ 0.856 (claim)
Hypothesis Test for Proportions: Step 5 (cont.) •For a left tailed test: 𝐻𝑎: < 0 We have rejection regions for 𝐻 are as follows •Note: all of the rejection region is in the left tail, where is much smaller than 0 Confidence Reject (test stat) Reject (p-value) 0.90 Test-stat<-1.282 P-value<.1 0.95 Test-stat<-1.645 P-value<.05
Step 2: Collect the data. Since the hypothesis test is based on probability, random selection or assignment is essential in data production. Additionally, we need to check whether the sample proportion can be np ≥ 10 and n (1 − p) ≥ 10. Step 3: Assess the evidence. Determine the test statistic which is the z -score for the sample proportion.
The first step in conducting a hypothesis test is to write the hypothesis statements that are going to be tested. ... This is a single sample proportion test. We want to know if the population proportion is different from 0.5, so this is a two-tailed test. Our hypotheses are: \(H_0:p=0.5\) ... Step 4: \(p>0.05\), we should fail to reject the ...
Step 2: Collect the data. Since the hypothesis test is based on probability, random selection or assignment is essential in data production. Additionally, we need to check whether the sample proportion can be np ≥ 10 and n (1 − p) ≥ 10. Step 3: Assess the evidence.
Know the formula for the one-proportion z-test statistic Section 8.2 Hypothesis Testing in Four Steps Write the vocabulary terms in this section on 3 x 5 cards and study them. Important terms include: two-tailed p-value Be sure to also read and study the key points, highlighted in the blue boxes in the text. Know the four steps of hypothesis ...
! ! page!3! STA!205!Notes! Buckley! Fall!2016! Example'2:''In!2013,!18%!of!all!Kentucky!adults!were!uninsured.!With!the!passing!of!the!Affordable!Healthcare!Act,!one ...
Steps for performing Hypothesis Test for a Single Population Proportion. Step 1: State your hypotheses about the population proportion. Step 2: Summarize the data. State a significance level. State and check conditions required for the procedure. P^ = X n P ^ = X n. Conditions for the test:
Table of contents. Step 1: State your null and alternate hypothesis. Step 2: Collect data. Step 3: Perform a statistical test. Step 4: Decide whether to reject or fail to reject your null hypothesis. Step 5: Present your findings. Other interesting articles. Frequently asked questions about hypothesis testing.
The first step in hypothesis testing is to set up two competing hypotheses. The hypotheses are the most important aspect. If the hypotheses are incorrect, your conclusion will also be incorrect. ... Therefore, assuming the true population proportion is 0.5, a sample proportion of 0.556 is 2.504 standard deviations above the mean. The \(z^*\) ...
Solution. We will use the critical value approach to perform the test. The same test will be performed using the p -value approach in Example 8.5.3. We must check that the sample is sufficiently large to validly perform the test. Since ˆp = 270 / 500 = 0.54, √ˆp(1 − ˆp) n = √(0.54)(0.46) 500 ≈ 0.02. hence.
Our alternative hypothesis is that there is a difference. Or that P1 does not equal P2. Or that P1 minus P2, the proportion of men voting minus the proportion of women voting, the true population proportions, do not equal 0. And we're going to do the hypothesis test with a significance level of 5%.
In your example, you can use a two-tailed test to see if the sample proportion of male births, 0.53, is different from the hypothesized population proportion of 0.50. So H0: p=0.50; Ha: p<>0.50. Typically, if there is no a priori reason to believe that any differences must be unidirectional, the two-tailed test is preferred as it is a more ...
Statistics and Probability questions and answers. Step 5: Hypothesis Test for the Population Proportion Suppose the management claims that the proportion of games that your team wins when scoring 80 or more points is 0.50. Test this claim using a 5% level of significance. Make the following edits to the code block below: 1.
5.5 - Hypothesis Testing for Two-Sample Proportions. We are now going to develop the hypothesis test for the difference of two proportions for independent samples. The hypothesis test follows the same steps as one group. These notes are going to go into a little bit of math and formulas to help demonstrate the logic behind hypothesis testing ...
Question: 5. Hypothesis Test for the Population Proportion Suppose the management claims that the proportion of games that your team wins when scoring 20 or more points is 0.50. You tested this claim using a 5% level of significance. Explain the steps you took to test this problem and interpret your results. See Step 5 in the Python script to ...
We toss the coin 100 times and observe 60 heads. Our sample proportion (p̂) is 60/100 = 0.6. Now, we need to determine if this sample proportion provides enough evidence to reject the null hypothesis. Steps of Hypothesis Testing. The hypothesis testing process involves the following steps: State the null and alternative hypotheses: H 0: p = 0.5
3/8/2020. 94% (98) View full document. Step 5: Hypothesis Test for the Population Proportion ¶ Suppose the management claims that the proportion of games that your team wins when scoring 80 or more points is 0.50. Test this claim using a 5% level of significance. Make the following edits to the code block below:
A hypothesis test can help determine if a difference in the estimated proportions reflects a difference in the population proportions. The difference of two proportions follows an approximate normal distribution. Generally, the null hypothesis states that the two proportions are the same. That is, \(H_{0}: p_{A} = p_{B}\).
hypothesis test for a population Proportion calculator. Fill in the sample size, n, the number of successes, x, the hypothesized population proportion p0 p 0, and indicate if the test is left tailed, <, right tailed, >, or two tailed, ≠ ≠ . Then hit "Calculate" and the test statistic and p-Value will be calculated for you. n: x: p0 p 0