Stack Exchange network consists of 183 Q&A communities including Stack Overflow , the largest, most trusted online community for developers to learn, share their knowledge, and build their careers.
Q&A for work
Connect and share knowledge within a single location that is structured and easy to search.
I have 3 populations, let's call them cluster 1, cluster 2 and cluster 3. The data are continuous. I want to see if there's a difference between the means of the three clusters. I know that the t-test tests for the difference of means, but that is only for 2 samples. What test should I use for multiple samples, i.e. 3, in my case?
If you want a multi-group analog of a t-test it sounds like you just want ANOVA (analysis of variance) or something similar to it. That's exactly what it's for - comparing group means.
Specifically, you seem to be asking for one-way analysis of variance .
Any decent statistics package does ANOVA.
If you don't want to assume normality (just as you would for a t-test), there are a variety of options that still allow a test of means (including permutation tests and GLMs), but if your samples are large, moderate non-normality won't impact things much.
There's also the issue of potential heteroskedasticity; in the normal case many packages offer an approximation via an adjustment to error degrees of freedom (Welch-Satterthwaite) that often performs quite well. If heteroskedasticity is related to mean, you may be better off looking at an ANOVA-like model fitted as a GLM.
However, if the clusters are generated by performing cluster analysis on data, the theory for t-tests, ANOVA, GLMs, permutation tests, etc no longer holds. None of the p-values would be correct
Sign up or log in, post as a guest.
Required, but never shown
By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy .
Teach yourself statistics
This lesson explains how to conduct a hypothesis test of a mean, when the following conditions are met:
Generally, the sampling distribution will be approximately normally distributed if any of the following conditions apply.
This approach consists of four steps: (1) state the hypotheses, (2) formulate an analysis plan, (3) analyze sample data, and (4) interpret results.
Every hypothesis test requires the analyst to state a null hypothesis and an alternative hypothesis . The hypotheses are stated in such a way that they are mutually exclusive. That is, if one is true, the other must be false; and vice versa.
The table below shows three sets of hypotheses. Each makes a statement about how the population mean μ is related to a specified value M . (In the table, the symbol ≠ means " not equal to ".)
Set | Null hypothesis | Alternative hypothesis | Number of tails |
---|---|---|---|
1 | μ = M | μ ≠ M | 2 |
2 | μ M | μ < M | 1 |
3 | μ M | μ > M | 1 |
The first set of hypotheses (Set 1) is an example of a two-tailed test , since an extreme value on either side of the sampling distribution would cause a researcher to reject the null hypothesis. The other two sets of hypotheses (Sets 2 and 3) are one-tailed tests , since an extreme value on only one side of the sampling distribution would cause a researcher to reject the null hypothesis.
The analysis plan describes how to use sample data to accept or reject the null hypothesis. It should specify the following elements.
Using sample data, conduct a one-sample t-test. This involves finding the standard error, degrees of freedom, test statistic, and the P-value associated with the test statistic.
SE = s * sqrt{ ( 1/n ) * [ ( N - n ) / ( N - 1 ) ] }
SE = s / sqrt( n )
t = ( x - μ) / SE
As you probably noticed, the process of hypothesis testing can be complex. When you need to test a hypothesis about a mean score, consider using the Sample Size Calculator. The calculator is fairly easy to use, and it is free. You can find the Sample Size Calculator in Stat Trek's main menu under the Stat Tools tab. Or you can tap the button below.
If the sample findings are unlikely, given the null hypothesis, the researcher rejects the null hypothesis. Typically, this involves comparing the P-value to the significance level , and rejecting the null hypothesis when the P-value is less than the significance level.
In this section, two sample problems illustrate how to conduct a hypothesis test of a mean score. The first problem involves a two-tailed test; the second problem, a one-tailed test.
Problem 1: Two-Tailed Test
An inventor has developed a new, energy-efficient lawn mower engine. He claims that the engine will run continuously for 5 hours (300 minutes) on a single gallon of regular gasoline. From his stock of 2000 engines, the inventor selects a simple random sample of 50 engines for testing. The engines run for an average of 295 minutes, with a standard deviation of 20 minutes. Test the null hypothesis that the mean run time is 300 minutes against the alternative hypothesis that the mean run time is not 300 minutes. Use a 0.05 level of significance. (Assume that run times for the population of engines are normally distributed.)
Solution: The solution to this problem takes four steps: (1) state the hypotheses, (2) formulate an analysis plan, (3) analyze sample data, and (4) interpret results. We work through those steps below:
Null hypothesis: μ = 300
Alternative hypothesis: μ ≠ 300
SE = s / sqrt(n) = 20 / sqrt(50) = 20/7.07 = 2.83
DF = n - 1 = 50 - 1 = 49
t = ( x - μ) / SE = (295 - 300)/2.83 = -1.77
where s is the standard deviation of the sample, x is the sample mean, μ is the hypothesized population mean, and n is the sample size.
Since we have a two-tailed test , the P-value is the probability that the t statistic having 49 degrees of freedom is less than -1.77 or greater than 1.77. We use the t Distribution Calculator to find P(t < -1.77) is about 0.04.
Note: If you use this approach on an exam, you may also want to mention why this approach is appropriate. Specifically, the approach is appropriate because the sampling method was simple random sampling, the population was normally distributed, and the sample size was small relative to the population size (less than 5%).
Problem 2: One-Tailed Test
Bon Air Elementary School has 1000 students. The principal of the school thinks that the average IQ of students at Bon Air is at least 110. To prove her point, she administers an IQ test to 20 randomly selected students. Among the sampled students, the average IQ is 108 with a standard deviation of 10. Based on these results, should the principal accept or reject her original hypothesis? Assume a significance level of 0.01. (Assume that test scores in the population of engines are normally distributed.)
Null hypothesis: μ >= 110
Alternative hypothesis: μ < 110
SE = s / sqrt(n) = 10 / sqrt(20) = 10/4.472 = 2.236
DF = n - 1 = 20 - 1 = 19
t = ( x - μ) / SE = (108 - 110)/2.236 = -0.894
Here is the logic of the analysis: Given the alternative hypothesis (μ < 110), we want to know whether the observed sample mean is small enough to cause us to reject the null hypothesis.
The observed sample mean produced a t statistic test statistic of -0.894. We use the t Distribution Calculator to find P(t < -0.894) is about 0.19.
IMAGES
VIDEO
COMMENTS
10.3 - Multiple Comparisons. If our test of the null hypothesis is rejected, we conclude that not all the means are equal: that is, at least one mean is different from the other means. The ANOVA test itself provides only statistical evidence of a difference, but not any statistical evidence as to which mean or means are statistically different.
Terminology. The factor that varies between samples is called the factor. (Every once in a while things are easy.) The r different values or levels of the factor are called the treatments.Here the factor is the choice of fat and the treatments are the four fats, so r = 4.. The computations to test the means for equality are called a 1-way ANOVA or 1-factor ANOVA.
Step 1: Determine the hypotheses. The hypotheses for a difference in two population means are similar to those for a difference in two population proportions. The null hypothesis, H 0, is again a statement of "no effect" or "no difference.". H 0: μ 1 - μ 2 = 0, which is the same as H 0: μ 1 = μ 2. The alternative hypothesis, H a ...
1. Let's say I have a dataset with two groups (male and female), a target variable ( y y) and multiple features ( X1 X 1, X2 X 2 and X3 X 3 ). I can test the hypothesis that the population means of X1 X 1 are equal between men and women with a simple t t test. H0 H 0: μm X1 = μf X2 μ X 1 m = μ X 2 f.
There are 5 main steps in hypothesis testing: State your research hypothesis as a null hypothesis and alternate hypothesis (H o) and (H a or H 1 ). Collect data in a way designed to test the hypothesis. Perform an appropriate statistical test. Decide whether to reject or fail to reject your null hypothesis. Present the findings in your results ...
Step 1: Determine the hypotheses. The hypotheses for a difference in two population means are similar to those for a difference in two population proportions. The null hypothesis, H 0, is again a statement of "no effect" or "no difference.". H 0: μ 1 - μ 2 = 0, which is the same as H 0: μ 1 = μ 2. The alternative hypothesis, H a ...
We then determine the appropriate test statistic (Step 2) for the hypothesis test. The formula for the test statistic is given below. Test Statistic for Testing H0: p = p 0. if min (np 0 , n (1-p 0 )) > 5. The formula above is appropriate for large samples, defined when the smaller of np 0 and n (1-p 0) is at least 5.
Why Multiple Testing Matters Genomics = Lots of Data = Lots of Hypothesis Tests A typical microarray experiment might result in performing 10000 separate hypothesis tests. If we use a standard p-value cut-off of 0.05, we'd expect 500 genes to be deemed "significant" by chance.
Classification of multiple hypothesis tests. The following table defines the possible outcomes when testing multiple null hypotheses. ... Based on the Poisson distribution with mean 50, the probability of observing more than 61 significant tests is less than 0.05, so if more than 61 significant results are observed, it is very likely that some ...
Dudoit S et al (2003) Multiple hypothesis testing in microarray experiments. Stat Sci 18:71-103. Article Google Scholar Higdon R, van Belle G, Kolker E (2008) A note on the false discovery rate and inconsistent comparison between experiments. Bioinformatics 24:1225-1228
The multiple hypothesis testing is the scenario that we are conducting several hypothesis tests at the same time. Suppose we have ntests, each leads to a p-value. So we can view the 'data' as P 1; ;P n 2[0;1], where P i is the p-value of the i-th test. We can think of this problem as conducting hypothesis tests of n nulls: H 1;0; ;H n;0 ...
simply due to chance keeps going up.Methods for dealing with multiple testing frequently call for adjusting in some way, so that the probability of observing at least one signi cant result due to chance remains. e level.2 The Bonferroni correctionThe Bonferroni correct. on sets the signi cance cut-o at =n. For example, in the example above ...
Since we are being asked for convincing statistical evidence, a hypothesis test should be conducted. In this case, we are dealing with averages from two samples or groups (the home run distances), so we will conduct a Test of 2 Means. n1 = 70 n 1 = 70 is the sample size for the first group. n2 = 66 n 2 = 66 is the sample size for the second group.
discussing the problem of multiple hypothesis testing We begin by assuming that some set of null hypotheses is of primary interest, and that we have a set of observations with a joint distribution ... Consider a hypothesis set involving 4 means, with the highest hypothesis in the hierarchy H 1234 and the six hypotheses H ij;i 6= j = 1;2;3;4 as ...
The .gov means it's official. ... Multiple testing refers to situations where a dataset is subjected to statistical testing multiple times - either at multiple time-points or through multiple subgroups or for multiple end-points. ... "Family" is defined as a set of tests related to the same hypothesis. Various approaches for correcting ...
The plasma glucose concentration means in at least two categories are significantly different. Naturally, we will want to know which category pair has different glucose concentrations. One way to answer this question is to conduct several two-sample tests and then adjust for multiple testing using the Bonferroni correction.
Step 1: Determine the hypotheses. The hypotheses for a difference in two population means are similar to those for a difference in two population proportions. The null hypothesis, H 0, is again a statement of "no effect" or "no difference.". H 0: μ 1 - μ 2 = 0, which is the same as H 0: μ 1 = μ 2. The alternative hypothesis, H a ...
This is a test of two independent groups, two population means. Random variable: ˉXg − ˉXb = difference in the sample mean amount of time girls and boys play sports each day. H0: μg = μb. H0: μg − μb = 0. Ha: μg ≠ μb.
ANOVA and MANOVA tests are used when comparing the means of more than two groups (e.g., the average heights of children, teenagers, and adults). Predictor variable. Outcome variable. Research question example. Paired t-test. Categorical. 1 predictor. Quantitative. groups come from the same population.
Table 8.3: One-sided hypothesis testing for the mean: H0: μ ≤ μ0, H1: μ > μ0. Note that the tests mentioned in Table 8.3 remain valid if we replace the null hypothesis by μ = μ0. The reason for this is that in choosing the threshold c, we assumed the worst case scenario, i.e, μ = μ0 .
I have 3 populations, let's call them cluster 1, cluster 2 and cluster 3. The data are continuous. I want to see if there's a difference between the means of the three clusters. I know that the t-test tests for the difference of means, but that is only for 2 samples. What test should I use for multiple samples, i.e. 3, in my case?
The first set of hypotheses (Set 1) is an example of a two-tailed test, since an extreme value on either side of the sampling distribution would cause a researcher to reject the null hypothesis. The other two sets of hypotheses (Sets 2 and 3) are one-tailed tests, since an extreme value on only one side of the sampling distribution would cause a researcher to reject the null hypothesis.
Example 8.6.4. Statistics students believe that the mean score on the first statistics test is 65. A statistics instructor thinks the mean score is higher than 65. He samples ten statistics students and obtains the scores 65 65 70 67 66 63 63 68 72 71. He performs a hypothesis test using a 5% level of significance.