Mathematics · Probability & Statistics · Inferential Statistics
T-Test Calculator
Calculates the one-sample t-statistic and degrees of freedom to test whether a sample mean differs significantly from a known or hypothesised population mean.
Calculator
Formula
t = \dfrac{\bar{x} - \mu_0}{s / \sqrt{n}}
\bar{x} is the sample mean, \mu_0 is the hypothesised population mean (null hypothesis value), s is the sample standard deviation, and n is the sample size. The denominator s / \sqrt{n} is the standard error of the mean. Degrees of freedom are given by df = n - 1.
Source: Student (W.S. Gosset), 'The Probable Error of a Mean', Biometrika, 6(1), 1908.
How it works
The one-sample t-test addresses the question: given a sample drawn from a population, is there sufficient evidence that the true population mean differs from some pre-specified value μ0? The procedure rests on the assumption that the underlying data are approximately normally distributed — a condition that, by the Central Limit Theorem, holds reasonably well for sample sizes of 30 or more even when the population distribution is not perfectly normal. When the population standard deviation σ is unknown (the common case), the t-distribution is used instead of the standard normal distribution because it accounts for the extra uncertainty introduced by estimating variability from the sample itself.
The formula is t = (x̄ − μ0) / (s / √n). The numerator measures the raw difference between what you observed (the sample mean x̄) and what you expected under the null hypothesis (μ0). The denominator — the standard error of the mean — scales this difference by how much sampling variability you would expect given your sample size n and sample standard deviation s. A larger sample size shrinks the standard error, making the same raw difference yield a larger t-statistic and therefore stronger evidence against the null hypothesis. The degrees of freedom df = n − 1 determine the shape of the reference t-distribution; as df increases the t-distribution approaches the standard normal.
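The calculation can be sketched in a few lines of Python (a minimal illustration with made-up numbers; the function name and values are ours, not part of the calculator):

```python
import math

def one_sample_t(sample_mean, mu0, s, n):
    """Return (t-statistic, degrees of freedom) for a one-sample t-test."""
    se = s / math.sqrt(n)           # standard error of the mean
    t = (sample_mean - mu0) / se    # observed difference in standard-error units
    return t, n - 1                 # df = n - 1

t_stat, df = one_sample_t(105, 100, 10, 25)
# se = 10 / 5 = 2, so t_stat = 5 / 2 = 2.5 and df = 24
```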
In practice the t-statistic is compared to a critical value from the t-distribution at a chosen significance level α (most commonly 0.05 for a two-tailed test). If |t| exceeds the critical value, or equivalently if the p-value is less than α, the null hypothesis is rejected. The t-test is used across virtually every quantitative discipline: a pharmaceutical company testing whether a new drug produces a different mean biomarker level than the known baseline, a manufacturer checking whether produced components meet a specification, an educator evaluating whether a new curriculum changes average test scores, or a financial analyst testing whether a portfolio's mean return differs from a benchmark.
Worked example
Suppose a nutritionist hypothesises that adults in a particular city consume a mean of μ0 = 2000 kcal per day. She surveys a random sample of n = 36 adults and finds a sample mean of x̄ = 2150 kcal with a sample standard deviation of s = 420 kcal.
Step 1 — Compute the standard error:
SE = s / √n = 420 / √36 = 420 / 6 = 70 kcal
Step 2 — Compute the t-statistic:
t = (x̄ − μ0) / SE = (2150 − 2000) / 70 = 150 / 70 ≈ 2.1429
Step 3 — Degrees of freedom:
df = n − 1 = 36 − 1 = 35
Step 4 — Interpret:
For a two-tailed test at α = 0.05 with df = 35, the critical t-value is approximately ±2.030. Since |2.1429| > 2.030, we reject the null hypothesis and conclude there is statistically significant evidence that mean daily caloric intake in this city differs from 2000 kcal. The corresponding p-value is approximately 0.039, confirming the result is significant at the 5% level.
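The arithmetic in Steps 1–3 can be checked directly with the standard library (a sketch; the exact p-value needs a t-distribution routine, so it is shown only as a hedged comment using SciPy):

```python
import math

x_bar, mu0 = 2150, 2000   # sample mean and hypothesised mean (kcal)
s, n = 420, 36            # sample standard deviation and sample size

se = s / math.sqrt(n)     # Step 1: 420 / 6 = 70
t = (x_bar - mu0) / se    # Step 2: 150 / 70 ≈ 2.1429
df = n - 1                # Step 3: 35

# Exact two-tailed p-value, if SciPy is available:
#   from scipy.stats import t as t_dist
#   p = 2 * t_dist.sf(abs(t), df)
```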
Limitations & notes
The one-sample t-test makes several assumptions that should be checked before drawing conclusions. First, observations must be independent — if the data contain repeated measurements, time-series autocorrelation, or cluster effects, the standard error calculation is biased. Second, the test assumes the data (or the sampling distribution of the mean) are approximately normally distributed; for very small samples (n < 15) from heavily skewed or heavy-tailed distributions, consider a non-parametric alternative such as the Wilcoxon signed-rank test. Third, the sample standard deviation s must be a reasonable estimate of population variability — outliers can inflate s dramatically, distorting the t-statistic. Fourth, this calculator covers only the one-sample case; testing the difference between two group means requires an independent-samples or paired t-test. Finally, statistical significance does not imply practical significance — a very large sample can yield a highly significant t-statistic for a trivially small effect. Always report effect size (e.g. Cohen's d) alongside the t-statistic.
Frequently asked questions
What is a good t-statistic value for rejecting the null hypothesis?
There is no universally 'good' t-value because the critical threshold depends on both the degrees of freedom and your chosen significance level. For a two-tailed test at α = 0.05, the critical value approaches ±1.96 as the sample size grows large (df → ∞), but is ±2.306 at df = 8 and ±2.042 at df = 30. You must compare your computed t-statistic to the appropriate critical value from a t-distribution table or use software to obtain an exact p-value.
What is the difference between a one-tailed and two-tailed t-test?
A two-tailed test asks whether the sample mean is significantly different from μ<sub>0</sub> in either direction (higher or lower), while a one-tailed test asks specifically whether it is significantly greater than, or significantly less than, μ<sub>0</sub>. For the same t-statistic, a one-tailed test yields half the p-value of a two-tailed test. Use a one-tailed test only when the direction of the effect was specified before data collection; otherwise the two-tailed test is the safer default.
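The halving relationship can be demonstrated with the large-sample (normal) limit of the t-distribution, which the standard library provides (an approximation sketch; for finite df the exact t-distribution, e.g. via SciPy, would be used instead):

```python
from statistics import NormalDist

t_stat = 2.1429
z = NormalDist()  # standard normal: the df → ∞ limit of the t-distribution

p_two = 2 * (1 - z.cdf(abs(t_stat)))  # two-tailed p-value
p_one = 1 - z.cdf(t_stat)             # one-tailed ("greater than") p-value
# for a positive t_stat, p_one is exactly half of p_two
```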
How large does my sample need to be for the t-test to be valid?
As a practical rule of thumb, n ≥ 30 is often cited as sufficient for the Central Limit Theorem to ensure the sampling distribution of the mean is approximately normal, making the t-test robust even if the underlying data are moderately non-normal. For normally distributed data, the t-test is valid for any sample size. For small samples (n < 15) from clearly non-normal or heavily skewed populations, consider the Wilcoxon signed-rank test instead.
What is the difference between the t-test and the z-test?
The z-test is used when the population standard deviation σ is known, using the standard normal distribution as the reference. The t-test is used when σ is unknown and must be estimated from the sample using s — which is almost always the case in practice. As sample size increases, the t-distribution converges to the standard normal, so for n ≥ 120 the two tests produce nearly identical results.
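The ±1.96 limit quoted earlier is the standard normal quantile, which can be computed with the standard library (a sketch; exact finite-df critical values such as 2.030 at df = 35 would come from a t-distribution routine like SciPy's):

```python
from statistics import NormalDist

alpha = 0.05
z_crit = NormalDist().inv_cdf(1 - alpha / 2)  # two-tailed critical value
# z_crit ≈ 1.96; t critical values shrink toward this limit as df grows
# (e.g. scipy.stats.t.ppf(1 - alpha / 2, df) for exact values)
```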
How do I compute Cohen's d from these results to measure effect size?
Cohen's d for a one-sample t-test is computed as d = (x̄ − μ<sub>0</sub>) / s — the raw mean difference divided by the sample standard deviation (not the standard error). Conventional benchmarks are d = 0.2 (small), d = 0.5 (medium), and d = 0.8 (large). Effect size is independent of sample size, which makes it essential for interpreting whether a statistically significant result is also practically meaningful.
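Applied to the worked example above (a quick check, not part of the calculator's output):

```python
x_bar, mu0, s = 2150, 2000, 420   # values from the worked example

d = (x_bar - mu0) / s  # Cohen's d: mean difference in standard-deviation units
# d = 150 / 420 ≈ 0.357 — between the "small" (0.2) and "medium" (0.5) benchmarks
```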
Last updated: 2025-01-15 · Formula verified against primary sources.