Kolmogorov-Smirnov Test Calculator
Computes the one-sample Kolmogorov-Smirnov test statistic D and approximate p-value to assess whether a sample follows a specified continuous distribution.
Calculator
Formula
D_n is the KS test statistic — the supremum (maximum) of the absolute difference between F_n(x), the empirical cumulative distribution function (ECDF) of the sample of size n, and F_0(x), the theoretical CDF of the hypothesized distribution. A large D_n provides evidence against the null hypothesis that the sample was drawn from F_0.
Source: Kolmogorov, A.N. (1933). Sulla determinazione empirica di una legge di distribuzione. Giornale dell'Istituto Italiano degli Attuari, 4, 83–91.
How it works
The one-sample KS test compares the empirical cumulative distribution function (ECDF) of your observed data against the theoretical CDF of a specified distribution. The ECDF at any point x is simply the proportion of sample values less than or equal to x. As your sample size grows, the ECDF converges to the true underlying CDF — the KS test formalizes whether the gap between the empirical and theoretical CDFs is small enough to be explained by chance.
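As a minimal sketch of the ECDF idea (plain Python; the function name `ecdf` is illustrative, not from any particular library), the ECDF at x is just the count of sorted sample values at or below x, divided by n:

```python
from bisect import bisect_right

def ecdf(sample):
    """Return the empirical CDF F_n of `sample` as a callable."""
    xs = sorted(sample)
    n = len(xs)
    def F(x):
        # F_n(x) = (number of sample values <= x) / n
        return bisect_right(xs, x) / n
    return F

F = ecdf([10.02, 9.98, 10.05, 10.01, 9.97, 10.03, 9.99, 10.04, 10.00, 10.02])
print(F(10.00))  # 0.4 — four of the ten values are <= 10.00
```

Note that F is right-continuous: it already includes the jump at any observed value, which matters when locating the KS supremum.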
The test statistic D is defined as the maximum absolute difference between the ECDF F_n(x) and the theoretical CDF F_0(x) over all x. Formally, D_n = sup|F_n(x) − F_0(x)|. Because F_n is a step function that jumps at each data point, the supremum is found by comparing F_0 at each sorted value against both the ECDF just after the jump, i/n, and just before it, (i−1)/n. Under the null hypothesis that the sample comes from F_0, the distribution of the scaled statistic (√n · D_n) converges to the Kolmogorov distribution, which allows computation of an asymptotic p-value. The critical value at significance level α is given by c(α)/√n, where c(α) equals approximately 1.2238 at α = 0.10, 1.3581 at α = 0.05, and 1.6276 at α = 0.01.
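Both computations can be sketched in plain Python under the definitions above (function names are illustrative; the normal CDF uses math.erf, and the p-value sums the alternating Kolmogorov tail series 2·Σ(−1)^(k−1)·exp(−2k²nD²)):

```python
import math

def normal_cdf(x, mu=0.0, sigma=1.0):
    """Normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def ks_statistic(sample, cdf):
    """One-sample KS statistic D_n = sup_x |F_n(x) - F_0(x)|."""
    xs = sorted(sample)
    n = len(xs)
    d = 0.0
    for i, x in enumerate(xs, start=1):
        f0 = cdf(x)
        # the ECDF jumps at x: check both just after (i/n) and just before ((i-1)/n)
        d = max(d, abs(i / n - f0), abs((i - 1) / n - f0))
    return d

def ks_pvalue(d, n):
    """Asymptotic p-value: 2 * sum_k (-1)^(k-1) * exp(-2 k^2 n d^2)."""
    t = math.sqrt(n) * d
    s = 2.0 * sum((-1) ** (k - 1) * math.exp(-2.0 * (k * t) ** 2)
                  for k in range(1, 101))
    return max(0.0, min(1.0, s))

data = [10.02, 9.98, 10.05, 10.01, 9.97, 10.03, 9.99, 10.04, 10.00, 10.02]
d = ks_statistic(data, lambda x: normal_cdf(x, 10.00, 0.03))
print(round(d, 4), round(ks_pvalue(d, len(data)), 2))  # 0.2475 0.57
```

Remember that the series-based p-value is an asymptotic approximation and is rough at small n, as noted in the limitations below.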
In practice, the KS test is applied in many fields: engineers use it to verify that material strength data follows a normal distribution before applying stress-based reliability models; financial analysts test whether asset returns follow a theoretical distribution; and machine learning practitioners use two-sample variants to detect dataset drift between training and production data. This calculator handles the one-sample test against normal and uniform distributions, which covers a large fraction of real-world use cases.
Worked example
Suppose a quality engineer measures the diameter (in mm) of 10 ball bearings: 10.02, 9.98, 10.05, 10.01, 9.97, 10.03, 9.99, 10.04, 10.00, 10.02. She wants to test at α = 0.05 whether these diameters are normally distributed with mean μ = 10.00 mm and standard deviation σ = 0.03 mm.
Step 1 — Sort the data: 9.97, 9.98, 9.99, 10.00, 10.01, 10.02, 10.02, 10.03, 10.04, 10.05.
Step 2 — Compute the ECDF and theoretical CDF at each point: Because the ECDF jumps at each sorted value x_i, compare F_0(x_i) against both i/n (the ECDF just after the jump) and (i−1)/n (just before it). For x = 9.97 (i = 1), F_0(9.97) = Φ((9.97−10.00)/0.03) = Φ(−1) ≈ 0.1587; the two differences are |1/10 − 0.1587| = 0.0587 and |0/10 − 0.1587| = 0.1587.
Step 3 — Find the maximum difference: Repeating for all 10 values, the largest absolute difference occurs just below x = 10.02, where F_0(10.02) = Φ(0.6667) ≈ 0.7475 but the ECDF has only reached 5/10 = 0.50, giving D ≈ 0.7475 − 0.50 = 0.2475.
Step 4 — Compare to critical value: At α = 0.05 with n = 10, the asymptotic critical value is 1.3581/√10 ≈ 0.4295. (Exact small-sample tables give a slightly smaller value, ≈ 0.409, which does not change the conclusion here.)
Step 5 — Conclusion: Since D ≈ 0.2475 < 0.4295, we fail to reject H₀. The data are consistent with a normal distribution having μ = 10.00 and σ = 0.03.
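The five steps above can be checked with a short self-contained script (a sketch; the closed form c(α) = √(−ln(α/2)/2) reproduces the asymptotic critical values quoted earlier):

```python
import math

def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

data = sorted([10.02, 9.98, 10.05, 10.01, 9.97, 10.03, 9.99, 10.04, 10.00, 10.02])
n = len(data)

# Steps 2-3: compare F_0 against the ECDF just after (i/n) and just before ((i-1)/n) each point
D = max(max(abs(i / n - normal_cdf(x, 10.00, 0.03)),
            abs((i - 1) / n - normal_cdf(x, 10.00, 0.03)))
        for i, x in enumerate(data, start=1))

# Step 4: asymptotic critical value c(alpha)/sqrt(n), with c(alpha) = sqrt(-ln(alpha/2)/2)
crit = math.sqrt(-math.log(0.05 / 2.0) / 2.0) / math.sqrt(n)

# Step 5: fail to reject H0 when D < crit
print(round(D, 4), round(crit, 4), D < crit)  # 0.2475 0.4295 True
```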
Limitations & notes
The KS test has several important limitations to keep in mind. First, it is most powerful near the center of the distribution and less sensitive to differences in the tails, which can miss important departures from normality in extreme values — the Anderson-Darling test is preferred when tail behavior matters. Second, the asymptotic p-value approximation used here is less accurate for very small samples (n < 10); exact tables or simulation-based p-values should be used in those cases. Third, when distribution parameters (mean, standard deviation) are estimated from the same sample being tested rather than specified a priori, the standard KS test becomes too conservative — its actual Type I error rate falls below the nominal level — and the Lilliefors correction should be applied instead. Fourth, the test only evaluates whether the data fit a single fully specified distribution; it does not select or rank competing distributions. Finally, passing the KS test does not prove that the data follow the hypothesized distribution — it merely means the evidence is insufficient to reject that assumption at the chosen significance level.
Frequently asked questions
What is the difference between the one-sample and two-sample KS tests?
The one-sample KS test (computed here) compares a dataset against a specific theoretical distribution such as normal or uniform. The two-sample KS test compares two empirical datasets to determine whether they were drawn from the same underlying distribution, without specifying what that distribution is. Both use the same D statistic concept but different critical value tables.
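As a sketch of the two-sample statistic (plain Python, illustrative name), a merge scan over the pooled sorted values tracks the gap between the two ECDFs, handling ties by advancing past all copies of a value before measuring:

```python
def ks_2sample_statistic(a, b):
    """Two-sample KS statistic: sup_x |F_a(x) - F_b(x)| over the pooled values."""
    a, b = sorted(a), sorted(b)
    na, nb = len(a), len(b)
    i = j = 0
    d = 0.0
    while i < na and j < nb:
        x = min(a[i], b[j])
        # advance both ECDFs past every copy of x before comparing them
        while i < na and a[i] == x:
            i += 1
        while j < nb and b[j] == x:
            j += 1
        d = max(d, abs(i / na - j / nb))
    return d

print(ks_2sample_statistic([1, 2, 3, 4], [3, 4, 5, 6]))  # 0.5
```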
Should I use the KS test or the Shapiro-Wilk test to check normality?
For testing normality specifically, the Shapiro-Wilk test is generally more powerful than the KS test, particularly for small to moderate sample sizes (n ≤ 50). The KS test is more general and can test against any continuous distribution, not just normal. For normality testing with estimated parameters, the Lilliefors test (a modified KS test) is more appropriate than the standard KS test.
What does it mean if the p-value is greater than 0.05?
A p-value greater than 0.05 means you fail to reject the null hypothesis at the 5% significance level. This indicates that the observed maximum difference between your sample's ECDF and the theoretical CDF is not statistically significant — the data are consistent with having been drawn from the specified distribution. It does not prove the distribution is correct, only that you lack sufficient evidence to rule it out.
Can I use the KS test if I estimated the distribution parameters from the data?
Not with standard KS critical values. When you estimate parameters (e.g., mean and standard deviation) from the same sample, the actual Type I error rate is much lower than the nominal level, making the test too conservative. In this situation you should use the Lilliefors test, which provides corrected critical values specifically for parameters estimated from sample data.
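The effect of the correction can also be approximated by simulation (a Monte Carlo sketch with illustrative names, not the published Lilliefors tables): repeatedly draw normal samples, estimate μ and σ from each draw, compute D against the fitted normal, and take the 95th percentile of the simulated D values as the corrected critical value.

```python
import math
import random
import statistics

def normal_cdf(x, mu, sigma):
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))

def ks_stat_fitted(sample):
    """KS statistic against a normal with mu, sigma estimated from `sample` itself."""
    xs = sorted(sample)
    n = len(xs)
    mu = statistics.fmean(xs)
    sigma = statistics.stdev(xs)
    return max(max(abs(i / n - normal_cdf(x, mu, sigma)),
                   abs((i - 1) / n - normal_cdf(x, mu, sigma)))
               for i, x in enumerate(xs, start=1))

def lilliefors_critical(n, alpha=0.05, sims=5000, seed=0):
    """Monte Carlo estimate of the corrected critical value at level alpha."""
    rng = random.Random(seed)
    ds = sorted(ks_stat_fitted([rng.gauss(0.0, 1.0) for _ in range(n)])
                for _ in range(sims))
    return ds[int((1.0 - alpha) * sims)]

# For n = 20 the simulated cutoff lands near the published Lilliefors value (~0.19),
# well below the standard asymptotic KS critical value 1.3581 / sqrt(20) ~ 0.304 —
# which is why the uncorrected test rejects too rarely when parameters are fitted.
```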
How large a sample do I need for the KS test to be reliable?
The asymptotic p-value approximation becomes increasingly accurate as n grows and is generally reliable for n ≥ 30. For smaller samples, the test has low power — it may fail to detect even substantial departures from the hypothesized distribution. For very small samples (n < 10), use exact KS tables rather than the asymptotic approximation. Power increases substantially as n exceeds 50 or 100.
Last updated: 2025-01-15 · Formula verified against primary sources.