Download Article
A simple guide to understanding and applying the empirical rule
Download Article
The empirical rule, also known as the 68-95-99.7 rule, is a handy way to analyze statistical data. It only work for a normal distribution (bell curve), however, and can only produce estimates. You’ll need to know the mean and standard deviation of your data. If you’re using the empirical rule for a class or test, this information should be given to you. Then, you can use the rule to do things like estimate how much of your data falls within a given range.
Steps
-
Draw out and divide a bell curve. Sketch out a normal curve, where the highest point is in the middle, and the ends slope down and taper off symmetrically to the left and right. Next, draw several vertical lines intersecting the curve: [1] X Research source
- 1 line should divide the curve in half.
- Draw 3 lines to the right of this middle line, and 3 more to the left. These should divide each of the curve's halves into 3 evenly spaced sections and one tiny section at the tip.
-
Write the values from your normal distribution on the dividing lines. Mark the line in the middle with the mean of your data. Then, add standard deviations to get the values for the 3 lines to the right. Subtract standard deviations from your mean to get values for the 3 lines to the left. For example: [2] X Research source
- Suppose your data has a mean of 16 and a standard deviation of 2. Mark the center line with. 16.
- Add standard deviations to mark the first line to the right of the center with 18, the next to the right with 20, and the rightmost line with 22.
- Subtract standard deviations to mark the first line to the left of the center with 14, the next line to the left with 12, and the leftmost line with 10.
Advertisement -
Mark the percentages for each section. The basic point empirical rule is easy to grasp: 68 percent of data points for a normal distribution will fall within 1 standard deviation of the mean, 95 percent within 2 standard deviations, and 99.7 percent within 3 standard deviations. To remind yourself, mark each section with a percentage: [3] X Research source
- Each section immediately to the right and left of the center line will contain 34%, for a total of 68.
- The next sections to the right and left will each contain 13.5%. Add these to the 68 percent to get 95% of your data.
- The next sections over on each side will each contain 2.35% of your data. Add these to the 95 percent to get 99.7% of your data.
- The remaining tiny left and right tips of the data each contain 0.15% of the remaining data, for a total of 100%.
Advertisement
-
Find the distributions of your data. Take your mean, and use the empirical rule to find the distributions of data 1, 2, and 3 standard deviations from the mean. Write these on your curve for reference. For example, imagine you are analyzing the weights of a population of cats, where the mean weight is 4 kilograms, with a standard deviation of 0.5 kilograms: [4] X Research source
- 1 standard deviation above the mean would equal 4.5 kg, and 1 standard deviation below equals 3.5 kg.
- 2 standard deviations above the mean would equal 5 kg, and 2 standard deviations below would equal 3 kg.
- 3 standard deviations above the mean would equal kg, and 3 standard deviations below would equal 2.5 kg.
-
Determine the section of the curve your question asks you to analyze. Once you have your curve set up, you can use the Empirical Rule and simple arithmetic to solve data analysis questions. Start by reading your question carefully to figure out the sections you need to work with. For example: [5] X Research source
- Imagine you are asked to find the upper and lower weights for 68% of a population of cats. You would need to look at the two centermost sections, where 68% of data will fall.
- Similarly, imagine the mean weight is 4 kilograms, with a standard deviation of 0.5 kilograms. If you are asked to find the proportion of cats above 5 kilograms, you need to look at the rightmost section (2 standard deviations away from the mean).
-
Find the percentage of your data within a certain range. If you’re asked to find the percentage of the population between a certain range, all you have to do is add up the percentages within a given set of standard deviations. For instance, if you are asked to find the percentage of cats who weigh between 3.5 and 5 kilograms, if the mean weight is 4 kilograms, with a standard deviation of 0.5 kilograms: [6] X Research source
- 2 standard deviations above the mean will be 5 kilograms, and 1 standard deviation below the mean will be 3.5 kilograms.
- This means that 81.5% (68% + 13.5%) of the cats weigh between 3.5 and 5 kilograms.
-
Use the section percentages to find data points and ranges. Take the information provided by the percentage distributions and standard deviations to find upper and lower limits for portions of your data. For instance, a question about your cat weight data may ask: “What is the upper weight limit of the lowest 2.5% of cats?” [7] X Research source
- The lowest 2.5% of data would fall below 2 standard deviations from the mean.
- If the mean is 4 kilograms, and standard deviation is 0.5, then the lowest 2.5% of cats will weight 3 kilograms or less (4 - 0.5 x 2).
Advertisement
Community Q&A
Search
-
QuestionWhat is the percentage of people who lost 10 pounds or more? What is its percentile?David ReynoldsCommunity AnswerTo determine the percentage of people who have lost 10 pounds or more, you would need specific data from a study or survey that tracks weight loss in a population. Without such data, it's impossible to provide an accurate percentage. The concept of a percentile in this context would refer to where an individual's weight loss falls in comparison to others in a dataset. For example, if you are in the 80th percentile for weight loss, it means you have lost more weight than 80% of the people in that group. Again, determining this requires specific data on weight loss across a population.
-
QuestionA normal distribution has a mean of 60 and a standard deviation of 17. Between what values would you expect to find 95% of the data?David ReynoldsCommunity AnswerIn a normal distribution with a mean of 60 and a standard deviation of 17 you would expect to find approximately 95% of the data between the values of 26.68 and 93.32.
-
QuestionHow do you find the mean using the empirical rule?David ReynoldsCommunity AnswerThe empirical rule (or the 68-95-99.7 rule) is not used for finding the mean. It's used when the mean and standard deviation of a normally distributed dataset are known. It states that about 68% of values are within one standard deviation of the mean, 95% within two, and 99.7% within three. To find the mean, you typically sum all the values in a dataset and divide by the number of values.
Ask a Question
200 characters left
Include your email address to get a message when this question is answered.
Submit
Advertisement
Tips
Submit a Tip
All tip submissions are carefully reviewed before being published
Name
Please provide your name and last initial
Thanks for submitting a tip for review!
References
- ↑ https://online.stat.psu.edu/stat200/lesson/2/2.2/2.2.7
- ↑ https://www.khanacademy.org/math/probability/xa88397b6:analyze-quantitative/normal-distributions-a2ii/v/ck12-org-normal-distribution-problems-empirical-rule
- ↑ https://www.omnicalculator.com/statistics/empirical-rule
- ↑ https://www.youtube.com/watch?v=T7-eeg6rhjY
- ↑ https://stats.libretexts.org/Bookshelves/Introductory_Statistics/Book%3A_Introductory_Statistics_(Shafer_and_Zhang)/02%3A_Descriptive_Statistics/2.05%3A_The_Empirical_Rule_and_Chebyshev's_Theorem
- ↑ https://www.khanacademy.org/math/probability/xa88397b6:analyze-quantitative/normal-distributions-a2ii/v/ck12-org-normal-distribution-problems-empirical-rule
- ↑ https://stats.libretexts.org/Bookshelves/Introductory_Statistics/Book%3A_Introductory_Statistics_(Shafer_and_Zhang)/02%3A_Descriptive_Statistics/2.05%3A_The_Empirical_Rule_and_Chebyshev's_Theorem
About This Article
Thanks to all authors for creating a page that has been read 82,553 times.
Reader Success Stories
- "I was struggling the whole period, but I found this and was done with my assignment super fast."
Advertisement