PDF download Download Article PDF download Download Article

The standard error of estimate measures the accuracy of the predictions made by a regression model. In other words, it determines how well the regression line describes the values of a data set. If you have a collection of data from an experiment, survey, or other source, follow along with us below to learn how to calculate your data set’s standard error of estimate.

Part 1
Part 1 of 2:

Tabulating Your Data

PDF download Download Article
  1. Any statistical work is generally made easier by having your data in a concise format. A simple table serves this purpose very well. To calculate the standard error of estimate, you will be using five different measurements or calculations. Therefore, creating a five-column table is helpful. Label the five columns as follows: [1]
    • Note that the table shown in the image above performs the opposite subtractions, . The more standard order, however, is . Because the values in the final column are squared, the negative is not problematic and will not change the outcome. Nevertheless, you should recognize that the more standard calculation is .
  2. After collecting your data, you will have pairs of data values. For these statistical calculations, the independent variable is labeled and the dependent, or resulting, variable is . Enter these values into the first two columns of your data table. [2]
    • The order of the data and the pairing is important for these calculations. You need to be careful to keep your paired data points together in order.
    • For the sample calculations shown above, the data pairs are as follows:
      • (1,2)
      • (2,4)
      • (3,5)
      • (4,4)
      • (5,5)
    Advertisement
  3. Using your data results, you will be able to calculate a regression line. This is also called a line of best fit or the least squares line. The calculation is tedious but can be done by hand. Alternatively, you can use a handheld graphing calculator or some online programs that will quickly calculate a best fit line using your data. [3]
    • For this article, it is assumed that you will have the regression line equation available or that it has been predicted by some prior means.
    • For the sample data set in the image above, the regression line is .
  4. Using the equation of that line, you can calculate predicted y-values for each x-value in your study, or for other theoretical x-values that you did not measure. [4]
    • Using the equation of the regression line, calculate or “predict” values of for each value of x. Insert the x-value into the equation, and find the result for as follows:
  5. Advertisement
Part 2
Part 2 of 2:

Performing the Calculations

PDF download Download Article
  1. In the fourth column of your data table, you will calculate and record the error of each predicted value. Specifically, subtract the predicted value ( ) from the actual observed value ( ). [5]
    • For the data in the sample set, these calculations are as follows:
  2. Take each value in the fourth column and square it by multiplying it by itself. Fill in these results in the final column of your data table.
    • For the sample data set, these calculations are as follows:
  3. The statistical value known as the sum of squared errors (SSE) is a useful step in finding standard deviation, variance and other measurements. To find the SSE from your data table, add the values in the fifth column of your data table. [6]
    • For this sample data set, this calculation is as follows:
  4. The Standard Error of the Estimate is the square root of the average of the SSE. It is generally represented with the Greek letter . Therefore, the first calculation is to divide the SSE score by the number of measured data points. Then, find the square root of that result. [7]
    • If the measured data represents an entire population, then you will find the average by dividing by N, the number of data points. However, if you are working with a smaller sample set of the population, then substitute N-2 in the denominator.
    • For the sample data set in this article, we can assume that it is a sample set and not a population, just because there are only 5 data values. Therefore, calculate the Standard Error of the Estimate as follows:
  5. The Standard Error of the Estimate is a statistical figure that tells you how well your measured data relates to a theoretical straight line, the line of regression. A score of 0 would mean a perfect match, that every measured data point fell directly on the line. Widely scattered data will have a much higher score. [8]
    • With this small sample set, the standard error score of 0.894 is quite low and represents well organized data results.
  6. Advertisement

Expert Q&A

Ask a Question
      Advertisement

      Video

      Tips

      Submit a Tip
      All tip submissions are carefully reviewed before being published
      Name
      Please provide your name and last initial
      Thanks for submitting a tip for review!

      About This Article

      Article Summary X

      To calculate the standard error of estimate, create a five-column data table. In the first two columns, enter the values for your measured data, and enter the values from the regression line in the third column. In the fourth column, calculate the predicted values from the regression line using the equation from that line. These are the errors. Fill in the fifth column by multiplying each error by itself. Add together all of the values in column 5, then take the square root of that number to get the standard error of estimate. To learn how to organize the data pairs, keep reading!

      Did this summary help you?
      Thanks to all authors for creating a page that has been read 217,164 times.

      Did this article help you?

      Advertisement