The sum of squared errors, or SSE, is a preliminary statistical calculation that leads to other values. For a set of data values, it measures how closely those values cluster around their mean. To find it, you organize your data in a table and then perform some fairly simple calculations. Once you have the SSE for a data set, you can go on to find the variance and standard deviation.

Method 1 of 3:

Calculating SSE by Hand

  1. The clearest way to calculate the sum of squared errors is to begin with a three-column table. Label the three columns “Value,” “Deviation,” and “Deviation squared.” [1]
  2. The first column holds the values of your measurements. These may be the results of an experiment, a statistical study, or just data provided for a math problem. [2]
    • In this case, suppose you are working with some medical data and you have a list of the body temperatures of ten patients. The normal body temperature expected is 98.6 degrees Fahrenheit. The temperatures of the ten patients are measured and give the values 99.0, 98.6, 98.5, 101.1, 98.3, 98.6, 97.9, 98.4, 99.2, and 99.1. Write these values in the first column.
  3. Before you can calculate the error for each measurement, you must calculate the mean of the full data set. [3]
    • Recall that the mean of any data set is the sum of the values divided by the number of values in the set. This can be represented symbolically, with the variable μ representing the mean, as: μ = (Σx)/n
    • For this data, the mean is calculated as: μ = (99.0 + 98.6 + 98.5 + 101.1 + 98.3 + 98.6 + 97.9 + 98.4 + 99.2 + 99.1)/10 = 988.7/10 = 98.87
  4. In the second column of your table, you need to fill in the error measurements for each data value. The error is the difference between the measurement and the mean. [4]
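    • For the given data set, subtract the mean, 98.87, from each measured value, and fill in the second column with the results. These ten calculations are as follows:
      • 99.0 - 98.87 = 0.13
      • 98.6 - 98.87 = -0.27
      • 98.5 - 98.87 = -0.37
      • 101.1 - 98.87 = 2.23
      • 98.3 - 98.87 = -0.57
      • 98.6 - 98.87 = -0.27
      • 97.9 - 98.87 = -0.97
      • 98.4 - 98.87 = -0.47
      • 99.2 - 98.87 = 0.33
      • 99.1 - 98.87 = 0.23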
  5. In the third column of the table, find the square of each of the resulting values in the middle column. These represent the squares of the deviation from the mean for each measured value of data. [5]
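    • For each value in the middle column, use your calculator and find the square. Record the results in the third column, as follows:
      • 0.13^2 = 0.0169
      • (-0.27)^2 = 0.0729
      • (-0.37)^2 = 0.1369
      • 2.23^2 = 4.9729
      • (-0.57)^2 = 0.3249
      • (-0.27)^2 = 0.0729
      • (-0.97)^2 = 0.9409
      • (-0.47)^2 = 0.2209
      • 0.33^2 = 0.1089
      • 0.23^2 = 0.0529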
  6. The final step is to find the sum of the values in the third column. The desired result is the SSE, or the sum of squared errors. [6]
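    • For this data set, the SSE is calculated by adding together the ten values in the third column: SSE = 0.0169 + 0.0729 + 0.1369 + 4.9729 + 0.3249 + 0.0729 + 0.9409 + 0.2209 + 0.1089 + 0.0529 = 6.921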
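The hand calculation above can also be checked in code. The following Python sketch is an illustration only and is not part of the original method; the helper name sum_of_squared_errors is a hypothetical choice. It builds the same three columns for the ten temperatures and adds up the third.

```python
# Body temperatures of the ten patients from the example above.
temperatures = [99.0, 98.6, 98.5, 101.1, 98.3, 98.6, 97.9, 98.4, 99.2, 99.1]


def sum_of_squared_errors(values):
    """Return (mean, deviations, squared deviations, SSE) for a list of numbers.

    Hypothetical helper for illustration; it mirrors the three-column table.
    """
    mean = sum(values) / len(values)
    deviations = [v - mean for v in values]   # second column
    squared = [d ** 2 for d in deviations]    # third column
    return mean, deviations, squared, sum(squared)


mean, deviations, squared, sse = sum_of_squared_errors(temperatures)

# Print the table, one row per measurement.
print(f"{'Value':>8} {'Deviation':>10} {'Deviation squared':>18}")
for v, d, s in zip(temperatures, deviations, squared):
    print(f"{v:>8.1f} {d:>10.2f} {s:>18.4f}")
print(f"Mean = {mean:.2f}")
print(f"SSE  = {sse:.4f}")
```

Run as written, the last two lines should report a mean of 98.87 and an SSE of 6.9210, matching the table worked by hand.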
Method 2 of 3:

Creating an Excel Spreadsheet to Calculate SSE

  1. You will create a three-column table in Excel, with the same three headings as above. [7]
    • In cell A1, type in the heading “Value.”
    • In cell B1, enter the heading “Deviation.”
    • In cell C1, enter the heading “Deviation squared.”
  2. In the first column, you need to type in the values of your measurements. If the set is small, you can simply type them in by hand. If you have a large data set, you may need to copy and paste the data into the column. [8]
  3. Excel has a function that will calculate the mean for you. In some vacant cell underneath your data table (it really doesn’t matter what cell you choose), enter the following: [9]
    • =Average(A2:___)
    • Do not actually type a blank space. Fill in that blank with the cell name of your last data point. For example, if you have 100 points of data, you will use the function:
      • =Average(A2:A101)
      • This function includes data from A2 through A101 because the top row contains the headings of the columns.
    • When you press Enter or when you click away to any other cell on the table, the mean of your data values will automatically fill the cell that you just programmed.
  4. In the first empty cell in the “Deviation” column, you need to enter a function to calculate the difference between each data point and the mean. To do this, you need to use the cell name where the mean resides. Let’s assume for now that you used cell A104. [10]
    • The function for the error calculation, which you enter into cell B2, will be:
      • =A2-$A$104. The dollar signs make the reference to cell A104 absolute, so it stays locked on the mean when the formula is copied into other rows.
  5. In the third column, you can direct Excel to calculate the square that you need. [11]
    • In cell C2, enter the function
      • =B2^2
  6. After you have entered the functions in the top cell of each column, B2 and C2 respectively, you need to fill in the full table. You could retype the function in every line of the table, but this would take far too long. Instead, select cells B2 and C2 together, then click the fill handle (the small square at the lower-right corner of the selection) and, without letting go of the mouse button, drag down to the bottom cell of each column.
    • If we are assuming that you have 100 data points in your table, you will drag your mouse down to cells B101 and C101.
    • When you then release the mouse button, the formulas will be copied into all the cells of the table. The table should be automatically populated with the calculated values.
  7. Column C of your table contains all the squared-error values. The final step is to have Excel calculate the sum of these values. [12]
    • In a cell below the table, probably C102 for this example, enter the function:
      • =Sum(C2:C101)
    • When you press Enter or click away into any other cell of the table, you should have the SSE value for your data.
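Cell references are the easiest part of the spreadsheet to get wrong, so it can help to list the formulas for your own number of data points before typing them in. The Python sketch below only generates formula text; it does not drive Excel. The function name sse_formulas and its mean_row parameter are hypothetical, and the default placement of the mean three rows below the data is an assumption chosen to match the A104 example for 100 points.

```python
def sse_formulas(n_points, mean_row=None):
    """Build the worksheet formulas described above for n_points values in column A.

    mean_row is the row of the column-A cell that holds the mean. By default
    (an assumption) it sits three rows below the data, as in the A104 example.
    """
    if n_points < 1:
        raise ValueError("need at least one data point")
    last_row = n_points + 1  # row 1 holds the headings, so data ends here
    if mean_row is None:
        mean_row = last_row + 3
    if mean_row <= last_row:
        raise ValueError("the mean cell must sit below the data")

    formulas = {f"A{mean_row}": f"=AVERAGE(A2:A{last_row})"}
    for row in range(2, last_row + 1):
        formulas[f"B{row}"] = f"=A{row}-$A${mean_row}"  # deviation from the mean
        formulas[f"C{row}"] = f"=B{row}^2"              # squared deviation
    formulas[f"C{last_row + 1}"] = f"=SUM(C2:C{last_row})"  # the SSE
    return formulas


formulas = sse_formulas(100)
for cell in ("A104", "B2", "B101", "C2", "C102"):
    print(cell, formulas[cell])
```

As a cross-check inside Excel itself, the built-in DEVSQ function is documented as returning the sum of squares of deviations from the mean, so entering =DEVSQ(A2:A101) in a spare cell should agree with the value in C102.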
Method 3 of 3:

Relating SSE to Other Statistical Data

  1. Finding the SSE for a data set is generally a building block to finding other, more useful, values. The first of these is variance. The variance is a measurement that indicates how much the measured data varies from the mean. It is actually the average of the squared differences from the mean. [13]
    • Because the SSE is the sum of the squared errors, you can find the average (which is the variance), just by dividing by the number of values. However, if you are calculating the variance of a sample set, rather than a full population, you will divide by (n-1) instead of n. Thus:
      • Variance = SSE/n, if you are calculating the variance of a full population.
      • Variance = SSE/(n-1), if you are calculating the variance of a sample set of data.
    • For the sample problem of the patients’ temperatures, we can assume that 10 patients represent only a sample set. Therefore, the variance would be calculated as: Variance = SSE/(n-1) = 6.921/9 = 0.769
  2. The standard deviation is a commonly used value that indicates how much the values of any data set deviate from the mean. The standard deviation is the square root of the variance. Recall that the variance is the average of the squared error measurements. [14]
    • Therefore, after you calculate the SSE, you can find the standard deviation of a sample set as follows: Standard deviation = √(SSE/(n-1)). For a full population, divide by n instead of (n-1).
    • For the data sample of the temperature measurements, you can find the standard deviation as follows: Standard deviation = √(6.921/9) = √0.769 ≈ 0.877
  3. This article has focused on data sets that measure only a single value at a time. However, in many studies, you may be comparing two separate values. You would want to know how those two values relate to each other, not only to the mean of the data set. The measure of that relationship is the covariance. [15]
    • The calculations for covariance are too involved to detail here, other than to note that you will use the deviations from the mean for each data type, the same deviations that go into the SSE, and multiply them together in pairs. For a more detailed description of covariance and the calculations involved, see Calculate Covariance.
    • As an example of the use of covariance, you might want to compare the ages of the patients in a medical study to the effectiveness of a drug in lowering fever temperatures. Then you would have one data set of ages and a second data set of temperatures. You would find the deviations from the mean for each data set, and then from there find the SSE, variance, standard deviations and covariance.
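The relationships in this method can be sketched in Python as follows. This is an illustration under stated assumptions, not the article's own procedure: the helper name sample_covariance is hypothetical, the ten patients are treated as a sample (so the divisor is n-1), and the results are cross-checked against Python's standard statistics module. No second data set is given in the example, so the covariance helper is only exercised on the temperatures paired with themselves, where it should reduce to the sample variance.

```python
import math
import statistics

# Body temperatures of the ten patients from the example above.
temperatures = [99.0, 98.6, 98.5, 101.1, 98.3, 98.6, 97.9, 98.4, 99.2, 99.1]


def sample_covariance(xs, ys):
    """Sum of the paired deviation products, divided by (n - 1).

    Hypothetical helper for illustration only.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length lists with at least two values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    products = [(x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)]
    return sum(products) / (len(xs) - 1)


n = len(temperatures)
mean = sum(temperatures) / n
sse = sum((t - mean) ** 2 for t in temperatures)

population_variance = sse / n       # divide by n for a full population
sample_variance = sse / (n - 1)     # divide by (n - 1) for a sample
sample_std_dev = math.sqrt(sample_variance)

# Cross-check against the standard library's sample statistics.
assert math.isclose(sample_variance, statistics.variance(temperatures))
assert math.isclose(sample_std_dev, statistics.stdev(temperatures))
# The covariance of a data set with itself is its sample variance.
assert math.isclose(sample_covariance(temperatures, temperatures), sample_variance)

print(f"SSE = {sse:.3f}")
print(f"Sample variance = {sample_variance:.3f}")
print(f"Sample standard deviation = {sample_std_dev:.3f}")
```

With the temperature data, this should print an SSE of 6.921, a sample variance of 0.769, and a sample standard deviation of 0.877.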

Community Q&A

  • Question
    The Excel method did not give me the correct value. What should I do?
    Sidhu Mossewala
    Community Answer
    If the Excel method didn't give the expected value, double-check the formula syntax, verify that each data range covers all of your values, step through the formulas one cell at a time, and make sure every value is stored as a number rather than as text. If the problem persists, consult the Excel documentation or ask someone experienced for help.

      About This Article

      Article Summary

      To calculate the sum of squares for error, start by finding the mean of the data set by adding all of the values together and dividing by the total number of values. Then, subtract the mean from each value to find the deviation for each value. Next, square the deviation for each value. Finally, add all of the squared deviations together to get the sum of squares for error.

      Thanks to all authors for creating a page that has been read 552,768 times.
