PDF download Download Article

Easily create a multiple regression model in your Excel spreadsheet

PDF download Download Article

Trying to create a multiple regression model in Excel? It's pretty easy to do using the built-in data analysis tools. Multiple regression is a great way to examine how multiple independent variables explain the variation in a dependent variable. This wikiHow guide will show you how to run a multiple regression in Microsoft Excel on Windows or Mac.

Things You Should Know

  • You'll need to enable the Analysis ToolPak, a built-in add-in for Excel, before running a regression analysis.
  • Make sure your data is arranged in adjacent columns with the first row being the headers.
  • Click "Data Analysis" in the "Data" tab and select "Regression" to set up the analysis.
  1. Whether you're studying statistics or doing regression professionally, Excel is a great tool for running the analysis. Excel has a built-in data analysis add-in called "Analysis ToolPak." You can check to see if it's active by clicking the Data tab. If you don't see the Data Analysis option, you will need to enable it: [1]
    • Windows:
      • Open the File tab (or press Alt+F) and select Options (Windows).
      • Click Add-Ins on the left side of the window.
      • Select Excel Add-ins next to "manage" and click Go .
      • In the new window, check the box next to "Analysis ToolPak", then click OK . This will enable the built-in data analysis add-in.
    • Mac:
      • Click Tools and then Excel Add-ins .
      • Check the box next to Analysis ToolPak and click OK .
      • Note that you may need to click Browse to find the Analysis ToolPak .
      • If the Data Analysis tool doesn't appear in the Data tab, close and reopen Excel.
  2. Data must be arranged in immediately adjacent columns and labels should be in the first row of each column. This is a typical format for databases .
    Advertisement
  3. This option is in the "Analysis" section near the far right of Data tab options.
    • Another powerful tool is Excel's Solver function, an optimization model feature.
  4. This will open a new window for inputting the parameters of the regression model.
  5. To do so:
    • Click the "Input Y Range" field.
    • Highlight the column containing your dependent variable values.
    • Click the Labels checkbox if your data has a header row.
  6. To do so:
    • Click the "Input X Range" field.
    • Highlight the column or columns containing your dependent variable values. This can include multiple columns if you have more than one independent variable.
    • Note: The independent variable data columns must be adjacent to one another for the input to work.
  7. You can change the following parts of the analysis in the Regression window:
    • The default confidence level is 95%. If you want to change this value, click the box next to Confidence Level and modify the adjacent value.
    • Under "Output options," select where you want the regression results to output.
    • Select the desired options in the "Residuals" category. Graphical residual outputs are created by the Residual Plots and Line Fit Plots options.
  8. You'll see the following information:
    • Regression Statistics includes the correlation values, standard error, and number of observations.
    • ANOVA is a table displaying the degrees of freedom, sum squares, mean squares, F value, and F significance. You can use this table to assess the statistical significance of the model.
    • The confidence interval table shows the regression coefficient, standard error, significance, and confidence interval for each regression parameter (the intercept and the independent variable slopes).
  9. Advertisement

Expert Q&A

Search
Add New Question
  • Question
    what is output range
    Kyle Smith
    wikiHow Technology Writer
    Kyle Smith is a wikiHow Technology Writer, learning and sharing information about the latest technology. He has presented his research at multiple engineering conferences and is the writer and editor of hundreds of online electronics repair guides. Kyle received a BS in Industrial Engineering from Cal Poly, San Luis Obispo.
    wikiHow Technology Writer
    Expert Answer
    The output range in the regression window is where the results of the analysis will appear. This can be in an existing sheet, a new sheet, or an entirely new workbook file.
  • Question
    getting #NUM error p- values. What does it mean?
    Kyle Smith
    wikiHow Technology Writer
    Kyle Smith is a wikiHow Technology Writer, learning and sharing information about the latest technology. He has presented his research at multiple engineering conferences and is the writer and editor of hundreds of online electronics repair guides. Kyle received a BS in Industrial Engineering from Cal Poly, San Luis Obispo.
    wikiHow Technology Writer
    Expert Answer
    This could indicate that you don't have enough observations to compute the significance of the model or its parameters. Try adding additional observations if this is the case.
  • Question
    What is the formula of multiple regression used by Excel in the background?
    Kyle Smith
    wikiHow Technology Writer
    Kyle Smith is a wikiHow Technology Writer, learning and sharing information about the latest technology. He has presented his research at multiple engineering conferences and is the writer and editor of hundreds of online electronics repair guides. Kyle received a BS in Industrial Engineering from Cal Poly, San Luis Obispo.
    wikiHow Technology Writer
    Expert Answer
    Excel is using the linear regression formula y = m1x1 + m2x2 + ... + b. It calculates the values using the least squares method.
See more answers
Ask a Question
      Advertisement

      Video

      Tips

      Submit a Tip
      All tip submissions are carefully reviewed before being published
      Name
      Please provide your name and last initial
      Thanks for submitting a tip for review!

      About This Article

      Thanks to all authors for creating a page that has been read 868,652 times.

      Reader Success Stories

      • Lal Jayasinghe

        Jul 27, 2016

        "I knew it was possible to predict future values of a variable using multiple regression, but I had absolutely no ..." more
      Share your story

      Is this article up to date?

      Advertisement