Download Article
Download Article
Spearman's rank correlation coefficient allows you to identify whether two variables relate in a monotonic function (i.e., that when one number increases, so does the other, or vice versa). To calculate Spearman's rank correlation coefficient, you'll need to rank and compare data sets to find Σd 2 , then plug that value into the standard or simplified version of Spearman's rank correlation coefficient formula. You can also calculate this coefficient using Excel formulas or R commands.
Things You Should Know
- Calculate the coefficient by finding Σd 2 , then plugging that value into the Spearman's rank correlation coefficient formula.
- You can also easily calculate this coefficient using Excel.
- Alternatively, you can find this coefficient using R commands.
Steps
Calculation Help
-
Draw your data table. This will organize the information you need to calculate Spearman's Rank Correlation Coefficient. You will need: [1] X Research source
- 6 Columns, with headers as shown below.
- As many rows as you have pairs of data.
-
Fill in the first two columns with your pairs of data.Advertisement
-
In your third column rank the data in your first column from 1 to n (the number of data you have). Give the lowest number a rank of 1, the next lowest number a rank of 2, and so on.
-
In your fourth column do the same as in step 3, but instead rank the second column.
-
If two (or more) pieces of data in one column are the same, find the mean of the ranks as if those pieces of data had been ranked normally, then rank the data with this mean.
In the example at right, there are two 5s that would otherwise have ranks of 2 and 3. Since there are two 5s, take the mean of their ranks. The mean of 2 and 3 is 2.5, so assign the rank 2.5 to both 5s.
-
If two (or more) pieces of data in one column are the same, find the mean of the ranks as if those pieces of data had been ranked normally, then rank the data with this mean.
-
In the "d" column calculate the difference between the two numbers in each pair of ranks. That is, if one is ranked 1 and the other 3 the difference would be 2. (The sign doesn't matter, since the next step is to square this number.) [2] X Research source
-
Square each of the numbers in the "d" column and write these values in the "d 2 " column.
-
Choose one of these formulae:
- If there was no tie in previous steps, insert this value into the simplified Spearman's Rank Correlation Coefficient formula
and replace the "n" with the number of pairs of data you have to calculate the answer. [3] X Research source
- If there were ties in any of previous steps, use the standard Spearman's Rank Correlation Coefficient formula instead:
- If there was no tie in previous steps, insert this value into the simplified Spearman's Rank Correlation Coefficient formula
-
9Interpret your result. It can vary between -1 and 1.
- Close to -1 - Negative correlation.
- Close to 0 - No linear correlation.
- Close to 1 - Positive correlation.
Advertisement
-
1Create new columns with the ranks of your existing columns. For example, if your data is in Column A2:A11, you want to use the formula "=RANK(A2,A$2:A$11)", and copy it down and across for all your rows and columns. [4] X Research source
-
2Break ties as described in step 3, 4 method 1.
-
3In a new cell, do a correlation between the two rank columns with something like "=CORREL(C2:C11,D2:D11)". In this case, C and D would correspond to the rank columns. The correlation cell will have your Spearman's Rank Correlation.
Advertisement
-
1
-
2Save your data as a CSV file with the data you want to correlate in the first two columns. You can typically do this through the "Save as" menu.
-
3Open the R editor. If you are on the terminal, simply run R. From the desktop, you want to click on the R logo.
-
4Type the commands:
- d <- read.csv("NAME_OF_YOUR_CSV.csv") and hit enter
- cor(rank(d[,1]),rank(d[,2])) [5] X Research source
Advertisement
Community Q&A
Search
-
QuestionHow do I find coefficients?Community AnswerThe coefficient is next to the variable. With the example, "4pq," the coefficient is 4.
-
QuestionIs there a correlation between data 1 and data 2?Community AnswerNo, there should not be a correlation.
Ask a Question
200 characters left
Include your email address to get a message when this question is answered.
Submit
Advertisement
Tips
- Most data sets should contain at least 5 pairs of data in order to identify a trend (3 were used for the example to make it easier to demonstrate).Thanks
Submit a Tip
All tip submissions are carefully reviewed before being published
Name
Please provide your name and last initial
Thanks for submitting a tip for review!
Advertisement
Warnings
- Spearman's rank correlation coefficient will only identify the strength of correlation where the data is consistently increasing or decreasing. If a scatter graph of the data any other trend Spearman's rank will not give an accurate representation of its correlation.Thanks
- This formula is based on the assumption that there are no ties. When there are ties such as in the example one should use the definition: the product moment correlation coefficient based on the ranks.Thanks
Advertisement
References
- ↑ https://www.simplilearn.com/tutorials/statistics-tutorial/spearmans-rank-correlation
- ↑ https://www.simplilearn.com/tutorials/statistics-tutorial/spearmans-rank-correlation
- ↑ https://geographyfieldwork.com/SpearmansRank.htm
- ↑ http://www.real-statistics.com/correlation/spearmans-rank-correlation/
- ↑ http://www.sthda.com/english/wiki/correlation-test-between-two-variables-in-r#compute-correlation-in-r
About This Article
Thanks to all authors for creating a page that has been read 851,710 times.
Advertisement