
What Is Correlation Formula Types and How to Calculate
The concept of correlation plays a key role in mathematics and statistics. It helps us measure and understand how two variables or sets of numbers are related. This is essential in subjects like Probability, Statistics, and even real-life scenarios such as marks and attendance, weather patterns, and more.
What Is Correlation?
Correlation in Maths is a statistical measurement that describes how two variables change together. In simple words, it shows if and how strongly pairs of numbers are related. For example, if you study more hours and your marks increase, these two variables have a positive correlation. You’ll find this concept applied in areas such as statistics, data analysis, and probability.
Key Formula for Correlation
Here’s the standard formula for the Pearson correlation coefficient:
\( r = \frac{n\sum xy - (\sum x)(\sum y)}{\sqrt{[n\sum x^2 - (\sum x)^2][n\sum y^2 - (\sum y)^2]}} \)
Where:
n = Number of observations
x, y = Values of the two variables
Σ = Sigma, means sum of all values
Types of Correlation
| Type | Description | Example |
|---|---|---|
| Positive Correlation | Both variables increase or decrease together | Study time & exam marks |
| Negative Correlation | One variable increases, the other decreases | Temperature & heater usage |
| Zero Correlation | No linear relationship | Shoe size & intelligence |
Interpreting the Correlation Coefficient (r)
The correlation coefficient (r) tells you the strength and direction of a linear relationship:
| r Value | Relationship |
|---|---|
| +1 | Perfect positive correlation |
| 0 | No correlation |
| -1 | Perfect negative correlation |
Values closer to +1 or -1 mean stronger relationships. The sign shows the direction (positive or negative).
Methods of Calculating Correlation
| Method | Where Used |
|---|---|
| Pearson’s Correlation | For linear relationships with interval/ratio data |
| Spearman’s Rank Correlation | For ranked or ordinal data |
| Kendall’s Tau | For small sample sizes or tied ranks |
Step-by-Step Illustration
- Suppose you have two lists: X (hours studied) = 2, 4, 6 and Y (marks) = 50, 65, 80
- Find Σx = 2 + 4 + 6 = 12, Σy = 50 + 65 + 80 = 195
- Find Σxy = (2×50) + (4×65) + (6×80) = 100 + 260 + 480 = 840
- Find Σx² = (2²) + (4²) + (6²) = 4 + 16 + 36 = 56
- Find Σy² = (50²) + (65²) + (80²) = 2500 + 4225 + 6400 = 13125
- Substitute into the Pearson formula:
\( r = \frac{3(840) - (12)(195)}{\sqrt{[3(56)-(12)^2][3(13125)-(195)^2]}} \)
Continue solving the numerator and denominator for the final answer.
Frequent Errors and Misunderstandings
- Confusing correlation (relationship) with causation (cause and effect).
- Forgetting to use correct formulas when sample size is small or using ranks.
- Reading "0 correlation" to mean negative correlation — it actually means no relationship.
Speed Trick or Vedic Shortcut
When checking direction quickly, just scan the pattern: if both X and Y tend to rise together, the correlation is likely positive. If one rises while the other falls, it’s negative. A quick glance at a scatter plot saves time in exams.
Try These Yourself
- Given X = 1, 2, 3 and Y = 2, 4, 6, find the correlation coefficient.
- Describe if shoe size and intelligence will be positively, negatively, or not correlated.
- Draw a scatter plot for X: 3, 4, 5 and Y: 9, 8, 7. Is the correlation positive or negative?
- Explain how you would distinguish between correlation and regression.
Relation to Other Concepts
The idea of correlation connects closely with topics such as Correlation Coefficient, Regression Analysis, and Probability. Understanding correlation is key for data analysis, exams, and research.
Classroom Tip
A simple way to remember: If two lines on a scatter plot go up together, correlation is positive. If one goes up and the other down, it’s negative. A scattered, cloud-like plot means no correlation. Vedantu’s teachers break down such visual rules for faster learning in live sessions.
Wrapping It All Up
We explored correlation—from definition, formulas, types, interpretation of the coefficient, worked steps, and its connection to concepts like Variance and Standard Deviation. Practice more with Vedantu and become confident in reading relationships between numbers. Correlation is your basic tool for statistics, science, and exam success!
FAQs on Correlation in Statistics Explained Clearly
1. What is correlation in statistics?
Correlation is a statistical measure that describes the strength and direction of the relationship between two variables. In statistics, correlation shows how one variable changes in relation to another.
- If both variables increase together, it is positive correlation.
- If one increases while the other decreases, it is negative correlation.
- If there is no clear pattern, it is zero correlation.
2. What is the formula for the correlation coefficient?
The formula for the Pearson correlation coefficient is r = Cov(X,Y) / (σX σY). This measures the linear relationship between two variables.
- Cov(X,Y) = covariance between X and Y
- σX = standard deviation of X
- σY = standard deviation of Y
3. What does the correlation coefficient tell you?
The correlation coefficient tells you the strength and direction of a linear relationship between two variables. The value of r ranges from -1 to +1.
- r = +1: perfect positive linear correlation
- r = -1: perfect negative linear correlation
- r = 0: no linear correlation
4. How do you calculate correlation step by step?
You calculate correlation by finding covariance and dividing it by the product of standard deviations. The steps to compute the Pearson correlation coefficient are:
- Find the mean of X and Y.
- Subtract the mean from each value to get deviations.
- Multiply corresponding deviations and sum them to get covariance.
- Divide covariance by (standard deviation of X × standard deviation of Y).
5. What is the difference between positive and negative correlation?
Positive correlation means both variables move in the same direction, while negative correlation means they move in opposite directions.
- Positive correlation (r > 0): As X increases, Y increases.
- Negative correlation (r < 0): As X increases, Y decreases.
6. What is zero correlation?
Zero correlation means there is no linear relationship between two variables. When the correlation coefficient r = 0, changes in one variable do not show a consistent linear pattern with the other.
- It does not mean the variables are independent.
- There may still be a nonlinear relationship.
7. Can you give an example of correlation with numbers?
Yes, if X = {1, 2, 3} and Y = {2, 4, 6}, the correlation coefficient is r = +1. This is because Y increases exactly twice as fast as X.
- As X increases, Y increases proportionally.
- The points lie perfectly on a straight line.
8. What is the difference between correlation and causation?
Correlation measures association, while causation means one variable directly causes a change in another. Correlation only shows that two variables move together.
- High correlation does not prove cause-and-effect.
- There may be a third variable influencing both.
9. What are the types of correlation?
The main types of correlation are positive, negative, and zero correlation. In statistics, correlation can also be classified by form.
- Positive correlation
- Negative correlation
- Zero correlation
- Linear correlation
- Nonlinear (curvilinear) correlation
10. What are the limitations of correlation?
The main limitation of correlation is that it does not prove causation and only measures linear relationships. Important limitations of correlation analysis include:
- It cannot establish cause-and-effect.
- It only measures linear association.
- It is sensitive to outliers.
- A high correlation may be misleading due to hidden variables.

































