Correlation and Association
Correlation does not imply causation. Two variables can move together perfectly without one causing the other.
Correlation Coefficient
The Pearson correlation coefficient () measures the linear relationship between two variables, ranging from -1 to 1.
Rank Correlation
Spearman's Rank Correlation evaluates how well the relationship between two variables can be described using a monotonic function, without requiring it to be perfectly linear.
Homoscedasticity vs. Heteroscedasticity
- Homoscedasticity: The variance of errors is constant across all levels of the independent variable.
- Heteroscedasticity: The variance of errors changes (often forming a cone shape in a scatter plot), which violates many linear regression assumptions.
Interactive Correlation Explorer
Correlation scatter plot
Adjust r and see how the cloud tightens/loosens. Toggle heteroscedasticity for cone-shaped variance.
Test Your Knowledge
Example: Covariance and Pearson r
Two variables and have a Covariance of . The standard deviation of is and the standard deviation of is . Calculate the Pearson correlation coefficient .
View Step-by-Step Solution
The formula for the Pearson correlation coefficient is:
The correlation is 0.75, indicating a strong positive linear relationship between and .