Latest revision as of 03:49, 23 April 2025

Introduction

This is all about R².

My Thoughts

Least Squares Review

Most of this requires you to think about a dataset with lots of points. What we are trying to do with least squares is find the best-fitting line for our data points: the line that minimises the sum of squared differences between each observed value $y_{i}$ and the value ${\hat{y}}_{i}$ predicted by the line. Once we have this we can predict the y-value for a new data point given its x-value. Here is the formula:
$S = \sum_{i = 1}^{n} (y_{i} - {\hat{y}}_{i})^{2}$
And here is an example of usage
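A minimal Python sketch of least squares for a single straight line. The data points and the function names `fit_line` and `sum_squared_residuals` are made up for illustration, and the closed-form slope and intercept are the standard textbook ones, not anything specific to this page.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Closed-form solution for simple linear regression.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def sum_squared_residuals(xs, ys, slope, intercept):
    """S = sum of (observed y - predicted y)^2, the quantity being minimised."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))


# Hypothetical data, chosen only to make the arithmetic easy to follow.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.1, 5.9, 8.2, 9.8]

slope, intercept = fit_line(xs, ys)
S = sum_squared_residuals(xs, ys, slope, intercept)

# Predict the y-value for a new x-value using the fitted line.
new_x = 6.0
predicted_y = slope * new_x + intercept

print(round(slope, 3), round(intercept, 3), round(S, 3), round(predicted_y, 3))
```

Any other line through these points would give a larger value of S, which is what "best fit" means here.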

R Squared

With $R^{2}$ we are comparing the variation of the data around the mean with the variation of the data around the fitted line. The differences are squared so that positive and negative differences do not cancel each other out.

What is the difference

Well I guess R² = R squared, where R is the correlation coefficient. R² is the proportion of the variance in the dependent variable that is explained by the independent variable, so it can be read as a percentage. Therefore R² = 0.4 means 40%, and R = ±√0.4 ≈ ±0.63. I guess I agree that R² is the easier of the two to interpret; however, there is no sign on R², so it does not tell you the direction of the relationship.
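A small Python sketch of the relationship between R and R², assuming R is the Pearson correlation coefficient for a single predictor. The data and the helper name `pearson_r` are hypothetical. It shows that an R² of 0.4 corresponds to an R of about ±0.63, and that flipping the direction of the relationship changes the sign of R but leaves R² unchanged.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


# Going from R squared back to R only recovers the magnitude, not the sign.
r_magnitude = math.sqrt(0.4)

# Hypothetical data: one increasing relationship and its mirror image.
xs = [1, 2, 3, 4, 5]
ys_up = [2, 4, 5, 4, 5]
ys_down = [-y for y in ys_up]

r_up = pearson_r(xs, ys_up)
r_down = pearson_r(xs, ys_down)

print(round(r_magnitude, 3))
print(round(r_up, 3), round(r_up ** 2, 3))
print(round(r_down, 3), round(r_down ** 2, 3))
```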

Formula for R²

This is given by
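In standard notation it is usually written as follows. The names $\text{SS}_{\text{res}}$ and $\text{SS}_{\text{tot}}$ are the conventional ones and are assumed here:
$R^{2} = 1 - \frac{\text{SS}_{\text{res}}}{\text{SS}_{\text{tot}}} = \frac{\text{SS}_{\text{tot}} - \text{SS}_{\text{res}}}{\text{SS}_{\text{tot}}}$
where $\text{SS}_{\text{res}} = \sum_{i = 1}^{n} (y_{i} - {\hat{y}}_{i})^{2}$ is the sum of squared residuals around the fitted line (the same quantity as $S$ in the least squares formula) and $\text{SS}_{\text{tot}} = \sum_{i = 1}^{n} (y_{i} - \bar{y})^{2}$ is the sum of squared differences around the mean $\bar{y}$.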

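A reminder of how we calculate variance: we add up the squared differences from the mean and divide by the number of values, like below. Note that the graphic shows the population formula, which divides by n; for a sample we should divide by n-1 instead. I kept it because I liked the graphic.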
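The difference between dividing by n and by n-1 can be checked with Python's standard `statistics` module. The data values below are made up for illustration.

```python
import statistics

# Hypothetical sample of five measurements.
data = [2, 4, 5, 4, 5]

mean = sum(data) / len(data)
sum_sq_diff = sum((v - mean) ** 2 for v in data)

# Population variance divides by n, sample variance divides by n - 1.
population_var = sum_sq_diff / len(data)
sample_var = sum_sq_diff / (len(data) - 1)

# The standard library gives the same two numbers.
lib_population_var = statistics.pvariance(data)
lib_sample_var = statistics.variance(data)

print(population_var, sample_var)
```

The sample variance is always the larger of the two, and the gap shrinks as the number of values grows.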

This was a nice picture

F

Spent a lot of time looking at this but got nowhere. It is a way to get a p-value. For F the following formula was given.

The Pfit and Pmean explanations were provided. Pfit is the number of parameters in the fit, e.g. for a mouse we could look at weight, size and tail. Pmean is the number of parameters in the mean-only model, which is normally 1. The n is the number of data points, and the denominator is the variation not explained by the fit. This took a while, but I believe that variation to be built from the residuals, observed ($y$) minus predicted ($\hat{y}$).
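A hedged Python sketch of how R² and F could be computed for a single straight-line fit. It assumes the commonly used form F = ((SS(mean) - SS(fit)) / (Pfit - Pmean)) / (SS(fit) / (n - Pfit)), which may differ in notation from the formula pictured on this page. The data and all variable names are hypothetical. Turning F into a p-value needs the F distribution, which is not in the standard library, so that step is left out.

```python
# Hypothetical data: one predictor (x) and one response (y).
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
n = len(xs)

mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Least-squares line through the points.
slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
         / sum((x - mean_x) ** 2 for x in xs))
intercept = mean_y - slope * mean_x

# Variation around the mean, and variation around the fitted line (residuals).
ss_mean = sum((y - mean_y) ** 2 for y in ys)
ss_fit = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))

# Parameter counts: the line has a slope and an intercept, the mean has one value.
p_fit = 2
p_mean = 1

r_squared = (ss_mean - ss_fit) / ss_mean

# Assumed form of F: explained variation per extra parameter, divided by
# unexplained variation per remaining degree of freedom.
f_stat = ((ss_mean - ss_fit) / (p_fit - p_mean)) / (ss_fit / (n - p_fit))

print(round(r_squared, 3))  # 0.6
print(round(f_stat, 3))     # 4.5
```

A large F means the fit explains much more variation per parameter than is left unexplained per data point, which is what leads to a small p-value.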

