R Squared: Difference between revisions

From bibbleWiki
Jump to navigation Jump to search
Line 24: Line 24:
[[File:Variance cal.png|400px]]
[[File:Variance cal.png|400px]]
=F=
=F=
So we know this is formula for <math>R^2</math><br>
Spent a lot of time looking a this but got nowhere. It seems to be a way to get a p-value which you can use to see if the data is valuable
<math>
R^2 = \frac{\text{SS}_{\text{model}}}{\text{SS}_{\text{tot}}}
</math>
<br>
Where
*<math>{\text{SS}_{\text{res}}}</math> is the sum of squared residuals, which measures the variability of the observed data around the predicted values.
*<math>{\text{SS}_{\text{tot}}}</math> is the total sum of squares, which measures the variability of the observed data around the mean.
So now we move on to F
<br>
<math>
F = SS_{\text{mean}} - \frac{\frac{SS_{\text{fit}}}{Pfit - Pmean}}{\frac{SS_{\text{fit}}}{n - Pfit}}
</math>
<br>
*𝑆𝑆 mean: Represents the sum of squares around the mean.
*𝑆𝑆 fit: Represents the sum of squares explained by the fit or model.
*𝑃𝑓𝑖𝑡−𝑃𝑚𝑒𝑎𝑛: Captures the difference between individual predicted values (𝑃𝑓𝑖𝑡) and their mean (𝑃𝑚𝑒𝑎𝑛).
*𝑛−𝑃𝑓𝑖𝑡: Adjusts for the total observations relative to predicted fit values.

Revision as of 02:57, 23 April 2025

Introduction

This is all about R².

My Thoughts

Least Squares Review

Most of this requires you to think about a dataset with lots of points. What we are trying to do is with least squares is find the best fit for a line for our data points. Once we have this we could maybe predict for a new data point what the y-value might be given the x-value. Here is the formula
S=i=1n(yiy^i)2
And here is an example of usage

R Squared

With R2 we are looking at the variances (changes) using the mean and the line. Squaring means we don't care about negative or positive.

What is the difference

Well I guess R² = R squared. R² is the variance between a dependent variable and an independent variable in terms of percentage. Therefore 0.4 R² = 40% and R = 0.2. I guess I agree that using R² does provide an easier way to understand what you mean however there is no sign on R².

Formula for R²

This is given by

A reminder of how we calculate variance, we add up the differences from the mean like below. Note this shows a population and we should divide by n-1 not n but I liked the graphic.

This was a nice picture

F

Spent a lot of time looking a this but got nowhere. It seems to be a way to get a p-value which you can use to see if the data is valuable