
Curve Fitting

Fit equations to data and analyze residuals

Key equation: R² = 1 − SS_res / SS_tot

Curve fitting finds mathematical functions that best match experimental data. The quality of fit is measured by R², displayed here from 0 (no better than predicting the mean) to 1 (perfect fit). Residuals (differences between data and fit) should be randomly distributed for a good model. Choosing the right functional form requires understanding the underlying physics or phenomenon.


What is Curve Fitting?

Curve Fitting is the process of finding a mathematical function that best represents a set of measured data points. In science and engineering, real-world measurements are always scattered — a ball dropped from a height does not follow a textbook parabola exactly because of air resistance, timing errors, and measurement noise. Curve fitting lets you cut through that scatter to reveal the underlying relationship.

This simulation lets you plot data points, choose a model type (linear, quadratic, cubic, or sinusoidal), and observe how the algorithm adjusts the model's coefficients to minimize the sum of squared residuals — the vertical gaps between each data point and the fitted curve. The quality of fit is summarized by R², the coefficient of determination. In this simulation R² is displayed from 0 to 1; mathematically, un-clamped R² can be negative for fits worse than the mean-only model.

Alongside R², inspecting the residual plot — a graph of those vertical gaps — reveals whether the model captures the true structure of the data or whether a systematic pattern remains. These skills transfer directly to lab reports, AP exam free-response questions, and real scientific practice.
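The quantities described above — a least-squares fit, its residuals, and R² — can be reproduced in a few lines of NumPy. This is an illustrative sketch, not the simulation's actual code; the example data (y = 2x + 1 plus Gaussian scatter) is invented for demonstration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Noisy linear data: y = 2x + 1 plus Gaussian measurement scatter
x = np.linspace(0, 10, 15)
y = 2 * x + 1 + rng.normal(0, 1.0, x.size)

# Least-squares linear fit
m, b = np.polyfit(x, y, 1)
y_fit = m * x + b

# Residuals: vertical gaps between each data point and the fitted curve
residuals = y - y_fit

# Coefficient of determination: R² = 1 - SS_res / SS_tot
ss_res = np.sum(residuals**2)
ss_tot = np.sum((y - y.mean())**2)
r_squared = 1 - ss_res / ss_tot

print(f"slope = {m:.2f}, intercept = {b:.2f}, R² = {r_squared:.3f}")
```

Note that `ss_tot` measures the scatter around the mean, so R² compares the fitted model against the "horizontal line through the mean" baseline mentioned in the FAQ below.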

Parameters explained

Fit Type
Selects the type of mathematical model applied to the data (integer index 0 through 4, corresponding to models such as linear y = mx + b, quadratic y = ax² + bx + c, cubic, and sinusoidal). Choosing the right model requires understanding what physical or mathematical relationship you expect in the data — fitting a quadratic to genuinely linear data will often improve R² slightly but wastes a parameter and may produce misleading predictions outside the measured range.
Data Noise (%)
Sets the magnitude of random scatter added to the data points, from 0% (perfectly noiseless, on-curve data) to 50% (heavily scattered). Real experimental data typically falls somewhere in between. Higher noise makes it harder to distinguish a good fit from a mediocre one using R² alone, and it makes the residual plot more important for diagnosing whether the chosen model is appropriate.
Data Points
Controls how many data points are plotted, from 5 to 50 (default 15). Fewer points make individual outliers more influential and R² less reliable. More points give the fitting algorithm more information, typically producing a more stable and trustworthy estimate of the model parameters. This parameter is available in the pro tier and illustrates why scientists collect multiple trials rather than relying on a single measurement.
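The Data Noise (%) and Data Points parameters can be mimicked offline. The simulation's internal generator is not published, so the `make_data` helper below is a hypothetical sketch: it scales Gaussian scatter by the noise percentage of the signal's range, which matches the behavior the parameter descriptions imply:

```python
import numpy as np

def make_data(num_points=15, noise_pct=10.0, seed=0):
    """Generate data on y = 0.5x² with scatter proportional to the
    signal's range, mimicking the Data Noise (%) parameter."""
    rng = np.random.default_rng(seed)
    x = np.linspace(0, 10, num_points)
    y_true = 0.5 * x**2
    scale = (noise_pct / 100.0) * (y_true.max() - y_true.min())
    return x, y_true + rng.normal(0, scale, num_points)

# 15 points at 10% noise — roughly the simulation's defaults
x, y = make_data(num_points=15, noise_pct=10.0)
```

Setting `noise_pct=0` returns points exactly on the curve, matching the "perfectly noiseless, on-curve data" end of the slider.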

Common misconceptions

  • Misconception: A high R² value means the model is correct and will make good predictions.

    Correct: R² measures how much variance in the data the model explains within the range of measured points. It does not guarantee the model is physically correct, does not protect against overfitting with unnecessary parameters, and says nothing about how well the model predicts outside the measured range (extrapolation). A cubic polynomial often achieves R² > 0.99 on noisy data that is genuinely linear simply because extra parameters absorb random scatter.

  • Misconception: Correlation between two variables means one causes the other.

    Correct: A high R² or strong correlation tells you two variables move together in the data — it says nothing about why. Ice cream sales and drowning rates are historically correlated because both rise in summer; neither causes the other. Establishing causation requires controlled experiments, not just correlation. The residual plot and R² describe the mathematical relationship in the data, not the physical mechanism.

  • Misconception: Adding more parameters (higher polynomial degree) always gives a better model.

    Correct: More parameters always improve R² on the training data — a degree-n polynomial can pass through n+1 points exactly. But overfitting produces a curve that wiggles through the noise rather than capturing the real trend, and its predictions between or beyond the measured points can be wildly wrong. Good model selection balances fit quality against model complexity, often using physical reasoning to decide how many parameters are justified.

  • Misconception: Randomly scattered residuals mean the fit is bad.

    Correct: Randomly scattered residuals are actually the hallmark of a good fit. They indicate that the model has captured the systematic trend in the data and that what remains is unpredictable measurement noise. The warning sign is systematic residuals — a U-shape, a sinusoidal wave, or a diagonal drift — which tells you the model is missing a real structural feature of the data.
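The overfitting misconception above can be verified numerically: a higher-degree polynomial never scores lower on the training data, even when the underlying relationship is linear. A minimal NumPy sketch with invented linear data:

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(0, 10, 15)
y = 3 * x + 2 + rng.normal(0, 2.0, x.size)   # genuinely linear data

def r2(y, y_fit):
    """Coefficient of determination."""
    ss_res = np.sum((y - y_fit) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1 - ss_res / ss_tot

# Compare a linear fit against a degree-6 polynomial on the same data
r2_by_degree = {}
for deg in (1, 6):
    coeffs = np.polyfit(x, y, deg)
    r2_by_degree[deg] = r2(y, np.polyval(coeffs, x))

# The degree-6 fit scores at least as high on the training data,
# even though the true relationship is linear — its extra wiggles
# absorb noise, not signal.
```

The degree-6 curve's small R² gain comes entirely from chasing scatter, which is exactly why out-of-range predictions from overfit models go wrong.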

How teachers use this lab

  1. Linear vs. quadratic comparison: set fit_type to linear (index 0), data_noise to 10%, num_points to 15 and record R². Then switch fit_type to quadratic and record R² again. Students calculate the gain and debate whether the improvement justifies the extra parameter, practicing the principle of parsimony (Occam's Razor) in model selection.
  2. Noise sensitivity investigation: fix fit_type to linear and num_points to 20. Sweep data_noise from 0% to 10% to 30% to 50%, recording R² at each level. Students plot noise vs. R² and observe how scatter degrades fit quality, building intuition about why repeated trials and careful measurement technique matter in real experiments.
  3. Residual pattern diagnosis: set fit_type to linear but use a dataset generated from a quadratic relationship (set noise low). Students observe the U-shaped residual pattern and diagnose that a linear model is missing a quadratic term — then switch to quadratic and watch residuals randomize. Teaches the residual plot as a diagnostic tool, not just R².
  4. Sample size and stability: fix fit_type to linear and data_noise to 20%. Compare num_points at 5 vs. 15 vs. 40 by noting how much the fitted line shifts between runs. Students recognize that small samples produce unreliable parameter estimates, connecting to the statistical concept of standard error and why scientists replicate measurements.
  5. Sinusoidal fit for periodic data: switch fit_type to sinusoidal and increase data_noise to 15%. Ask students to predict what R² they expect when the underlying data was generated with periodic structure. Compare to a linear fit on the same data. This demonstrates that model choice matters as much as parameter tuning, and connects to Fourier analysis concepts in physics.
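Activity 2's noise sweep can be rehearsed outside the simulation. The sketch below assumes, as a stand-in for the simulation's hidden generator, that scatter scales with the noise percentage of the signal's range:

```python
import numpy as np

rng = np.random.default_rng(2)

def r2_at_noise(noise_pct, num_points=20):
    """Fit a line to noisy linear data and return R²."""
    x = np.linspace(0, 10, num_points)
    y_true = 2 * x + 1
    scale = (noise_pct / 100.0) * (y_true.max() - y_true.min())
    y = y_true + rng.normal(0, scale, num_points)
    m, b = np.polyfit(x, y, 1)
    ss_res = np.sum((y - (m * x + b)) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1 - ss_res / ss_tot

# Sweep the noise levels from activity 2 and tabulate R²
results = {pct: r2_at_noise(pct) for pct in (0, 10, 30, 50)}
```

At 0% noise R² is exactly 1; as the noise level climbs toward 50%, R² degrades, which is the trend students are asked to plot.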

Frequently asked questions

What exactly does R² measure?

R² (the coefficient of determination) measures the proportion of total variance in the dependent variable (y) that is explained by the fitted model. An R² of 0.85 means the model accounts for 85% of the variability in y; the remaining 15% is unexplained — attributable to noise, measurement error, or missing variables. Technically, R² = 1 minus the ratio of residual sum of squares to total sum of squares. A perfect fit gives R² = 1.0; a model no better than a horizontal line through the mean gives R² = 0.

What is a residual and why does its distribution matter?

A residual is the vertical distance between a measured data point and the corresponding point on the fitted curve. If the model is correct, residuals should be small and randomly scattered around zero with no discernible pattern. A systematic pattern — a curve, a slope, or a wave — in the residual plot is strong evidence that the chosen model type is wrong for the data. Checking residuals is often more informative than checking R² alone, because R² can be high even when the model is structurally wrong.
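The diagnostic pattern described above is easy to demonstrate: fitting a line to noiseless quadratic data leaves a U-shaped residual plot. A short NumPy sketch (the data here is invented for illustration):

```python
import numpy as np

x = np.linspace(0, 10, 21)
y = x**2                      # noiseless quadratic data

# Fit the WRONG model: a straight line
m, b = np.polyfit(x, y, 1)
residuals = y - (m * x + b)

# The line overshoots in the middle and undershoots at the ends,
# so the residuals form a U: positive at both ends, negative in the middle.
ends_positive = residuals[0] > 0 and residuals[-1] > 0
middle_negative = residuals[len(residuals) // 2] < 0
```

That systematic U-shape — rather than a low R² — is the clearest signal that a quadratic term is missing from the model.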

Why does adding more data points typically improve the fit quality?

More data points give the fitting algorithm more information to work with, reducing the influence of any single outlier and producing more stable, reliable estimates of the model parameters. With only 5 points, one misplaced measurement can dramatically shift the fitted line. With 40 points, the fit averages over many measurements and converges closer to the true underlying relationship. More data points typically improve estimate stability and reduce outlier influence, not necessarily R².
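The stability effect can be quantified by refitting many simulated datasets and measuring how much the slope estimate scatters. This is a sketch with invented data (true slope 2, noise std 2), not the simulation's internals:

```python
import numpy as np

rng = np.random.default_rng(3)

def slope_spread(num_points, trials=200):
    """Standard deviation of the fitted slope across repeated noisy datasets."""
    x = np.linspace(0, 10, num_points)
    slopes = []
    for _ in range(trials):
        y = 2 * x + 1 + rng.normal(0, 2.0, num_points)
        m, _ = np.polyfit(x, y, 1)
        slopes.append(m)
    return np.std(slopes)

spread_small = slope_spread(5)    # 5-point datasets
spread_large = slope_spread(40)   # 40-point datasets
# The slope estimate from 40-point datasets scatters far less run-to-run
# than the one from 5-point datasets.
```

This run-to-run spread is the standard error of the slope: it shrinks as more points are collected, which is the statistical reason behind replicating measurements.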

Can I use a linear fit for data that looks slightly curved?

You can, but you should check the residual plot carefully. If the residuals show a systematic U-shape or arch pattern, a linear model is inadequate — a quadratic or higher-degree polynomial would be more appropriate. However, if the curvature is slight and your theoretical model predicts a linear relationship, consider whether the apparent curve could be due to noise rather than a real nonlinear effect. Physical reasoning about the expected relationship should guide model selection, not just R².

How is sinusoidal curve fitting different from polynomial fitting?

Polynomial fits (linear, quadratic, cubic) are suited for data that increases, decreases, or has a single arch shape over the measured range. Sinusoidal fits are appropriate when the data oscillates — repeating peaks and troughs — such as temperature over a day, tidal heights, or AC voltage. The sinusoidal model fits amplitude, frequency, and phase. It would be inappropriate to apply a sinusoidal fit to data that simply increases monotonically, and vice versa — model choice should always reflect the physical process generating the data.
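Unlike polynomial fitting, a sinusoidal fit is nonlinear in its parameters, so it is typically solved iteratively from an initial guess. A minimal sketch using SciPy's `curve_fit` (SciPy is an assumption here — the simulation does not specify its solver), with invented oscillating data:

```python
import numpy as np
from scipy.optimize import curve_fit

rng = np.random.default_rng(4)

# Invented periodic data: amplitude 3, angular frequency 1, phase 0.5
x = np.linspace(0, 4 * np.pi, 40)
y = 3.0 * np.sin(1.0 * x + 0.5) + rng.normal(0, 0.3, x.size)

def sine_model(x, A, w, phi):
    """Sinusoidal model: amplitude, angular frequency, phase."""
    return A * np.sin(w * x + phi)

# Nonlinear fits need a starting guess near plausible values,
# or the solver can settle into the wrong frequency.
(A, w, phi), _ = curve_fit(sine_model, x, y, p0=[2.5, 0.9, 0.0])
```

The need for a reasonable `p0` is itself a lesson: a polynomial fit has a unique closed-form least-squares solution, while a sinusoidal fit can converge to a local minimum at the wrong frequency if started too far away.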