Question

I apologize in advance for the rather abstract nature of my question, but it is indirectly a question about programming algorithms, and I don't think I'll be the only programmer to wonder about this.

This is about the implementation of the multi-variable ordinary least squares (OLS) regression algorithm in Octave (and, I assume, in MATLAB as well). As far as I can tell, if one feeds two variables into a linear regression with just a single measurement, the result (i.e. the coefficients) should be mathematically underdetermined: unless you accept black magic as a valid premise, how could one possibly tell in which way each of the variables affects the final result? In the more general case, the number of measurements must (I think) be at least equal to the number of variables for the resulting coefficients to make any sense (leaving aside statistical errors and all that).

Octave, however, is all too happy to compute a result, with no warnings whatsoever:

octave:1> ols([1], [1, 1])
ans =

   0.50000
   0.50000

In other words -- if I got this right -- given the equation 1 = x + y, Octave joyfully concludes that x = y = 0.5.

As such, assuming (as I am) that Octave has no direct connection to Satan, here are my questions:

  1. Am I misunderstanding the mathematical foundation? In other words, is this possibly a legitimate result?
  2. If I'm right, why isn't Octave spitting out an error -- or, at the very least, quite a stern warning regarding the totally moronic data I'm asking it to analyze?

Solution

Take a look at this Octave documentation:

http://www.gnu.org/software/octave/doc/interpreter/Linear-Least-Squares.html

In the description of the output beta, it says that when the matrix is not of full rank (as is the case for your matrix [1, 1]), the value will be the pseudo-inverse of x times y. And indeed, [0.5; 0.5] is the pseudo-inverse of [1, 1].
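For example, you can reproduce the value directly with pinv (a quick sketch; the variable names just mirror the question):

x = [1, 1];      % design matrix: one observation, two regressors
y = 1;           % the single measurement

rank(x)          % 1 -- fewer than the 2 columns, so x is rank-deficient
pinv(x) * y      % [0.5; 0.5], the same coefficients ols(y, x) returns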

Hope that helps!

OTHER TIPS

Your system simply isn't full rank. According to the documentation, ols solves such a system as

b = pinv(x)*y

or, in your case, simply

b = pinv([1 1])

b =

    0.5000
    0.5000

where pinv is the Moore–Penrose pseudoinverse.
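For what it's worth, any pair with x + y = 1 fits the single measurement exactly; among those infinitely many solutions, the pseudoinverse picks the one with the smallest norm, which is why you get 0.5 and 0.5 rather than, say, 1 and 0. A quick check (illustrative values only):

b = pinv([1 1]) * 1;    % minimum-norm solution: [0.5; 0.5]
norm(b)                 % sqrt(0.5), about 0.7071
norm([1; 0])            % 1 -- also satisfies x + y = 1, but with a larger norm
norm([0.3; 0.7])        % about 0.7616 -- another valid fit, again a larger norm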

Ordinary least squares regression is expressed as:

Ax = y

which is usually solved directly using the pseudo-inverse:

x = inv(A'*A)*A'*y

or

x = pinv(A) * y
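On a small, made-up full-rank example the two forms agree (and match Octave's backslash operator, which is the usual idiom in practice):

A = [1 1; 1 2; 1 3; 1 4; 1 5];   % made-up design: 5 measurements, 2 coefficients
y = [2.1; 3.9; 6.2; 8.1; 9.8];   % made-up responses

b1 = inv(A'*A) * (A'*y);   % normal equations
b2 = pinv(A) * y;          % pseudo-inverse
b3 = A \ y;                % mldivide: least-squares solution for a tall A

% b1, b2 and b3 agree up to rounding error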

In the case of a full-rank matrix A, we can perform a Cholesky decomposition: R = chol(A'*A), so that (A'*A) = R'*R. This can be used as:

Ax = y
A'Ax = A'y
R'Rx = A'y
Rx = R'\(A'y)
x = R\(R'\(A'y))

Note that in the last step, the backslash operator (mldivide) performs simple forward and back substitutions with the triangular factor R.

In fact, that's how Octave implements it: http://hg.octave.org/octave/file/tip/scripts/statistics/base/ols.m#l110
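A minimal sketch of that Cholesky-based solve (assuming A has full column rank; the data is just for illustration):

A = [1 1; 1 2; 1 3; 1 4];    % illustrative design matrix with full column rank
y = [1.1; 1.9; 3.2; 3.9];

R = chol(A' * A);            % upper triangular R with A'*A = R'*R
x = R \ (R' \ (A' * y));     % forward substitution with R', then back substitution with R

% x matches pinv(A) * y (and A \ y) up to rounding error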

There are other ways to solve the system, such as iterative methods.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow