Fit regression model on
βs
using least squares (PLS)
Hope that lower dimensions means less
overfitting
PCA
-
First
PC
Projection
onto
first
PC
Drawing
points
in
PC
space
What
will
you
learn
from
this
lecture?
Why do you want to use the PCs in regression models?
What
assumptions
do
you
have
to
make
for
it
to
be
a
good
idea
to
use
principal
component
regression
(PCR)?
Or
conversely,
what
is
a
typical
bad
scenario
to
use
PCR?
How do you implement PCR in Python?
How do you interpret the model coefficients when using PCR?
How do you choose the number of PC to use in PCR?
Given
a
figure
of,
e.g.,
cross-validation
score
as
a
function
of
the
number
of
PCs,
you
should
be
able
to
choose
appropriately
and
provide
rationales
in
terms
of
bias-variance
tradeoff.
You
should
be
able
to
generate
such
figures
in
Python.
What is the relationship/differences between PCR and feature selection and regularization
methods that you learned in this part of the course?