The article is silly because PCA cannot select features. It is all about dimensionality reduction. You should think of PCA as the equivalent of VAEs from the neural network world. The idea would be something like this: you have big images (let's say 4k) and they are too expensive to train with or store forever. So, you collect a training set, train a PCA on these images, and then you can convert your 4k images to 720p or even to 10 numbers, which you then use to predict/train whatever you want. Of course, we have algorithms that scale images, but maybe all your images are of cats and there is a specific linear transformation that preserves more of the information in the 4k image than simple scaling does. The implicit point here is that you are still collecting 4k images but immediately compressing them down using your trained PCA transformation.
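A minimal sketch of that compression idea (scaled way down so it actually runs; the random array is just a stand-in for a real training set of flattened cat images, and the sizes are made up):

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.random((500, 64 * 64))        # 500 flattened "images", 4096 pixels each

pca = PCA(n_components=10).fit(X)     # learn the linear transformation once, offline

codes = pca.transform(X)              # shape (500, 10): each image is now 10 numbers
approx = pca.inverse_transform(codes) # best linear reconstruction from those 10 numbers
```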
So, although you have fewer numbers than before, you still need to collect the original data. A real feature selection process would be able to say something like: "the proximity of the closest Applebees is not important for predicting house prices, you should probably stop wasting your time calculating this number". As others have mentioned, L1-regularized regression (lasso) or some statistical procedure to identify useless features is typically how this is done. I would also add that domain knowledge is probably your #1 feature selection tool, because we have to restrict which variables we feed in in the first place, and deciding which data to prioritize is inherently selecting features.
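For example, here is a hedged sketch of the lasso route on a made-up housing dataset where the Applebees distance is irrelevant by construction (the feature names, scales, and alpha are all hypothetical):

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n = 200
sqft = rng.normal(1500, 400, n)
age = rng.normal(30, 10, n)
applebees_dist = rng.normal(5, 2, n)                      # irrelevant by construction
price = 200 * sqft - 1000 * age + rng.normal(0, 5000, n)  # price ignores applebees_dist

X = np.column_stack([sqft, age, applebees_dist])
X = (X - X.mean(axis=0)) / X.std(axis=0)                  # put features on a common scale
model = Lasso(alpha=1000.0).fit(X, price)

# The coefficient on applebees_dist should be driven to (near) zero,
# telling you that you can stop collecting that number.
print(dict(zip(["sqft", "age", "applebees_dist"], model.coef_)))
```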
Dimensionality reduction is compressing data in a way that retains the most important information for the task.
Feature selection is removing unimportant information (keeping/collecting, or selecting, only the important parts).
Both cut down on the amount of data you end up with, but one does it by finding a smaller representation, while the other does it by discarding unnecessary data (or, rather, telling you which data is necessary, so you can stop collecting the unnecessary data).
But still, if some inputs are redundant, shouldn't this be somehow apparent in the eigenvectors/eigenvalues of the covariance matrix (making PCA an indirect feature selection algorithm)?
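A quick sketch of that intuition on synthetic data: a perfectly redundant column does show up as a (near-)zero eigenvalue of the covariance matrix, but the corresponding eigenvector mixes the original columns rather than pointing at a single feature you could stop collecting:

```python
import numpy as np

rng = np.random.default_rng(0)
a = rng.normal(size=500)
b = rng.normal(size=500)
c = 2 * a + 3 * b                       # redundant: an exact linear combination of the others

X = np.column_stack([a, b, c])
cov = np.cov(X, rowvar=False)
eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order

print(eigvals)        # the smallest is ~0, flagging that X is effectively rank 2
print(eigvecs[:, 0])  # ...but that direction involves all three columns, not just c
```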
Indeed, it's predictable and disappointing how the discussion devolved into pedantry. It should have been obvious what the author meant (plus it is clarified at the very beginning of the article). I'm not sure if this is an ML practitioner vs. statistician thing or what.