Assignment 5
Key Takeaways
-
Problem 1(b): K-means clustering (like any algorithm that relies on means) is sensitive to outliers. One fix is to compute the geometric median of each cluster instead of its mean.
-
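Problem 2(a-d): The principal component (PC) directions can also be interpreted as the directions along which the variance of the projected data is maximized, with each successive direction orthogonal to the previous ones.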
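
Both takeaways can be checked numerically on toy data. The sketch below is illustrative only and is not taken from the assignment: the helper names (`mean_point`, `geometric_median`, `projected_variance`, `first_pc_2d`) and the data points are made up for this example. It approximates the geometric median with Weiszfeld's iteration, which is one common choice among several, and it handles only the 2D case for the PC direction.

```python
import math

def mean_point(points):
    """Coordinate-wise mean of a list of equal-length tuples."""
    n = len(points)
    return tuple(sum(c) / n for c in zip(*points))

def geometric_median(points, iters=200, eps=1e-12):
    """Approximate the point minimizing the sum of Euclidean distances
    to all points, using Weiszfeld's iteration (an assumed choice)."""
    y = mean_point(points)  # start from the mean
    for _ in range(iters):
        # Weight each point by the inverse of its distance to the estimate.
        weights = [1.0 / max(math.dist(p, y), eps) for p in points]
        total = sum(weights)
        y = tuple(sum(w * c for w, c in zip(weights, coords)) / total
                  for coords in zip(*points))
    return y

def projected_variance(points, direction):
    """Sample variance of the points projected onto a unit direction."""
    mu = mean_point(points)
    proj = [sum((c - m) * d for c, m, d in zip(p, mu, direction))
            for p in points]
    return sum(v * v for v in proj) / (len(points) - 1)

def first_pc_2d(points):
    """Leading eigenvector of the 2x2 sample covariance matrix."""
    mu = mean_point(points)
    n = len(points)
    sxx = sum((x - mu[0]) ** 2 for x, _ in points) / (n - 1)
    syy = sum((y - mu[1]) ** 2 for _, y in points) / (n - 1)
    sxy = sum((x - mu[0]) * (y - mu[1]) for x, y in points) / (n - 1)
    # Angle of the major axis of the covariance ellipse.
    theta = 0.5 * math.atan2(2 * sxy, sxx - syy)
    return (math.cos(theta), math.sin(theta))

# Takeaway 1: one outlier moves the mean far more than the geometric median.
cluster = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
with_outlier = cluster + [(100.0, 100.0)]
print("mean:            ", mean_point(with_outlier))
print("geometric median:", geometric_median(with_outlier))

# Takeaway 2: no direction on a 1-degree grid beats the first PC direction.
data = [(0.0, 0.0), (1.0, 1.1), (2.0, 1.9), (3.0, 3.2), (4.0, 3.9)]
pc1 = first_pc_2d(data)
grid = [(math.cos(a), math.sin(a))
        for a in (k * math.pi / 180 for k in range(180))]
best_on_grid = max(projected_variance(data, d) for d in grid)
print("PC1 maximizes variance:",
      projected_variance(data, pc1) >= best_on_grid - 1e-12)
```

With these made-up points, the mean is pulled out to (20.4, 20.4) by the single outlier, while the geometric median stays inside the unit square spanned by the original cluster. In practice a library routine (for example an eigendecomposition of the covariance matrix) would replace the 2D closed form used here.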