Skip to content

Assignment 5

Key Takeaways

  • Problem 1(b): K-means clustering (and any algorithm that utilizes means) are sensitive to observations that are outliers. A fix to this is to instead compute the geometric median.

  • Problem 2(a-d): PC directions can also be interpreted as directions where the variance of the data is maximized.