MARK C. answered 11/07/25
Dual Master's Degrees · 20+ Years Experience · Math · Computer Science
Multicollinearity causes redundant features to dominate distance calculations. If correlated features are weighted equally, they effectively get counted multiple times, distorting the true similarity between data points.
Result: Clusters group primarily by these correlated physical characteristics rather than the illness symptoms you actually care about. The redundant features drown out the important but less-correlated symptom patterns.