Question about PLS-DA hyperparameter tuning [R]
![Question about PLS-DA hyperparameter tuning [R]](/_next/image?url=https%3A%2F%2Fpreview.redd.it%2F701knkltbdzg1.png%3Fwidth%3D140%26height%3D86%26auto%3Dwebp%26s%3Dbe7986f3d4908ceaf4f9969b26a3eafd1012a3e5&w=3840&q=75)
| Hi all! I am a bioinformatician and I am working on learning some ML tools for some disease/biomarker stuff. I am working with sparse PLS-DA at the moment. Before actually tuning the model, I run on overall global model (without sparsity) to get an idea of what my data looks like and to get to a starting point. Here is what that global model ends up looking like: So from this, I'm seeing that I should include 2 latent components in my model tuning and I chose to use the centroids.dist. So I tune the model with two components, it gives me the # of features to keep on each component and then I run the final model. However, when I do performance assessment on the final model, it looks like this: I guess I am a little confused. From what I am reading online, and from my own data, error rates should go down with added components. It also doesn't make a ton of sense to me because I should have only picked the features that best distinguish two conditions, so again, I should be seeing error rates decrease. Can someone please help me understand what I'm seeing here and what could be causing this? I am still learning how all of this works, so amy sort of guidance is appreciated. Thank you! [link] [comments] |
Want to read more?
Check out the full article on the original site