Compare feature extraction, dimensionality reduction, and classifier combinations
Reduces dimensions while preserving variance (90% or 95%).
📚 DocsProbabilistic classifier based on Bayes' theorem. Fast and works well with text data.
📚 DocsLinear model for classification. Simple, interpretable, and often the best baseline.
📚 DocsInstance-based learning. Classifies based on similarity to training examples.
📚 DocsReady-to-run Python code with all 240 pipeline combinations. Includes dataset download, feature extraction, dimensionality reduction, classification, and evaluation.
Interactive notebook to explore and compare all 240 pipelines.
Top 3 by Accuracy, Training Speed, and Inference Speed