Dimensions to reduce input dataset to.
The type-safe method that ML developers need to implement when writing new Estimators.
The type-safe method that ML developers need to implement when writing new Estimators.
The estimator's training data.
A new transformer
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
The training data
A pipeline that fits this estimator and applies the result to inputs.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
The training data
A pipeline that fits this estimator and applies the result to inputs.
Estimates a PCA model for dimensionality reduction based on a sample of a larger input dataset. Treats each column of the input matrices like a separate DenseVector input to PCAEstimator or DistributedPCAEstimator.
Automatically decides between distributed and local implementations when node-level optimization is enabled. The default weights were determined empirically via results run on a 16 r3.4xlarge node cluster.
Dimensions to reduce input dataset to.