The type-safe method that ML developers need to implement when writing new Estimators.
The type-safe method that ML developers need to implement when writing new Estimators.
The estimator's training data.
A new transformer
This method merges two seqs and keeps the top numFeatures
The number of features to keep
Returns the first K elements from the input as defined by the specified implicit Ordering[T] and maintains the ordering.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
The training data
A pipeline that fits this estimator and applies the result to inputs.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
Constructs a pipeline that fits this estimator to training data, then applies the resultant transformer to the Pipeline input.
The training data
A pipeline that fits this estimator and applies the result to inputs.
An Estimator that chooses the most frequently observed sparse features when training, and produces a transformer which builds a sparse vector out of them
Deterministically orders the feature mappings first by decreasing number of appearances, then by earliest appearance in the RDD
The number of features to keep