Chains a label estimator onto the end of this pipeline, producing a new pipeline.
Chains a label estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
The labels to use when fitting the LabelEstimator. Must be zippable with the training data.
Chains a label estimator onto the end of this pipeline, producing a new pipeline.
Chains a label estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
The labels to use when fitting the LabelEstimator. Must be zippable with the training data.
Chains a label estimator onto the end of this pipeline, producing a new pipeline.
Chains a label estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
The labels to use when fitting the LabelEstimator. Must be zippable with the training data.
Chains a label estimator onto the end of this pipeline, producing a new pipeline.
Chains a label estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
The labels to use when fitting the LabelEstimator. Must be zippable with the training data.
Chains an estimator onto the end of this pipeline, producing a new pipeline.
Chains an estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
Chains an estimator onto the end of this pipeline, producing a new pipeline.
Chains an estimator onto the end of this pipeline, producing a new pipeline. If this pipeline has already been executed, it will not need to be fit again.
The estimator to chain onto the end of this pipeline
The training data to use (the estimator will be fit on the result of passing this data through the current pipeline)
Chains a pipeline onto the end of this one, producing a new pipeline.
Chains a pipeline onto the end of this one, producing a new pipeline. If either this pipeline or the following has already been executed, it will not need to be fit again.
the pipeline to chain
The application of this Transformer to a single input item.
The application of this Transformer to a single input item. This method MUST be overridden by ML developers.
The input item to pass into this transformer
The output value
The application of this Transformer to an RDD of input items.
The application of this Transformer to an RDD of input items. This method may optionally be overridden by ML developers.
The bulk RDD input to pass into this transformer
The bulk RDD output for the given input
the weighting scheme to apply to the frequencies (defaults to identity)
A method that converts this object into a Pipeline.
A method that converts this object into a Pipeline. Must be implemented by anything that extends Chainable.
Transformer that maps a Seq[Any] of objects to a Seq[(Any, Double)] of (unique object, weighting_scheme(tf)), where tf is the number of times the unique object appeared in the original Seq[Any], and the weighting_scheme is a lambda of Double => Double that defaults to the identity function.
As an example, the following would return a transformer that maps a Seq[Any] to all objects seen with the log of their count plus 1:
the weighting scheme to apply to the frequencies (defaults to identity)