public class StringIndexerModel extends Model<StringIndexerModel> implements MLWritable
StringIndexer.
NOTE: During transformation, if the input column does not exist,
StringIndexerModel.transform would return the input dataset unmodified.
This is a temporary fix for the case when target labels do not exist during prediction.
param: labels Ordered list of labels, corresponding to indices to be assigned.
| Constructor and Description |
|---|
StringIndexerModel(String[] labels) |
StringIndexerModel(String uid,
String[] labels) |
| Modifier and Type | Method and Description |
|---|---|
StringIndexerModel |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
String |
getHandleInvalid() |
String |
getInputCol() |
String |
getOutputCol() |
Param<String> |
handleInvalid()
Param for how to handle invalid entries.
|
Param<String> |
inputCol()
Param for input column name.
|
String[] |
labels() |
static StringIndexerModel |
load(String path) |
Param<String> |
outputCol()
Param for output column name.
|
static MLReader<StringIndexerModel> |
read() |
StringIndexerModel |
setHandleInvalid(String value) |
StringIndexerModel |
setInputCol(String value) |
StringIndexerModel |
setOutputCol(String value) |
DataFrame |
transform(DataFrame dataset)
Transforms the input dataset.
|
StructType |
transformSchema(StructType schema)
:: DeveloperApi ::
|
String |
uid()
An immutable unique ID for the object and its derivatives.
|
StructType |
validateAndTransformSchema(StructType schema)
Validates and transforms the input schema.
|
org.apache.spark.ml.feature.StringIndexerModel.StringIndexModelWriter |
write()
Returns an
MLWriter instance for this ML instance. |
transform, transform, transformequals, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitclear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn, validateParamstoStringsaveinitializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarningpublic StringIndexerModel(String uid,
String[] labels)
public StringIndexerModel(String[] labels)
public static MLReader<StringIndexerModel> read()
public static StringIndexerModel load(String path)
public String uid()
Identifiableuid in interface Identifiablepublic String[] labels()
public StringIndexerModel setHandleInvalid(String value)
public StringIndexerModel setInputCol(String value)
public StringIndexerModel setOutputCol(String value)
public DataFrame transform(DataFrame dataset)
Transformertransform in class Transformerdataset - (undocumented)public StructType transformSchema(StructType schema)
PipelineStageDerives the output schema from the input schema.
transformSchema in class PipelineStageschema - (undocumented)public StringIndexerModel copy(ParamMap extra)
Paramscopy in interface Paramscopy in class Model<StringIndexerModel>extra - (undocumented)defaultCopy()public org.apache.spark.ml.feature.StringIndexerModel.StringIndexModelWriter write()
MLWritableMLWriter instance for this ML instance.write in interface MLWritablepublic StructType validateAndTransformSchema(StructType schema)
public Param<String> inputCol()
public String getInputCol()
public Param<String> outputCol()
public String getOutputCol()
public Param<String> handleInvalid()
public String getHandleInvalid()