public class Tokenizer extends UnaryTransformer<java.lang.String,scala.collection.Seq<java.lang.String>,Tokenizer>
See Also: RegexTokenizer, Serialized Form

| Constructor and Description |
|---|
| Tokenizer() |
| Tokenizer(java.lang.String uid) |

| Modifier and Type | Method and Description |
|---|---|
| Tokenizer | copy(ParamMap extra): Creates a copy of this instance with the same UID and some extra params. |
| protected scala.Function1<java.lang.String,scala.collection.Seq<java.lang.String>> | createTransformFunc(): Creates the transform function using the given param map. |
| protected DataType | outputDataType(): Returns the data type of the output column. |
| java.lang.String | uid(): An immutable unique ID for the object and its derivatives. |
| protected void | validateInputType(DataType inputType): Validates the input type. |

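The protected hooks in the method summary (createTransformFunc, validateInputType) follow a template-method shape: the base class drives the transformation and the subclass supplies the per-value function. The sketch below mirrors that shape in plain Java with no Spark dependency. `SketchUnaryTransformer` and `SketchTokenizer` are hypothetical stand-ins, not Spark classes, and a `Class<?>` argument stands in for Spark's `DataType`. The lowercase-then-split-on-whitespace function reflects how Spark's documentation describes Tokenizer; this listing itself does not state it, so treat it as an assumption.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.function.Function;

// Hypothetical stand-in for UnaryTransformer<IN, OUT, T>; not a Spark class.
abstract class SketchUnaryTransformer<IN, OUT> {
    // Subclasses supply the per-value function, analogous to createTransformFunc().
    protected abstract Function<IN, OUT> createTransformFunc();

    // Subclasses may reject unsupported inputs, analogous to validateInputType(DataType).
    protected void validateInputType(Class<?> inputType) {
        // Accepts every type by default.
    }

    // Validates the input type, then applies the function to each value of the column.
    public List<OUT> transform(List<IN> column, Class<?> inputType) {
        validateInputType(inputType);
        Function<IN, OUT> f = createTransformFunc();
        List<OUT> out = new ArrayList<>();
        for (IN value : column) {
            out.add(f.apply(value));
        }
        return out;
    }
}

// Hypothetical tokenizer: lowercases the input, then splits on single whitespace characters.
final class SketchTokenizer extends SketchUnaryTransformer<String, List<String>> {
    @Override
    protected Function<String, List<String>> createTransformFunc() {
        return s -> Arrays.asList(s.toLowerCase(Locale.ROOT).split("\\s"));
    }

    @Override
    protected void validateInputType(Class<?> inputType) {
        if (inputType != String.class) {
            throw new IllegalArgumentException(
                "Input type must be String but got " + inputType.getName());
        }
    }
}
```

In this sketch, `new SketchTokenizer().transform(Arrays.asList("Hello Spark ML"), String.class)` returns one list holding `hello`, `spark`, `ml`. Because the split pattern matches a single whitespace character, consecutive spaces in the input produce empty tokens.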
Methods inherited from class UnaryTransformer
setInputCol, setOutputCol, transform, transformSchema

Methods inherited from class Transformer
transform, transform, transform

Methods inherited from class PipelineStage
transformSchema

Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Methods inherited from interface Logging
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning

Methods inherited from interface Params
clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn, validateParams

Methods inherited from interface Identifiable
toString

Method Detail

public java.lang.String uid()
Description copied from interface: Identifiable

protected scala.Function1<java.lang.String,scala.collection.Seq<java.lang.String>> createTransformFunc()
Description copied from class: UnaryTransformer
createTransformFunc in class UnaryTransformer<java.lang.String,scala.collection.Seq<java.lang.String>,Tokenizer>

protected void validateInputType(DataType inputType)
Description copied from class: UnaryTransformer
validateInputType in class UnaryTransformer<java.lang.String,scala.collection.Seq<java.lang.String>,Tokenizer>
Parameters: inputType - (undocumented)

protected DataType outputDataType()
Description copied from class: UnaryTransformer
outputDataType in class UnaryTransformer<java.lang.String,scala.collection.Seq<java.lang.String>,Tokenizer>

public Tokenizer copy(ParamMap extra)
Description copied from interface: Params
copy in interface Params
copy in class UnaryTransformer<java.lang.String,scala.collection.Seq<java.lang.String>,Tokenizer>
Parameters: extra - (undocumented)
See Also: defaultCopy()