public class StopWordsRemover extends Transformer
| Constructor and Description |
|---|
| StopWordsRemover() |
| StopWordsRemover(java.lang.String uid) |
| Modifier and Type | Method and Description |
|---|---|
| BooleanParam | caseSensitive(): whether to do a case-sensitive comparison over the stop words. Default: false |
| StopWordsRemover | copy(ParamMap extra): Creates a copy of this instance with the same UID and some extra params. |
| boolean | getCaseSensitive() |
| java.lang.String[] | getStopWords() |
| StopWordsRemover | setCaseSensitive(boolean value) |
| StopWordsRemover | setInputCol(java.lang.String value) |
| StopWordsRemover | setOutputCol(java.lang.String value) |
| StopWordsRemover | setStopWords(java.lang.String[] value) |
| StringArrayParam | stopWords(): the stop words set to be filtered out. Default: StopWords.English |
| DataFrame | transform(DataFrame dataset): Transforms the input dataset. |
| StructType | transformSchema(StructType schema): :: DeveloperApi :: |
| java.lang.String | uid(): An immutable unique ID for the object and its derivatives. |
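The effect of the `stopWords` and `caseSensitive` parameters can be illustrated without Spark. The sketch below is plain Java and is not Spark's implementation: the class `StopWordFilterSketch`, the method `removeStopWords`, and the choice of `Locale.ROOT` lower-casing for the case-insensitive comparison are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper (not part of Spark): mimics the documented parameters in plain Java.
final class StopWordFilterSketch {

    // Returns the tokens that are not stop words, preserving order and original casing.
    static List<String> removeStopWords(List<String> tokens, String[] stopWords, boolean caseSensitive) {
        Set<String> stopSet = new HashSet<>();
        for (String w : stopWords) {
            // Assumption: case-insensitive matching is done by lower-casing both sides.
            stopSet.add(caseSensitive ? w : w.toLowerCase(Locale.ROOT));
        }
        List<String> kept = new ArrayList<>();
        for (String t : tokens) {
            String key = caseSensitive ? t : t.toLowerCase(Locale.ROOT);
            if (!stopSet.contains(key)) {
                kept.add(t);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("I", "saw", "The", "red", "balloon");
        String[] stopWords = {"i", "the", "a"};
        // caseSensitive = false (the documented default): "I" and "The" match "i" and "the".
        System.out.println(removeStopWords(tokens, stopWords, false)); // [saw, red, balloon]
        // caseSensitive = true: only exact matches are removed, so nothing is dropped here.
        System.out.println(removeStopWords(tokens, stopWords, true));  // [I, saw, The, red, balloon]
    }
}
```

With the default `caseSensitive = false`, capitalised tokens are still filtered; switching it to `true` keeps any token whose casing differs from the stop word list.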
Methods inherited from class Transformer: transform, transform, transform
Methods inherited from class PipelineStage: transformSchema
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface Params: clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn, validateParams
Methods inherited from interface Identifiable: toString
Methods inherited from interface Logging: initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
public StopWordsRemover(java.lang.String uid)
public StopWordsRemover()
public java.lang.String uid()
Description copied from interface: Identifiable
An immutable unique ID for the object and its derivatives.
public StopWordsRemover setInputCol(java.lang.String value)
public StopWordsRemover setOutputCol(java.lang.String value)
public StringArrayParam stopWords()
the stop words set to be filtered out. Default: StopWords.English
public StopWordsRemover setStopWords(java.lang.String[] value)
public java.lang.String[] getStopWords()
public BooleanParam caseSensitive()
public StopWordsRemover setCaseSensitive(boolean value)
public boolean getCaseSensitive()
public DataFrame transform(DataFrame dataset)
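Description copied from class: Transformer
Transforms the input dataset.
Specified by: transform in class Transformer
Parameters: dataset - (undocumented)
public StructType transformSchema(StructType schema)
Description copied from class: PipelineStage
Derives the output schema from the input schema.
Specified by: transformSchema in class PipelineStage
Parameters: schema - (undocumented)
public StopWordsRemover copy(ParamMap extra)
Description copied from interface: Params
Creates a copy of this instance with the same UID and some extra params.
Specified by: copy in interface Params
Specified by: copy in class Transformer
Parameters: extra - (undocumented)
See Also: defaultCopy()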
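The copy contract described above (same UID, current values carried over, extra params applied on top) and the chainable setters can be sketched in plain Java. This is a simplified stand-in, not Spark's `Params` machinery: `ParamCopySketch`, its string-keyed map, and the `set`/`get` helpers are hypothetical names introduced only for this example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a params-carrying stage; not Spark's Params implementation.
final class ParamCopySketch {
    private final String uid;
    private final Map<String, Object> params = new HashMap<>();

    ParamCopySketch(String uid) {
        this.uid = uid;
    }

    String uid() {
        return uid;
    }

    // Fluent setter: returns this so calls can be chained, like setInputCol(...).setOutputCol(...).
    ParamCopySketch set(String name, Object value) {
        params.put(name, value);
        return this;
    }

    Object get(String name) {
        return params.get(name);
    }

    // Copy keeps the UID, carries over the current values, then applies the extras on top.
    ParamCopySketch copy(Map<String, Object> extra) {
        ParamCopySketch that = new ParamCopySketch(uid);
        that.params.putAll(params);
        that.params.putAll(extra);
        return that;
    }

    public static void main(String[] args) {
        ParamCopySketch original = new ParamCopySketch("stopWords_example")
                .set("inputCol", "raw")
                .set("caseSensitive", false);

        Map<String, Object> extra = new HashMap<>();
        extra.put("caseSensitive", true);
        ParamCopySketch copied = original.copy(extra);

        System.out.println(copied.uid().equals(original.uid())); // true
        System.out.println(copied.get("caseSensitive"));         // true
        System.out.println(original.get("caseSensitive"));       // false
    }
}
```

The original instance is left unchanged, so a copy can be configured differently without affecting the instance it came from.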