Extracting, transforming and selecting features

Extraction: Extracting features from “raw” data

TF-IDF
Word2Vec
CountVectorizer
FeatureHasher

Transformation: Scaling, converting, or modifying features

Tokenizer
StopWordsRemover
n-gram
Binarizer
PCA
StringIndexer
IndexToString
OneHotEncoder (Deprecated since 2.3.0)
VectorIndexer
Interaction
Normalizer
StandardScaler
MinMaxScaler
MaxAbsScaler
Bucketizer
ElementwiseProduct
SQLTransformer
VectorAssembler
VectorSizeHint
Imputer

Selection: Selecting a subset from a larger set of features

VectorSlicer
ChiSqSelector
RFormula