Preprocessing and Feature Engineering Steps for Modeling


Documentation for package ‘recipes’ version 1.0.6

Help Pages

-- A --

add_check Add a New Operation to the Current Recipe
add_role Manually Alter Roles
add_step Add a New Operation to the Current Recipe
all_date Role Selection
all_datetime Role Selection
all_datetime_predictors Role Selection
all_date_predictors Role Selection
all_double Role Selection
all_double_predictors Role Selection
all_factor Role Selection
all_factor_predictors Role Selection
all_integer Role Selection
all_integer_predictors Role Selection
all_logical Role Selection
all_logical_predictors Role Selection
all_nominal Role Selection
all_nominal_predictors Role Selection
all_numeric Role Selection
all_numeric_predictors Role Selection
all_ordered Role Selection
all_ordered_predictors Role Selection
all_outcomes Role Selection
all_predictors Role Selection
all_string Role Selection
all_string_predictors Role Selection
all_unordered Role Selection
all_unordered_predictors Role Selection
are_weights_used Helpers for steps with case weights
averages Helpers for steps with case weights

-- B --

bake Apply a trained preprocessing recipe
bake.recipe Apply a trained preprocessing recipe

-- C --

case-weight-helpers Helpers for steps with case weights
case_weights Using case weights with recipes
check_class Check Variable Class
check_cols Check if all Columns are Present
check_missing Check for Missing Values
check_new_values Check for New Values
check_range Check Range Consistency
correlations Helpers for steps with case weights
covariances Helpers for steps with case weights
current_info Role Selection

-- D --

denom_vars Ratio Variable Creation
detect_step Detect if a particular step or check is used in a recipe
discretize Discretize Numeric Variables
discretize.default Discretize Numeric Variables
discretize.numeric Discretize Numeric Variables
dummy_extract_names Naming Tools
dummy_names Naming Tools

-- F --

formula.recipe Create a Formula from a Prepared Recipe
fully_trained Check to see if a recipe is trained/prepared

-- G --

get_case_weights Helpers for steps with case weights

-- H --

has_role Role Selection
has_type Role Selection

-- I --

imp_vars Impute via bagged trees

-- J --

juice Extract transformed training set

-- M --

medians Helpers for steps with case weights

-- N --

names0 Naming Tools

-- P --

pca_wts Helpers for steps with case weights
predict.discretize Discretize Numeric Variables
prep Estimate a preprocessing recipe
prep.recipe Estimate a preprocessing recipe
prepper Wrapper function for preparing recipes within resampling
print.recipe Print a Recipe

-- R --

recipe Create a recipe for preprocessing data
recipe.data.frame Create a recipe for preprocessing data
recipe.default Create a recipe for preprocessing data
recipe.formula Create a recipe for preprocessing data
recipe.matrix Create a recipe for preprocessing data
recipes_eval_select Evaluate a selection with tidyselect semantics specific to recipes
recipes_extension_check Checks that steps have all S3 methods
remove_role Manually Alter Roles
roles Manually Alter Roles

-- S --

selection Methods for selecting variables in step functions
selections Methods for selecting variables in step functions
step_arrange Sort rows using dplyr
step_bagimpute Impute via bagged trees
step_bin2factor Create a Factor from a Dummy Variable
step_BoxCox Box-Cox Transformation for Non-Negative Data
step_bs B-Spline Basis Functions
step_center Centering numeric data
step_classdist Distances to Class Centroids
step_corr High Correlation Filter
step_count Create Counts of Patterns using Regular Expressions
step_cut Cut a numeric variable into a factor
step_date Date Feature Generator
step_depth Data Depths
step_discretize Discretize Numeric Variables
step_dummy Create traditional dummy variables
step_dummy_extract Extract patterns from nominal data
step_dummy_multi_choice Handle levels in multiple predictors together
step_factor2string Convert Factors to Strings
step_filter Filter rows using dplyr
step_filter_missing Missing Value Column Filter
step_geodist Distance between two locations
step_harmonic Add sin and cos terms for harmonic analysis
step_holiday Holiday Feature Generator
step_hyperbolic Hyperbolic Transformations
step_ica ICA Signal Extraction
step_impute_bag Impute via bagged trees
step_impute_knn Impute via k-nearest neighbors
step_impute_linear Impute numeric variables via a linear model
step_impute_lower Impute numeric data below the threshold of measurement
step_impute_mean Impute numeric data using the mean
step_impute_median Impute numeric data using the median
step_impute_mode Impute nominal data using the most common value
step_impute_roll Impute numeric data using a rolling window statistic
step_indicate_na Create Missing Data Column Indicators
step_integer Convert values to predefined integers
step_interact Create Interaction Variables
step_intercept Add intercept (or constant) column
step_inverse Inverse Transformation
step_invlogit Inverse Logit Transformation
step_isomap Isomap Embedding
step_knnimpute Impute via k-nearest neighbors
step_kpca Kernel PCA Signal Extraction
step_kpca_poly Polynomial Kernel PCA Signal Extraction
step_kpca_rbf Radial Basis Function Kernel PCA Signal Extraction
step_lag Create a lagged predictor
step_lincomb Linear Combination Filter
step_log Logarithmic Transformation
step_logit Logit Transformation
step_lowerimpute Impute numeric data below the threshold of measurement
step_meanimpute Impute numeric data using the mean
step_medianimpute Impute numeric data using the median
step_modeimpute Impute nominal data using the most common value
step_mutate Add new variables using dplyr
step_mutate_at Mutate multiple columns using dplyr
step_naomit Remove observations with missing values
step_nnmf Non-Negative Matrix Factorization Signal Extraction
step_nnmf_sparse Non-Negative Matrix Factorization Signal Extraction with lasso Penalization
step_normalize Center and scale numeric data
step_novel Simple Value Assignments for Novel Factor Levels
step_ns Natural Spline Basis Functions
step_num2factor Convert Numbers to Factors
step_nzv Near-Zero Variance Filter
step_ordinalscore Convert Ordinal Factors to Numeric Scores
step_other Collapse Some Categorical Levels
step_pca PCA Signal Extraction
step_percentile Percentile Transformation
step_pls Partial Least Squares Feature Extraction
step_poly Orthogonal Polynomial Basis Functions
step_poly_bernstein Generalized Bernstein Polynomial Basis
step_profile Create a Profiling Version of a Data Set
step_range Scaling Numeric Data to a Specific Range
step_ratio Ratio Variable Creation
step_regex Detect a regular expression
step_relevel Relevel factors to a desired level
step_relu Apply (Smoothed) Rectified Linear Transformation
step_rename Rename variables by name using dplyr
step_rename_at Rename multiple columns using dplyr
step_rm General Variable Filter
step_rollimpute Impute numeric data using a rolling window statistic
step_sample Sample rows using dplyr
step_scale Scaling Numeric Data
step_select Select variables using dplyr
step_shuffle Shuffle Variables
step_slice Filter rows by position using dplyr
step_spatialsign Spatial Sign Preprocessing
step_spline_b Basis Splines
step_spline_convex Convex Splines
step_spline_monotone Monotone Splines
step_spline_natural Natural Splines
step_spline_nonnegative Non-Negative Splines
step_sqrt Square Root Transformation
step_string2factor Convert Strings to Factors
step_time Time Feature Generator
step_unknown Assign missing categories to "unknown"
step_unorder Convert Ordered Factors to Unordered Factors
step_window Moving Window Functions
step_YeoJohnson Yeo-Johnson Transformation
step_zv Zero Variance Filter
summary.recipe Summarize a recipe

-- T --

tidy.check Tidy the Result of a Recipe
tidy.check_class Tidy the Result of a Recipe
tidy.check_cols Tidy the Result of a Recipe
tidy.check_missing Tidy the Result of a Recipe
tidy.check_new_values Tidy the Result of a Recipe
tidy.check_range Tidy the Result of a Recipe
tidy.recipe Tidy the Result of a Recipe
tidy.step Tidy the Result of a Recipe
tidy.step_arrange Tidy the Result of a Recipe
tidy.step_bin2factor Tidy the Result of a Recipe
tidy.step_BoxCox Tidy the Result of a Recipe
tidy.step_bs Tidy the Result of a Recipe
tidy.step_center Tidy the Result of a Recipe
tidy.step_classdist Tidy the Result of a Recipe
tidy.step_corr Tidy the Result of a Recipe
tidy.step_count Tidy the Result of a Recipe
tidy.step_cut Tidy the Result of a Recipe
tidy.step_date Tidy the Result of a Recipe
tidy.step_depth Tidy the Result of a Recipe
tidy.step_discretize Tidy the Result of a Recipe
tidy.step_dummy Tidy the Result of a Recipe
tidy.step_dummy_extract Tidy the Result of a Recipe
tidy.step_dummy_multi_choice Tidy the Result of a Recipe
tidy.step_factor2string Tidy the Result of a Recipe
tidy.step_filter Tidy the Result of a Recipe
tidy.step_filter_missing Tidy the Result of a Recipe
tidy.step_geodist Tidy the Result of a Recipe
tidy.step_harmonic Tidy the Result of a Recipe
tidy.step_holiday Tidy the Result of a Recipe
tidy.step_hyperbolic Tidy the Result of a Recipe
tidy.step_ica Tidy the Result of a Recipe
tidy.step_impute_bag Tidy the Result of a Recipe
tidy.step_impute_knn Tidy the Result of a Recipe
tidy.step_impute_linear Tidy the Result of a Recipe
tidy.step_impute_lower Tidy the Result of a Recipe
tidy.step_impute_mean Tidy the Result of a Recipe
tidy.step_impute_median Tidy the Result of a Recipe
tidy.step_impute_mode Tidy the Result of a Recipe
tidy.step_impute_roll Tidy the Result of a Recipe
tidy.step_indicate_na Tidy the Result of a Recipe
tidy.step_integer Tidy the Result of a Recipe
tidy.step_interact Tidy the Result of a Recipe
tidy.step_intercept Tidy the Result of a Recipe
tidy.step_inverse Tidy the Result of a Recipe
tidy.step_invlogit Tidy the Result of a Recipe
tidy.step_isomap Tidy the Result of a Recipe
tidy.step_kpca Tidy the Result of a Recipe
tidy.step_kpca_poly Tidy the Result of a Recipe
tidy.step_kpca_rbf Tidy the Result of a Recipe
tidy.step_lag Tidy the Result of a Recipe
tidy.step_lincomb Tidy the Result of a Recipe
tidy.step_log Tidy the Result of a Recipe
tidy.step_logit Tidy the Result of a Recipe
tidy.step_mutate Tidy the Result of a Recipe
tidy.step_mutate_at Tidy the Result of a Recipe
tidy.step_naomit Tidy the Result of a Recipe
tidy.step_nnmf Tidy the Result of a Recipe
tidy.step_nnmf_sparse Tidy the Result of a Recipe
tidy.step_normalize Tidy the Result of a Recipe
tidy.step_novel Tidy the Result of a Recipe
tidy.step_ns Tidy the Result of a Recipe
tidy.step_num2factor Tidy the Result of a Recipe
tidy.step_nzv Tidy the Result of a Recipe
tidy.step_ordinalscore Tidy the Result of a Recipe
tidy.step_other Tidy the Result of a Recipe
tidy.step_pca Tidy the Result of a Recipe
tidy.step_percentile Tidy the Result of a Recipe
tidy.step_pls Tidy the Result of a Recipe
tidy.step_poly Tidy the Result of a Recipe
tidy.step_poly_bernstein Tidy the Result of a Recipe
tidy.step_profile Tidy the Result of a Recipe
tidy.step_range Tidy the Result of a Recipe
tidy.step_ratio Tidy the Result of a Recipe
tidy.step_regex Tidy the Result of a Recipe
tidy.step_relevel Tidy the Result of a Recipe
tidy.step_relu Tidy the Result of a Recipe
tidy.step_rename Tidy the Result of a Recipe
tidy.step_rename_at Tidy the Result of a Recipe
tidy.step_rm Tidy the Result of a Recipe
tidy.step_sample Tidy the Result of a Recipe
tidy.step_scale Tidy the Result of a Recipe
tidy.step_select Tidy the Result of a Recipe
tidy.step_shuffle Tidy the Result of a Recipe
tidy.step_slice Tidy the Result of a Recipe
tidy.step_spatialsign Tidy the Result of a Recipe
tidy.step_spline_b Tidy the Result of a Recipe
tidy.step_spline_convex Tidy the Result of a Recipe
tidy.step_spline_monotone Tidy the Result of a Recipe
tidy.step_spline_natural Tidy the Result of a Recipe
tidy.step_spline_nonnegative Tidy the Result of a Recipe
tidy.step_sqrt Tidy the Result of a Recipe
tidy.step_string2factor Tidy the Result of a Recipe
tidy.step_time Tidy the Result of a Recipe
tidy.step_unknown Tidy the Result of a Recipe
tidy.step_unorder Tidy the Result of a Recipe
tidy.step_window Tidy the Result of a Recipe
tidy.step_YeoJohnson Tidy the Result of a Recipe
tidy.step_zv Tidy the Result of a Recipe

-- U --

update.step Update a recipe step
update_role Manually Alter Roles
update_role_requirements Update role specific requirements

-- V --

variances Helpers for steps with case weights

-- misc --

.get_data_types Get types for use in recipes
.get_data_types.character Get types for use in recipes
.get_data_types.Date Get types for use in recipes
.get_data_types.default Get types for use in recipes
.get_data_types.double Get types for use in recipes
.get_data_types.factor Get types for use in recipes
.get_data_types.hardhat_case_weights Get types for use in recipes
.get_data_types.integer Get types for use in recipes
.get_data_types.list Get types for use in recipes
.get_data_types.logical Get types for use in recipes
.get_data_types.numeric Get types for use in recipes
.get_data_types.ordered Get types for use in recipes
.get_data_types.POSIXct Get types for use in recipes
.get_data_types.Surv Get types for use in recipes
.get_data_types.textrecipes_tokenlist Get types for use in recipes