Skip to content

Diadochokinetic/data_preprocessing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 

Repository files navigation

data_preprocessing

This repository contains some useful modules to preprocess data for machine learning. There is a test file for each module to verify its functions.

table of contents

  1. columns_to_string

Machine learning often requires categorical data to be encoded into binary variables. Sometimes categorical data is represented by numeric values. Most encoders ignore numeric columns. This module converts numeric columns, storing categorical data, into dtype "string".

  1. outlier_imputer

Machine learning often requires numeric data to be free of missing values. To avoid dropping all rows with at least one numeric value missing, those values get imputed by an outlier.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published