The kctools
package includes a suite of helper functions I frequently use in my data wrangling and analysis.
This includes functions to quickly generate a data frame of summary statistics and tabulations, creating indicators for duplicate observations, functions for spatial analysis, among others. Many of the custom functions were created based on the dplyr
and tidyr
packages as well as their counterparts in tidytable
, a package created by Mark Fairbanks which provides a tidy interface to data.table
operations.
The package also contains tools for storing figures and regression tables in Excel and LaTeX files. These rely on the brilliant openxlsx
and modelsummary
packages.
You can install the latest version of kctools from my Github site with:
devtools::install_github("kennchua/kctools")
Check the pertinent function for use cases and examples.
library(kctools)
# Data exploration tools:
# kc_sum(): generate summary statistics
# kc_tab(): tabulate variables
# kc_dup(): create indicators for duplicated observations
# kc_cor(): create correlation matrix with lower triangular and diagonal elements only
# See also kcf_* counterparts for tidytable implementation
# Spatial analysis tools:
# kc_st_distance(): create a data frame with pairwise distances of two sf objects
# Tools for presenting results and figures:
# kc_regtex(): Ouput list of regression results in tex file
# kc_regxls(): Output list of regression results to Excel sheet
# kc_figxls(): Output list of figures to Excel sheets
# Others:
# kc_baltab(): create balance table with group means and sd as well as difference-in-means test and F-test