Nodestat

A Node JS package for data wrangling and analysis

Project Goals

Both Javascript and Node already have a variety of packages that deal with dataframes and statistical computation. The goal with Nodestat is to create a unified grammar of data analysis that can transform Javascript into a fully capable language for data analysis. Specific design goals are as follows:

An Intuitive Grammar. Even without a third party package, Javascript already contains some functionality for handling data using Javascript Objects and JSON. In fact, many of the base functions that Nodestat implements through its Dataframe module are hardly novel and can be written by any experienced Javascript developer. With Nodestat, however, the aim is to create a grammar for data analysis that is intuitive and efficient, allowing the same split-apply-combine strategy for data wrangling that the plyr/dplyr package provides for the R language, and that pandas provides for the Python language.
Close Integration with Statistical Packages. The Dataframe module isn't just another package for handling data in Javascript. It's designed to play well with packages for data analysis and statistical computation. The Stats module demonstrates this capability. In the future, hopefully additional modules and independent packages will be developed following the Dataframe grammar for more advanced data analysis and machine learning.

If you are interested in contributing to this project, please see our contribution guidelines for more information.

Usage

Install

Install using NPM as

$ npm install @dominicdayta/nodestat

API Documentation

The package currently contains two primary modules:

stat: Contains basic statistical formulas and tests.

df: Contains useful functions for creating and managing dataframes.

const nstat = require('@dominicdayta/nodestat');

let stats = nstat.stat; // for shorthand

// initiate the titanic dataset
let titanic = stats.dataset("Titanic");

// get the subset containing only survivors
let titanicSurvivors = titanic.subset(col = "Survived", 
    function(x){
        return(x == "Yes")
    }
);
console.log(titanicSurvivors.data);

// aggregate the total number of survivors by sex and class
let freqSurvivedBySexClass = titanic
    .select(["Class","Sex","Freq"])
    .aggregate(by = ["Class","Sex"], stats.sum);
console.log(freqSurvivedBySexClass.data);

// aggregate the total number of non-survivors by sex and age
let freqDiedSexAge = titanic
    .subset(col = "Survived", function(x){return(x == "No")})
    .select(["Sex","Age","Freq"])
    .aggregate(by = ["Sex","Age"], stats.sum)
    .data;
console.log(freqDiedSexAge);

You can look into some sample use cases in the ./demo directory. For full documentation on how to use the API, please look into the ./docs directory.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
data		data
dataframe		dataframe
demo		demo
docs		docs
stats		stats
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
contributing.md		contributing.md
index.js		index.js
package.json		package.json
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nodestat

Project Goals

Usage

Install

API Documentation

License

About

Releases

Packages

Languages

License

dominicdayta/nodestat

Folders and files

Latest commit

History

Repository files navigation

Nodestat

Project Goals

Usage

Install

API Documentation

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages