Skip to content

Chunk array of objects by their size in JSON

License

Notifications You must be signed in to change notification settings

shelfio/array-chunk-by-size

Repository files navigation

array-chunk-by-size CircleCI

Chunk array of objects by their size in JSON

Install

$ yarn add @shelf/array-chunk-by-size

Usage

Useful if you want to split large array into smaller, but limited by JSON size.

Each array chunk will be up to specified amount of bytes when stringified into JSON.

import {chunkArray} from '@shelf/array-chunk-by-size';

const bigArray = [{a: 1}, {b: 2}, {c: 3}];
const twoKilobytes = 2 * 1024;

const smallerArrays = chunkArray({input: bigArray, bytesSize: twoKilobytes});
// => [ [ ... ], [ ... ] ] and so on

Alternatively, you might pass a custom size calculation function. For example, to chunk array by LLM tokens size:

import {chunkArray} from '@shelf/array-chunk-by-size';
import {encode} from 'gpt-3-encoder';

const bigArray = ['msg-1', 'msg-2'];
const gpt3MaxTokens = 4000;

const smallerArrays = chunkArray({
  input: bigArray,
  bytesSize: gpt3MaxTokens,
  sizeCalcFunction: item => encode(item).length,
});

See Also

Publish

$ git checkout master
$ yarn version
$ yarn publish
$ git push origin master --tags

License

MIT © Shelf