groupby

Split up JSON data into multiple files based on shared characteristics. Groupby is a command-line utility but can also be used from node.js.

groupby staff.json 'staff/{department}.json'

In the example above, the resulting output will be one file for each department, each file containing an array of staff member objects for that department.

Installation

npm install groupby-cli -g

Usage

Groupby expects an input JSON file that contains an array of similarly-structured objects, and will group those objects when they have matching values for whatever keys you specify as placeholders in the output pattern.

For example, staff/{department}.json will group objects together into the same file if their department key matches.

Grouping on multiple keys is supported too:

groupby staff.json 'staff/{department}/{country}/{role}.json'
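The grouping described above can be pictured with a small standalone sketch. This is not Groupby's actual implementation: `expandPattern` and `groupByPattern` are hypothetical helper names, the sample data is made up, and slugification is left out for brevity.

```javascript
// Illustrative sketch only; not groupby-cli's actual code.
// expandPattern and groupByPattern are hypothetical helper names.

// Replace each {key} placeholder with the object's value for that key.
function expandPattern(pattern, obj) {
  return pattern.replace(/\{(\w+)\}/g, (match, key) => String(obj[key]));
}

// Collect objects into arrays, keyed by their expanded pattern.
function groupByPattern(list, pattern) {
  const groups = {};
  for (const obj of list) {
    const path = expandPattern(pattern, obj);
    if (!groups[path]) groups[path] = [];
    groups[path].push(obj);
  }
  return groups;
}

// Made-up sample data.
const staff = [
  { username: 'ada', department: 'sales', country: 'be' },
  { username: 'bob', department: 'sales', country: 'be' },
  { username: 'cyd', department: 'legal', country: 'nl' },
];

const byDeptAndCountry = groupByPattern(staff, 'staff/{department}/{country}.json');
console.log(Object.keys(byDeptAndCountry));
// [ 'staff/sales/be.json', 'staff/legal/nl.json' ]
```

Objects end up in the same group only when every placeholder in the pattern expands to the same value, which is why adding more keys to the pattern produces more, smaller groups.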

The only requirement for groups is that values can be turned into a string (and thus into a filename to which the resulting JSON can be written). Values are slugified for use in filenames but left as-is in the JSON.
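To illustrate the difference between the filename and the stored value, here is a minimal sketch. The `slugify` function below is a hypothetical stand-in; the exact slugification rules Groupby applies may differ.

```javascript
// Hypothetical slugify helper; groupby-cli's real rules may differ.
function slugify(value) {
  return String(value)
    .toLowerCase()
    .trim()
    .replace(/[^a-z0-9]+/g, '-') // each run of other characters becomes one dash
    .replace(/^-+|-+$/g, '');    // drop leading and trailing dashes
}

// Made-up sample object.
const member = { name: 'Ada', department: 'Research & Development' };

// The slug is used only to build the filename.
const filename = 'staff/' + slugify(member.department) + '.json';
console.log(filename);          // staff/research-development.json

// The object itself is not modified.
console.log(member.department); // Research & Development
```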

In some cases, your output pattern uniquely identifies each individual object, e.g.

groupby staff.json 'staff/{username}.json'

To save just the objects without wrapping each of them in an array, use the --unique flag. In --unique mode, Groupby will throw an error if your output pattern unexpectedly produces a group that contains more than one item.
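The behaviour of --unique mode can be sketched roughly as follows. `unwrapUnique` is a hypothetical helper written for illustration, not part of Groupby, and the error message is invented.

```javascript
// Hypothetical helper; illustrates the behaviour described above,
// not groupby-cli's actual code.
function unwrapUnique(groups) {
  const result = {};
  for (const [path, items] of Object.entries(groups)) {
    if (items.length > 1) {
      throw new Error('Expected one item for ' + path + ', found ' + items.length);
    }
    result[path] = items[0]; // store the object itself, not a one-element array
  }
  return result;
}

// Made-up groups, as they might look for 'staff/{username}.json'.
const groupsByUsername = {
  'staff/ada.json': [{ username: 'ada', department: 'sales' }],
  'staff/bob.json': [{ username: 'bob', department: 'legal' }],
};

const unwrapped = unwrapUnique(groupsByUsername);
console.log(Array.isArray(unwrapped['staff/ada.json'])); // false
```

Failing loudly here is the safer choice: silently keeping only the first object of a larger group would drop data without any warning.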

Use from node.js

// basic usage
var groupby = require('groupby-cli');
var groups = groupby.group(list, facets);

// more advanced usage, closer to
// the command line
var keyPattern = 'staff/{department}';
var staffByDepartment = groupby.group(staff, keyPattern);
var sales = staffByDepartment['staff/sales'];
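Since the grouped result is a plain object keyed by the expanded pattern, writing it to disk yourself is straightforward. The sketch below assumes that shape and uses only Node's built-in modules; `writeGroups` is a hypothetical helper, and `exampleGroups` is made-up data standing in for the return value of groupby.group.

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Hypothetical helper: write each group to <baseDir>/<key>.json.
function writeGroups(groups, baseDir) {
  const written = [];
  for (const [key, items] of Object.entries(groups)) {
    const file = path.join(baseDir, key + '.json');
    fs.mkdirSync(path.dirname(file), { recursive: true }); // create parent folders
    fs.writeFileSync(file, JSON.stringify(items, null, 2));
    written.push(file);
  }
  return written;
}

// Made-up stand-in for the object returned by groupby.group.
const exampleGroups = {
  'staff/sales': [{ username: 'ada' }, { username: 'bob' }],
  'staff/legal': [{ username: 'cyd' }],
};

// Write into a fresh temporary directory so nothing is overwritten.
const outDir = fs.mkdtempSync(path.join(os.tmpdir(), 'groupby-'));
const files = writeGroups(exampleGroups, outDir);
console.log(files.length); // 2
```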

groupby.group also accepts an options object as a third argument.

License

Groupby comes with a permissive ISC license.

The countries.json dataset included among the examples comes with an Open Database License. For the latest version, see @mledoze's countries repository on GitHub.