JSONT to JSONL

Convert a JSONT file into a JSONL file. A JSONT file is a single tar file that contains only individual JSON documents, one per file. A JSONL file is a single file with the .jsonl extension that contains individual JSON documents, one per line.

Steps

Follow the steps below to run the conversion. The source and destination paths can be changed through the command's parameters.

  1. Clone this repository.
  2. Open a PowerShell window in the ~/src directory.
  3. Convert a JSONT file into a JSONL file.
    • The -in and -out parameters set the source and destination files. WARNING: If the output file already exists, it is deleted.
    python convert_jsont_to_jsonl.py -in d:/corpus_in.jsont -out d:/corpus_out.jsonl
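
For readers who want to see what such a conversion involves, here is a minimal sketch using only the Python standard library. It is not the repository's `convert_jsont_to_jsonl.py`; the function name `convert_jsont_to_jsonl` and its parameters are hypothetical, and it assumes every regular file in the tar is a UTF-8 encoded JSON document.

```python
import json
import tarfile


def convert_jsont_to_jsonl(path_in: str, path_out: str) -> int:
    """Write each JSON document in the tar at path_in as one line of path_out.

    Hypothetical helper for illustration; returns the number of documents written.
    """
    count = 0
    # Opening the output with mode "w" truncates any existing file,
    # which mirrors the warning about the output file being replaced.
    with tarfile.open(path_in, "r") as tar, open(path_out, "w", encoding="utf-8") as out:
        for member in tar:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            handle = tar.extractfile(member)
            if handle is None:
                continue
            # Assumption: each member holds exactly one JSON document.
            document = json.loads(handle.read())
            # json.dumps without indent emits no raw newlines,
            # so every document occupies exactly one output line.
            out.write(json.dumps(document) + "\n")
            count += 1
    return count
```

With the paths from the command above, a call would look like `convert_jsont_to_jsonl("d:/corpus_in.jsont", "d:/corpus_out.jsonl")`. Parsing and re-serializing each document, rather than copying bytes, is a deliberate choice here: it collapses pretty-printed JSON onto a single line and fails early on a member that is not valid JSON.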