Convert a JSONT
file into a JSONL
file.
A JSONL
file is a single file with the extension of .jsonl
that contains several individual JSON
documents, one per line.
A JSONT
file is a single tar
file that contains only individual JSON
documents, one per file.
Below are the steps needed to run the conversion process. The pathing can be changed by updating the parameters.
- Clone this repository.
- Open a PowerShell window to the
~/src
directory. - Convert a
JSONT
file into aJSONL
file.- The
-in
/-out
parameters control the source and destination files. WARNING: If the output file exists, it is deleted.
python convert_jsont_to_jsonl.py -in d:/corpus_in.jsont -out d:/corpus_out.jsonl
- The