Skip to content

Latest commit

 

History

History
20 lines (11 loc) · 545 Bytes

README.md

File metadata and controls

20 lines (11 loc) · 545 Bytes

text-analyzer

Python script that outputs stats on input text e.g. number of sentences, etc.

Usage: python text_analyzer.py <input-file> <output-file>

The program accepts as input a plain text document and extracts some properties of the content.

Note: Sample data provided to use with program.

Output:

Number of paragraphs.

Number of sentences.

Number of words (i.e., "tokens").

Number of distinct words (i.e., "word types").

List of word frequency counts: Words are ordered by frequency (in the descending order).