The Philadelphia Reflections website contains several hundred haiku. This analysis shows how these haiku can be clustered (categorized) according to how significant the words in each haiku appear to be, relative to word usage across the collection as a whole.
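As a rough illustration of comparing per-haiku word usage against collection-wide usage, the sketch below weights words with TF-IDF and compares texts by cosine similarity. TF-IDF is an assumption here, not necessarily the weighting used in the report; the function names and the three sample lines are invented for the example and are not taken from the corpus.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def tfidf_vectors(docs):
    """Return one {word: weight} dict per document.

    A word scores high when it is frequent in its own document
    but appears in few documents of the collection.
    """
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each word appear?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1
        vectors.append({
            word: (count / total) * math.log(n_docs / doc_freq[word])
            for word, count in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse {word: weight} vectors."""
    dot = sum(weight * b.get(word, 0.0) for word, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up placeholder texts, NOT haiku from the actual corpus.
samples = [
    "old bridge over the river at dawn",
    "river fog lifts from the old bridge",
    "tax forms pile up on the desk",
]
vectors = tfidf_vectors(samples)
similar = cosine(vectors[0], vectors[1])
dissimilar = cosine(vectors[0], vectors[2])
```

In this toy input, "the" occurs in every text and so receives zero weight, while the first two texts are pulled together by the rarer words they share. A clustering algorithm would then group texts using similarities of this kind.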
The text files are contained in corpus_haiku.zip.
See HaikuClusterReport.pdf for a detailed report. The Technology appendix of that report describes the technology used.