Commit

Add files via upload
soberbichler authored Jul 6, 2020
1 parent 5ba0be2 commit bda8a9f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion Text_classification_of_newspaper_clippings_notebook.ipynb
@@ -17,7 +17,7 @@
"\n",
"For classification, topic modelling (LDA) was chosen because it showed the best performance in classification (after experiments with word embeddings or LDA and word embeddings combined). LDA provides a way to group documents by topic and perform similarity searches and improve precision. Thanks to sklearn, it is relatively easy to test different classifiers for a given topic classification task. Logistic regression was chosen as binary classifier. \n",
"\n",
-"*Following graph demonstrates the distribution of the pre-defined categories in newspaper clippings of seleceted Austrian Newspapers (~1000 clippings) on the topic of emigration.* \n",
+"*Following graph demonstrates the distribution of the pre-defined categories in newspaper clippings of seleceted Austrian Newspapers (790 clippings) on the topic of emigration.* \n",
"\n",
"![Collection on the topic of Emigration](images/categories.PNG)\n",
"\n",
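
The notebook text in this hunk describes using LDA topic features with a logistic regression binary classifier from sklearn. A minimal sketch of that kind of pipeline could look like the following. It is not taken from the notebook: the toy clippings, the labels, the `build_pipeline` helper, and the choice of two topics are all assumptions made for illustration.

```python
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Hypothetical toy clippings (not from the actual collection).
texts = [
    "families emigrate to america by ship",
    "emigrants leave vienna for new york",
    "ship carries emigrants across the atlantic",
    "many families emigrate overseas this year",
    "opera premiere in vienna tonight",
    "football match ends in a draw",
    "parliament debates the new budget",
    "weather forecast predicts rain tomorrow",
]
# Hypothetical binary labels: 1 = relevant to emigration, 0 = not relevant.
labels = [1, 1, 1, 1, 0, 0, 0, 0]


def build_pipeline(n_topics=2, seed=0):
    """Bag-of-words counts -> LDA topic weights -> logistic regression."""
    return Pipeline([
        ("counts", CountVectorizer()),
        ("lda", LatentDirichletAllocation(n_components=n_topics, random_state=seed)),
        ("clf", LogisticRegression()),
    ])


model = build_pipeline()
model.fit(texts, labels)

# Per-document topic weights that the classifier sees as features.
counts = model.named_steps["counts"].transform(texts)
topic_features = model.named_steps["lda"].transform(counts)

# Class probabilities for an unseen (also hypothetical) clipping.
probabilities = model.predict_proba(["emigrants board a ship to america"])
```

Because the classifier is just the last pipeline step, swapping `LogisticRegression` for another sklearn estimator is a one-line change, which is one way to compare classifiers on the same topic features.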

0 comments on commit bda8a9f
