Improve misleading help text
Dirichlet Distribution := Distribution over K-dimensional positive vectors that sum to one (i.e., points on the probability simplex)
Two parameters:
- Base measure (positive vector; sums to one)
- Concentration parameter α (positive scalar)

See https://people.cs.umass.edu/~wallach/talks/priors.pdf
jonaschn committed May 21, 2021
1 parent 5d1afa6 commit fe6838a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/cc/mallet/topics/tui/TopicTrainer.java
@@ -147,7 +147,7 @@ public class TopicTrainer {
"The number of iterations to run before first estimating dirichlet hyperparameters.", null);

static CommandOption.Boolean useSymmetricAlpha = new CommandOption.Boolean(TopicTrainer.class, "use-symmetric-alpha", "true|false", false, false,
-"Only optimize the concentration parameter of the prior over topic-words distributions. This may reduce the number of very small, poorly estimated topics, but may disperse common words over several topics.", null);
+"Optimize the concentration parameter (SumAlpha) of the prior over document-topic distributions while keeping it symmetric. This may reduce the number of very small, poorly estimated topics, but may disperse common words over several topics.", null);

static CommandOption.Double alpha = new CommandOption.Double(TopicTrainer.class, "alpha", "DECIMAL", true, 5.0,
"SumAlpha parameter: sum over topics of smoothing over doc-topic distributions. alpha_k = [this value] / [num topics]",null);
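
The help texts above describe a symmetric prior as a concentration parameter (SumAlpha) spread evenly over topics, i.e. alpha_k = SumAlpha / numTopics. The following is a minimal sketch of that arithmetic only; the class `SymmetricAlphaSketch` and its methods are hypothetical names invented for illustration and are not part of MALLET, and the topic count of 10 is an assumed example value (5.0 is the default of the `alpha` option shown in the diff).

```java
import java.util.Arrays;

// Hypothetical illustration class; NOT part of MALLET.
final class SymmetricAlphaSketch {

    // Symmetric case: uniform base measure (1/K per topic) scaled by the
    // concentration parameter, so every alpha_k equals sumAlpha / numTopics.
    static double[] symmetricAlpha(double sumAlpha, int numTopics) {
        if (sumAlpha <= 0.0 || numTopics <= 0) {
            throw new IllegalArgumentException("sumAlpha and numTopics must be positive");
        }
        double[] alpha = new double[numTopics];
        Arrays.fill(alpha, sumAlpha / numTopics);
        return alpha;
    }

    // General case from the commit message: alpha_k = concentration * baseMeasure[k],
    // where baseMeasure is a positive vector that sums to one.
    static double[] scaledAlpha(double concentration, double[] baseMeasure) {
        if (concentration <= 0.0) {
            throw new IllegalArgumentException("concentration must be positive");
        }
        double[] alpha = new double[baseMeasure.length];
        for (int k = 0; k < baseMeasure.length; k++) {
            alpha[k] = concentration * baseMeasure[k];
        }
        return alpha;
    }

    public static void main(String[] args) {
        // Assumed example: SumAlpha = 5.0 (the option's default) and 10 topics.
        double[] alpha = symmetricAlpha(5.0, 10);
        System.out.println(Arrays.toString(alpha));
    }
}
```

Under this reading, optimizing with `use-symmetric-alpha` enabled would change only the single scalar `sumAlpha` while the base measure stays uniform; an asymmetric prior would additionally let the base measure vary per topic.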