ids of adjective_animal not unique #20
Comments
This is because we sample with replacement; the ids come from a pool of finite size, so collisions are very possible. We discuss this in the vignette; see the description of the "pool size":
With 1 adjective you have 15.6 million possibilities, and if you draw 10,000 from this your chance of seeing a collision is quite high: the probability of at least one collision within the set of 10,000 samples is about 96%, or alternatively there is only a 4% chance of not seeing a collision. If collision avoidance is important and you need a large set of identifiers, you should either add more adjectives (adding two will reduce the collision probability to about 1/2,500 for 10,000 samples), or use one of the more boring identifiers. If you set a
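The 4% figure in the comment above can be checked with the standard birthday-problem calculation. The following is a minimal sketch in Python, not the package's own R code; the function name `collision_probability` is hypothetical, and it assumes ids are drawn uniformly at random, with replacement, from the 15.6 million pool quoted above.

```python
import math


def collision_probability(pool_size: int, n_draws: int) -> float:
    """Probability of at least one repeated id when drawing n_draws ids
    uniformly at random, with replacement, from pool_size possibilities."""
    if n_draws > pool_size:
        # Pigeonhole: more draws than distinct ids guarantees a repeat.
        return 1.0
    # P(all unique) = prod_{k=0}^{n-1} (1 - k / N); sum logs for stability.
    log_p_unique = sum(math.log1p(-k / pool_size) for k in range(n_draws))
    return -math.expm1(log_p_unique)


# Pool size for one adjective, as quoted in the comment (assumed exact here).
POOL_ONE_ADJECTIVE = 15_600_000

p = collision_probability(POOL_ONE_ADJECTIVE, 10_000)
print(f"P(at least one collision) = {p:.2f}")  # roughly 0.96
print(f"P(no collision)           = {1 - p:.2f}")  # roughly 0.04
```

The same function can be called with a larger pool size to see how quickly the collision probability falls as the identifier space grows.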
Given that the primary purpose of an id is often to be a unique identifier, it could be valuable:
Hi, thank you for the package.
I discovered that the ids resulting from adjective_animal are not unique.
Perhaps this might result from homonyms getting listed as separate entries in the word list, but I haven't checked. Anyway, I thought it was important to raise an issue.