First names may be only one letter -> we can use a dictionary-style lookup of the full name, based on the names we have from genderize and the census names too.
For any given linked subsample, we can use the first name from the other dataset if it is spelled out there.
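A minimal sketch of the linked-subsample idea, assuming each linked pair gives us one first name per dataset. The function names `is_initial` and `resolve_first_name` are hypothetical and not part of the existing code. The initial is only expanded when the spelled-out name starts with the same letter, which is an extra safety check added here.

```python
def is_initial(first_name: str) -> bool:
    """Return True when the first name is a single letter, with or without a period."""
    stripped = first_name.strip().rstrip(".")
    return len(stripped) == 1 and stripped.isalpha()


def resolve_first_name(name_a: str, name_b: str) -> str:
    """Return the first name to use for the record in dataset A.

    If A only has an initial and the linked record in B has a spelled-out
    name starting with the same letter, use B's name. Otherwise keep A's.
    """
    a, b = name_a.strip(), name_b.strip()
    if is_initial(a) and b and not is_initial(b) and b[0].lower() == a[0].lower():
        return b
    return a


print(resolve_first_name("J.", "Juan"))
print(resolve_first_name("J", "Maria"))
```

A mismatching initial (the second call) leaves the record unchanged, since that case more likely signals a bad link than a missing name.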
For example, for Spanish we currently have:
However, the main last name is iglesias (the first last name).

Proposal: use https://nationalize.io to predict which country/language a name is from and implement specific rules for those.
Caveat: for Spanish names, sometimes people give just the first last name and sometimes both, so it is not obvious how to handle this automatically.
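A rough sketch of what a country-specific rule could look like, under several assumptions: the nationalize response is taken to be a JSON object with a `country` list of `{"country_id", "probability"}` entries, the `SPANISH_SPEAKING` and `PARTICLES` sets are illustrative and incomplete, and `top_country` and `main_last_name` are hypothetical names. It does not resolve the caveat above: when only one last name is given, it is returned unchanged.

```python
from typing import Optional

# Illustrative, incomplete set of country codes where the Spanish rule would apply.
SPANISH_SPEAKING = {"ES", "MX", "AR", "CO", "CL", "PE"}
# Particles that belong to the following surname rather than standing alone.
PARTICLES = {"de", "del", "la", "las", "los"}


def top_country(response: dict) -> Optional[str]:
    """Pick the most probable country code from an (assumed) nationalize response."""
    countries = response.get("country", [])
    if not countries:
        return None
    return max(countries, key=lambda c: c["probability"])["country_id"]


def main_last_name(last_name: str, country_id: Optional[str]) -> str:
    """Keep only the first last name when the Spanish rule applies."""
    tokens = last_name.split()
    if country_id not in SPANISH_SPEAKING or len(tokens) < 2:
        return " ".join(tokens)
    kept = []
    for token in tokens:
        kept.append(token)
        if token.lower() not in PARTICLES:
            break  # first real surname reached
    return " ".join(kept)


# Example response shape (assumed, with made-up probabilities).
response = {"name": "iglesias", "country": [
    {"country_id": "ES", "probability": 0.4},
    {"country_id": "MX", "probability": 0.2},
]}
print(main_last_name("iglesias preysler", top_country(response)))
```

Keeping the country prediction separate from the rule itself means the rule can be tested without calling the API, and other languages can get their own rule functions later.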