
Named Entity Recognition (NER)

richardyy1188 edited this page Aug 21, 2018 · 8 revisions

Goal

Find all person names in the text.

Process

Overview

[Figure: diagram of the proposed NER method]

1. Use NER Tools to get names

Run as many NER tools as possible and combine their outputs to increase recall.

  • Stanford CoreNLP: the PERSON tag from the NER output
  • Jieba: the nr tag from the POS-tagging output
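The idea of combining several tools can be sketched as follows. This is only an illustration, not the project's actual code: the function name collect_names and the hand-written token lists are hypothetical, and the sketch assumes each tool's output has already been converted to (token, tag) pairs. Joining consecutive tagged tokens into one name is a simplifying assumption; it would wrongly merge two names that appear back to back.

```python
from typing import Iterable, List, Set, Tuple

TaggedToken = Tuple[str, str]  # (token, tag) as produced by some tagger


def collect_names(tokens: Iterable[TaggedToken], name_tag: str) -> Set[str]:
    """Join runs of consecutive tokens carrying `name_tag` into names."""
    names: Set[str] = set()
    current: List[str] = []
    for word, tag in tokens:
        if tag == name_tag:
            current.append(word)
        elif current:
            # A non-name token closes the current run.
            names.add("".join(current))
            current = []
    if current:
        names.add("".join(current))
    return names


# Hypothetical, hand-written tagger outputs (not real tool output).
corenlp_tokens = [("王", "PERSON"), ("安石", "PERSON"), ("字", "O"), ("介甫", "O")]
jieba_tokens = [("王安石", "nr"), ("字", "v"), ("介甫", "nr")]

# Union of both tools: a name found by either tool is kept, which favors recall.
merged = collect_names(corenlp_tokens, "PERSON") | collect_names(jieba_tokens, "nr")
```

Taking the union keeps every candidate that any tool proposes, so precision has to be recovered in the later filtering step.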

2. Use regex to get names

We use regular expressions to find the names of the biographee's family members and their relations to the biographee at the same time.
See "Use regex to extract kinship".
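A minimal sketch of extracting a relation and a name in one pass is shown below. The pattern, the list of kinship terms, the assumed name length of two to three characters, and the function name extract_kinship are all assumptions made for illustration; the patterns actually used are described on the kinship page.

```python
import re
from typing import List, Tuple

# Hypothetical pattern: a kinship term immediately followed by a
# two- or three-character name. Real text needs a richer pattern.
KINSHIP_PATTERN = re.compile(
    r"(?P<relation>父|母|兄|弟|子|女|妻)(?P<name>[\u4e00-\u9fff]{2,3})"
)


def extract_kinship(text: str) -> List[Tuple[str, str]]:
    """Return (relation, name) pairs found in `text`, in order of appearance."""
    return [
        (m.group("relation"), m.group("name"))
        for m in KINSHIP_PATTERN.finditer(text)
    ]


pairs = extract_kinship("父王益，子王雱。")
```

Because each match captures the kinship term and the name together, the relation does not have to be recovered in a separate step. A kinship character that appears inside an unrelated word would produce a false match, so the results still need filtering.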

3. Filter the names

Difficult Points & Possible Solutions

The same name

Valid surname but not a name