WIP: Resolve "Enhancing R: Optimize GetBirthdate.R"
Adds use of annotations to birthdate extraction script
-
Use Infobox to extract date, if there is no infobox get first date entity from text assumption: first date entity is birthdate
-
There are still results like this from infobox search:
[[972]]
[1] "Frits Zernike16 July 1888"
- If I try to remove the name with regex, I remove the whole string in some cases and get a lot of empty strings as results.
What do you think? Leave errors like this inside?
Closes #39 (closed)