This made me think of a much more interesting project. A compendium of information automatically extracted from research articles.
Essentially one totalizing meta analysis.
E.g. If it reads an article about the relationship between height and various life outcomes in Indonesian men, then first, it would store the average height of Indonesian men, the relationship between the average height of Indonesian men and each life outcome in Indonesian men, the type of relationship (e.g. Pearson's correlation), the relationship values (r value), etc. It would store the entity, the relationship, the relationship values, and the doi source.
Something like a quantitative Wikipedia.