When the API was public academics and independent scholars used to do this sort of research all the time, but now it's prohibitively expensive. Read up on the search/streaming API and reflect on the fact that it used to be free.