Well, yeah, that's my point. As are most of us, I am "ordinary public", restricted to doing it with whatever is generally and freely available, so I don't have a choice: it's scrape the PDFs or nothing.
It would work, if only PDF wasn't such a horribly unscrapable format.