If you want to host it locally, Stirling PDF can be run in docker, and uses a library that uses Tesseract. Has a bunch of other handy PDF operations, too. I keep it around for the two times a year I need to merge, split, or decrypt PDFs.
https://github.com/Frooodle/Stirling-PDF/blob/main/HowToUseOCR.md
It can do it straight from PDF and do multiple files at a time.
They’re also leaving almost half the volume in bits on the ground.
Volume of a sphere of diameter 1 = ~0.52 Volume of a cube of 1 unit = 1 Volume of a cylinder = ~0.79
So not only are they not doing the required job. They’re wasting twice as much time and material than needed to make the job easier…
And you could still build a pyramid with cylinders…