Comments by "LoneTech" (@0LoneTech) on "The Atlantic"
channel.
-
@lawrence-yx1ew PDF is a document format for page based documents. It could be as simple as blank pages with dimensions, or as complex as having text, forms, embedded programs, and 3D models. Many PDF scanners can also perform OCR to detect text, but a scanned page at its most basic is merely an image. Only those that contain a text layer (whether that's recognized and annotated in the background, like OCR, or the actual graphical elements, like when saved from a word processor) are searchable, though it's also possible to make image search systems. In this case, the actually significant data in all these forms is hand written, and has terrible odds of being recognized correctly by a computer. The first thing this office needs is a crank to stop hitting that arrow down key, that's a clear RSI recipe.
12
-
2