Simon Willison’s Weblog

Tuesday, 9th November 2021

TIL: Using Tesseract.js to OCR every image on a page. Pasting the code from this TIL into a DevTools console should load [Tesseract.js](https://github.com/naptha/tesseract.js) from a CDN, loop through every image loaded by that page (every PNG, GIF, JPG or JPEG), run OCR on each one, and print the results to the DevTools console.
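
The TIL's own code isn't reproduced in this summary, so here is a minimal sketch of the approach described, not the original snippet. It rests on a few assumptions: the unpkg URL and pinned version are a guess at a suitable CDN build, `isImageUrl` and `ocrAllImages` are hypothetical helper names, and "every image loaded by that page" is approximated with the browser's Resource Timing entries. `Tesseract.recognize(image, "eng")` is the library's one-shot recognition call.

```javascript
// Hypothetical helper: true when the URL's path ends in .png, .gif, .jpg or .jpeg.
function isImageUrl(url) {
  try {
    return /\.(png|gif|jpe?g)$/i.test(new URL(url).pathname);
  } catch {
    return false; // not an absolute URL
  }
}

// Browser only: load Tesseract.js, then OCR every image the page has fetched.
async function ocrAllImages() {
  await new Promise((resolve, reject) => {
    const script = document.createElement("script");
    // Assumed CDN location and version; swap in whichever build you prefer.
    script.src = "https://unpkg.com/tesseract.js@v2.1.0/dist/tesseract.min.js";
    script.onload = resolve;
    script.onerror = reject;
    document.head.appendChild(script);
  });

  // Resource Timing lists everything the page loaded; keep the image URLs.
  const urls = performance
    .getEntriesByType("resource")
    .map((entry) => entry.name)
    .filter(isImageUrl);

  for (const url of urls) {
    try {
      const { data } = await Tesseract.recognize(url, "eng");
      console.log(url, data.text);
    } catch (err) {
      // Cross-origin or blocked images may fail; report and move on.
      console.warn("OCR failed for", url, err);
    }
  }
}

// Only run where a DOM exists (i.e. when pasted into a DevTools console).
if (typeof document !== "undefined") {
  ocrAllImages().catch(console.error);
}
```

OCR runs one image at a time here to keep the console output readable. Pages with a strict Content Security Policy may refuse to load the CDN script at all.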