Image Text OCR
Extract text from an image using the tesseract package.
image_ocr(image, language = "eng", HOCR = FALSE, ...)
image_ocr_data(image, language = "eng", ...)
image | magick image object returned by
language | passed to tesseract. To install additional languages see instructions in tesseract_download().
HOCR | if
... | additional parameters passed to tesseract
To use this function you need to install tesseract first:
install.packages("tesseract")
Best results are obtained if you set the correct language in tesseract. To install additional languages see instructions in tesseract_download().
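As a rough sketch of setting the language, the helper below downloads the training data for a language when tesseract_info() does not list it as available, then passes that language to image_ocr(). The name ocr_in_language and the file name in the example call are hypothetical. The sketch assumes the magick and tesseract packages are installed.

```r
# Hypothetical helper (not part of magick or tesseract):
# OCR an image file using the training data for one language.
ocr_in_language <- function(path, language = "eng") {
  # Languages whose training data is already installed locally
  installed <- tesseract::tesseract_info()$available
  if (!language %in% installed) {
    # Fetch the training data once; this needs an internet connection
    tesseract::tesseract_download(language)
  }
  img <- magick::image_read(path)
  magick::image_ocr(img, language = language)
}

# Example call (the file name is a placeholder):
# cat(ocr_in_language("scan_fr.png", language = "fra"))
```

Checking for the language before downloading avoids fetching the same training data on every call.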
if (require("tesseract")) {
  img <- image_read("http://jeroen.github.io/images/testocr.png")
  image_ocr(img)
  image_ocr_data(img)
}
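The result of image_ocr_data() can be post-processed like any data frame. The sketch below assumes the result has one row per recognized word, with a `word` column and a numeric `confidence` column on a 0 to 100 scale, as produced by tesseract::ocr_data(). The helper name confident_words and the threshold of 80 are illustrative choices, not part of either package.

```r
# Hypothetical helper: keep only words recognized with at least
# `min_conf` percent confidence.
# Assumes `ocr_df` has a numeric `confidence` column (0 to 100).
confident_words <- function(ocr_df, min_conf = 80) {
  ocr_df[ocr_df$confidence >= min_conf, , drop = FALSE]
}

# Example use with the image from the example above:
# img <- magick::image_read("http://jeroen.github.io/images/testocr.png")
# words <- confident_words(magick::image_ocr_data(img), min_conf = 90)
# paste(words$word, collapse = " ")
```

Filtering on confidence is a simple way to drop likely misreads before further text processing. A suitable threshold depends on image quality.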