cpp11tesseract: Open Source OCR Engine

Bindings to 'tesseract': 'tesseract' (<https://github.com/tesseract-ocr/tesseract>) is a powerful optical character recognition (OCR) engine that supports over 100 languages. The engine is highly configurable in order to tune the detection algorithms and obtain the best possible results.

Version: 5.3.5
Imports: curl, digest
LinkingTo: cpp11
Suggests: spelling, knitr, tibble, rmarkdown, testthat (≥ 3.0.0)
Published: 2025-03-14
DOI: 10.32614/CRAN.package.cpp11tesseract
Author: Mauricio Vargas Sepulveda ORCID iD [aut, cre], Jeroen Ooms ORCID iD [aut] (Author of tesseract R package), HP [cph] (Author of tesseract), Google [cph] (Author of tesseract), Munk School of Global Affairs and Public Policy [fnd]
cpp11tesseract author details
Maintainer: Mauricio Vargas Sepulveda <m.sepulveda at mail.utoronto.ca>
BugReports: https://github.com/pachadotdev/cpp11tesseract/issues
License: Apache License (≥ 2)
URL: https://pacha.dev/cpp11tesseract/
NeedsCompilation: yes
SystemRequirements: Tesseract OCR ( deb: libtesseract-dev libleptonica-dev tesseract-ocr-eng, rpm: tesseract-devel leptonica-devel tesseract-langpack-eng, brew: tesseract leptonica )
Language: en-US
Materials: NEWS
CRAN checks: cpp11tesseract results

Documentation:

Reference manual: cpp11tesseract.pdf
Vignettes: Using the Tesseract OCR engine in R (source)

Downloads:

Package source: cpp11tesseract_5.3.5.tar.gz
Windows binaries: r-devel: cpp11tesseract_5.3.5.zip, r-release: not available, r-oldrel: not available
macOS binaries: r-devel (arm64): not available, r-release (arm64): not available, r-oldrel (arm64): not available, r-devel (x86_64): not available, r-release (x86_64): not available, r-oldrel (x86_64): not available
Old sources: cpp11tesseract archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=cpp11tesseract to link to this page.