Mathematical Formulas and Character Recognition.
Introduction
By calling the 'SimpleTex' https://simpletex.cn/ open API implements text and mathematical formula recognition on the image, and the output formula can be used directly with Markdown and LaTeX.
Installation
# Install development version from GitHub
remotes::install_github("chuxinyuan/simpletex")
# Install from CRAN
install.packages("simpletex")
Configure the ID and SECRET for the SimpleTex API
To use the simpletex package, user need to first register with the SimpleTex platform, and then create application. After creating the application, Then you will get SimpleTex API ID and SECRET.
Put your SimpleTex API ID and SECRET in the following code and run it once.
cat(
'\n# ID and SECRET of SimpleTex',
'SIMPLETEX_APP_ID = "Your SimpleTex API ID"',
'SIMPLETEX_APP_SECRET = "Your SimpleTex API SECRET"',
file = '~/.Renviron', sep = '\n', append = TRUE
)
Usage
imgocr(img = "path/to/image", mode = "latex_ocr")
imgAn image file, supporting jpg, png, bmp format.modeis service model. The value can belatex_ocr,latex_ocr_turbo, orsimpletex_ocr. The default value islatex_ocr. Thelatex_ocrandlatex_ocr_turboare used for formula identification.latex_ocris better thanlatex_ocr_turbo, butlatex_ocr_turbois faster.simpletex_ocris suitable for general image recognition.
For the output, for example: "E=mc^2", you can copy only the parts within quotation marks of the output directly into the '.md' or '.tex' document.
Note: in order to the formula to render properly in '.md' or '.tex' documents, you also need to wrap the formula with $ or $$ on both sides.
License
simpletex is free and open source software, licensed under MIT + file LICENSE.