--------------------------------------------------------------------- Outline of the software --------------------------------------------------------------------- InftyReader Version 3.3 (C) Copyright 2000-2022: Masakazu Suzuki (Kyushu Universirty), InftyProject (http://www.inftyproject.org/), Science Accessibility Net (http://www.sciaccess.net/). --------------------------------------------------------------------- 1. About the package This is a full setup package of InftyReader Version 3.3. If you execute the installer - InftyReaderE33xx_Setup.exe, then InftyReader will be installed to your PC. If you want to use InftyReader on command prompt window, please use Infty.exe refering to the file InftyHelpE.txt included in the package. 2. Outline of the Software InftyReader is a software application to recognize scientific documents including mathematical expressions, developed in Masakazu Suzuki Laboratory, Graduate school of Mathematics, Kyushu University in collaboration with several cooperation partners. The software recognizes only images carefully scanned in either 600 DPI or 400 DPI. Image files have to be prepared in either TIF, PNG or GIF format. InftyReader outputs the recognition results in various format: IML, LaTeX, HR-TeX, XHTML(MathML) and MS Word document(XML). IML is a XML file format related to InftyEditor, an editor of mathematical documents developed in InftyProject and released from Science Accessibility Net. Using InftyEditor, user can correct and edit the recognition results of InftyReader comparing the results with original images, and convert the results into various formats mentioned above. HR-TeX is a simplified LaTeX-like notation easier "to read" specially designed for the blinds. In XHTML output, mathematical expressions are output using MathML notation. Here are some features of InftyReader: 1. It uses the OCR engines of Toshiba Corporation, "ExpressReaderPro", and of MediaDrive Corporation, "WinReader", simultaneously to recognize characters in ordinary text areas. (As for the characters and math symbols in formulae, it uses Infty's OCR). 2. It can recognize tables including math expressions in the cells. 3. It can convert PDF files into LaTeX or XHTML(MathML) including mathematical expressions. It recognizes the page images of PDF files refering the text information imbedded in PDF. 3. Caution ---- Important! 1. The source documents have to be clearly printed. 2. It should be scanned in 600dpi or 400dpi. The scanning as black and white binary image in 600DPI is <>. 3. InftyReader erases small noises, segments page images into picture areas, table areas and text areas automatically, and then recognizes text and table areas including mathematical expressions. However, to get better recognition results, users are <> to erase noises and pictures before the recognition. 4. In binary scanning, it is important to adjust the binarization threshold of the scanner so that the number of the touched or broken characters is less than 1% of the total number of the characters in each scanned page image. 4. Operating Environment InftyReader runs on Windows 10. 5. How to use InftyReader? 1. Select input file(s) or folder. 2. Input/select output docuent name 3. Press the "Start" button. Then, the recognition results of the selected image files are saved in to the file you specified by the "output docuent name". When, you select a folder instead of files, all the image files in the folder of the specified file type (TIF/GIF/PNG/BMP/PDF) are recognized and the results are output into the files having the name(s) of the folders. If you set check to the "Search Sub Folders" item under the "Option" menu, InftyReader recognizes all the image files in the sub folders of the selected folder. For example, if you select the folder "foldertop" having the subfolder structure below, foldertop |-- subfolder1 | |-- a.tif | |-- b.tif | |-- subfolder2 |-- c.tif |-- d.tif and if you select the file type "IML" for the output file type, then, you will get the files "subfolder1.iml", "subfolder2.iml" in the folder "foldertop". The recognition results of a.tif and b.tif (resp. c.tif and d.tif) are saved in the file subfolder1.iml (resp. subfolder2.iml). If you select LaTeX as output file type, you will get "subfolder1.tex", "subfolder2.tex", and it is similar for other file types HR-TeX and XHTML. 6. License Please read the file "License_E.txt" included in the package. 7. Report Any report about the software will be welcome. -------------------------------------- Non Profit Organization Science Accessibility Net (sAccessNet) e-mail: support@sciaccess.net URL: http://www.sciaccess.net/ --------------------------------------