---------------------------------------------------------------------
Outline of the software
---------------------------------------------------------------------
InftyReader Version 3.3
(C) Copyright 2000-2022:
    Masakazu Suzuki (Kyushu Universirty),
    InftyProject (http://www.inftyproject.org/),
    Science Accessibility Net (http://www.sciaccess.net/).
---------------------------------------------------------------------

1. About the package

This is a full setup package of InftyReader Version 3.3. 

If you execute the installer

  - InftyReaderE33xx_Setup.exe,

then InftyReader will be installed to your PC.

If you want to use InftyReader on command prompt window, please use Infty.exe 
refering to the file InftyHelpE.txt included in the package.


2. Outline of the Software

InftyReader is a software application to recognize scientific documents including 
mathematical expressions, developed in Masakazu Suzuki Laboratory, 
Graduate school of Mathematics, Kyushu University in collaboration 
with several cooperation partners. 

The software recognizes only images carefully scanned in either 600 DPI 
or 400 DPI. Image files have to be prepared in either TIF, PNG or GIF format. 

InftyReader outputs the recognition results in various format: 
IML, LaTeX, HR-TeX, XHTML(MathML) and MS Word document(XML).

IML is a XML file format related to InftyEditor, an editor of mathematical 
documents developed in InftyProject and released from Science Accessibility 
Net. 

Using InftyEditor, user can correct and edit the recognition results 
of InftyReader comparing the results with original images, and convert 
the results into various formats mentioned above. 

HR-TeX is a simplified LaTeX-like notation easier "to read" specially 
designed for the blinds. 

In XHTML output, mathematical expressions are output using MathML notation.

Here are some features of InftyReader:

  1. It uses the OCR engines of Toshiba Corporation, "ExpressReaderPro", 
     and of MediaDrive Corporation, "WinReader", simultaneously to recognize 
     characters in ordinary text areas. (As for the characters and math 
     symbols in formulae, it uses Infty's OCR).

  2. It can recognize tables including math expressions in the cells.

  3. It can convert PDF files into LaTeX or XHTML(MathML) including 
     mathematical expressions.
     It recognizes the page images of PDF files refering the text information 
     imbedded in PDF. 


3. Caution ---- Important!

  1. The source documents have to be clearly printed. 
  2. It should be scanned in 600dpi or 400dpi. 
     The scanning as black and white binary image in 600DPI is <<recomended>>.
  3. InftyReader erases small noises, segments page images into picture areas, 
       table areas and text areas automatically, and then recognizes text and 
       table areas including mathematical expressions.
     However, to get better recognition results, users are <<recommended>> 
       to erase noises and pictures before the recognition.
  4. In binary scanning, 
       it is important to adjust the binarization threshold of the scanner so 
       that the number of the touched or broken characters is less than 1% of 
       the total number of the characters in each scanned page image.


4. Operating Environment

InftyReader runs on Windows 10.

5. How to use InftyReader?

   1. Select input file(s) or folder.
   2. Input/select output docuent name
   3. Press the "Start" button.

Then, the recognition results of the selected image files are saved in to 
the file you specified by the "output docuent name". When, you select a 
folder instead of files, all the image files in the folder of the specified 
file type (TIF/GIF/PNG/BMP/PDF) are recognized and the results are output 
into the files having the name(s) of the folders. 

If you set check to the "Search Sub Folders" item under the "Option" menu, 
InftyReader recognizes all the image files in the sub folders of the selected 
folder. For example, if you select the folder "foldertop" having the subfolder 
structure below, 

        foldertop
           |-- subfolder1
           |      |-- a.tif
           |      |-- b.tif
           |
           |-- subfolder2
                  |-- c.tif
                  |-- d.tif

and if you select the file type "IML" for the output file type, then, you will 
get the files "subfolder1.iml", "subfolder2.iml" in the folder "foldertop". 
The recognition results of a.tif and b.tif (resp. c.tif and d.tif) are saved 
in the file subfolder1.iml (resp. subfolder2.iml). 

If you select LaTeX as output file type, you will get "subfolder1.tex", 
"subfolder2.tex", and it is similar for other file types HR-TeX and XHTML.


6. License

Please read the file "License_E.txt" included in the package.

7. Report

Any report about the software will be welcome. 

--------------------------------------
Non Profit Organization
Science Accessibility Net (sAccessNet)
e-mail: support@sciaccess.net
URL: http://www.sciaccess.net/
--------------------------------------