/usr/share/doc/gimagereader/manual.html is in gimagereader 3.1.2+git368fa8f-2build1.
This file is owned by root:root, with mode 0o644.
The actual contents of the file can be viewed below.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 | <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<title>gImageReader 3.1.2 Manual</title>
<meta http-equiv="content-type" content="text/html;charset=utf-8" />
</head>
<body>
<h1>gImageReader 3.1.2 Manual</h1>
<h2>Contents</h2>
<ul>
<li><a href="#About">About</a>
<ul>
<li><a href="#Features">Features</a></li>
<li><a href="#News">News</a></li>
</ul>
</li>
<li><a href="#Usage">Usage</a>
<ul>
<li><a href="#Usage_Open">Opening and importing images</a></li>
<li><a href="#Usage_View">Viewing and adjusting images</a></li>
<li><a href="#Usage_Recognize">Recognizing text</a></li>
<li><a href="#Usage_PostProc">Post-processing the recognized text</a></li>
<li><a href="#Usage_Options">Program options</a></li>
</ul></li>
<li><a href="#InstallTessdata">Installation of tesseract language definitions</a></li>
<li><a href="#InstallSpelling">Installation of spelling dictionaries</a></li>
<li><a href="#Contributing">Contributing</a></li>
<li><a href="#Support">Debugging and support</a></li>
</ul>
<h2><a name="About">About</a></h2>
<p>gImageReader is a frontent to <a href="http://code.google.com/p/tesseract-ocr/">tesseract-ocr</a> written in C++.</p>
<h3><a name="Features">Features</a></h3>
<ul>
<li>Allows the user to select the parts of the image they want to be recognized or directly recognize the entire image.</li>
<li>Supports page layout autodetection.</li>
<li>Supports opening images and PDF documents, as well as importing images from scanning devices, from the clipboard and from screenshots.</li>
<li>Recognized text displayed directly next to the image, or copied directly to clipboard.</li>
<li>Multilingual recognition.</li>
<li>Basic editing of output text, including search/replace and removing line breaks on selected text.</li>
<li>Spellcheck enabled for the selected language in the output textfield if corresponding dictionary installed.</li>
<li>User is prompted to install missing spellcheck languages.</li>
</ul>
<h3><a name="News">News</a></h3>
<p><i>A detailed list of changes can be found in the commit log: https://github.com/manisandro/gImageReader/commits/master</i></p>
<p>gImageReader 3.1.2 (Jun 30 2015):</p>
<ul>
<li>Fix incorrect behavior of "Append to current text" with multiple recognition areas</li>
<li>Full details in commit log: https://github.com/manisandro/gImageReader/commits/master</li>
</ul>
<p>gImageReader 3.1.1 (Jun 11 2015):</p>
<ul>
<li>Fix titlebar now shown when window maximized in Gnome 3</li>
<li>New translations: Chinese (Hong Kong), Chinese (Taiwan)</li>
<li>Updated translations: Russian, Portoguese</li>
</ul>
<p>gImageReader 3.1 (May 1 2015):</p>
<ul>
<li>Add option to draw whitespace</li>
<li>Allow searching and replacing only in selected portion of output text</li>
<li>Add "preserve paragraphs" postprocessing option</li>
<li>Allow to open files via drag and drop</li>
<li>Improve rendering of certain PDF files with the Qt interface</li>
<li>Fix scanning broken with certain scanners under Windows</li>
<li>Support automatic spelling dictionary installation under Windows</li>
<li>Allow saving scans in other formats than png</li>
<li>Handful of bugs fixed</li>
</ul>
<p>gImageReader 3.0.1 (Jan 4 2015):</p>
<ul>
<li>Fix a bug in the Qt interface when loading substitutions list from file</li>
<li>Improve behaviour of strip line breaks functionality with multiple line breaks</li>
<li>Small UI improvements</li>
<li>Full details in commit log: https://github.com/manisandro/gImageReader/commits/master</li>
</ul>
<p>gImageReader 3.0 (Dec 12 2014):</p>
<ul>
<li>gImageReader 3.0 stable</li>
<li>New Qt4/5 interface, as alternative to the Gtk interface</li>
<li>Fixed scanning on Windows</li>
<li>Memorize image settings (brightness, contrast, etc) when switching images</li>
<li>Search forward and backward, replace all, case sensitive search</li>
<li>Many bug fixes</li>
<li>Translation updates</li>
<li>Full details in commit log: https://github.com/manisandro/gImageReader/commits/master</li>
</ul>
<p>gImageReader 2.93 (Apr 30 2014):</p>
<ul>
<li>gImageReader 3.0 beta 4</li>
<li>Add possibility to choose multiple recognition languages</li>
<li>Add button to show/hide output pane</li>
<li>Allow toggling spell checking from context menu</li>
<li>Fix a crash when loading a scanned document</li>
</ul>
<p>gImageReader 2.92 (Mar 19 2014):</p>
<ul>
<li>gImageReader 3.0 beta 3</li>
<li>Add replacement-list feature, allowing the user to specify a list of replacements to perform on the recognized text</li>
<li>Fix saving output resulting in empty files</li>
<li>Fix crashes when rendering PDF files</li>
<li>Keep line-breaks if preeded by line-break</li>
<li>Fix localization not working on Windows</li>
</ul>
<p>gImageReader 2.91 (Feb 20 2014):</p>
<ul>
<li>gImageReader 3.0 beta 2</li>
<li>Improve page-layout autodetection by merging overlapping regions</li>
<li>Use native file-chooser dialogs on Gnome/KDE/Windows</li>
<li>Allow performing multipage-recognition with page-layout autodetection</li>
<li>Fix broken search/replace which caused the application to crash</li>
<li>Add Win64 packages</li>
</ul>
<p>gImageReader 2.90 (Feb 11 2014):</p>
<ul>
<li>gImageReader 3.0 beta 1</li>
<li>Support multiple selections (via CTRL-key). Rightclicking a selection opens a context menu which allows to:
<ul>
<li>Deleted and reordered individual selections</li>
<li>Recognize the selected text, either to clipboard or to the output pane</li>
</ul></li>
<li>Basic automatic page layout detection</li>
<li>The output pane now supports undo and redo</li>
<li>Configuration is now automatic</li>
<li>Proper arbitrary rotation of images</li>
<li>Detect deleted/renamed files</li>
<li>Cleaner UI</li>
<li>Port to Gtk+3, rewrite in C++ using the Gtkmm bindings</li>
</ul>
<h2><a name="Usage">Usage</a></h2>
<h3><a name="Usage_Open">Opening and importing images</a></h3>
<ul>
<li>Images can be opened/imported from the <i>sources pane</i>, which is activated by clicking on the top-left button in the <i>main toolbar</i>.</li>
<li>To open an existing image or PDF document, click on the open button in the <i>images tab</i>.</li>
<li>To capture a screenshot, paste image data from the clipboard, or open a recently opened file, click on the arrow next to the open button.</li>
<li>You can manage the list of opened images with the buttons next to the open button. Temporary files (such as screenshots and clipboard data) are automatically deleted when the program exists.</li>
<li>To acquire an image from a scanning device, click on the <i>acquire tab</i> in the <i>sources pane</i>.</li>
</ul>
<h3><a name="Usage_View">Viewing and adjusting images</a></h3>
<ul>
<li>Buttons in the <i>main toolbar</i> to zoom in and out as well as rotate the image by an arbitrary angle. Zooming can also be performed by scrolling on the image with the CTRL key pressed.</li>
<li>Basic image manipulation tools are provided in the <i>image controls toolbar</i>, which is activated by clicking on the <i>image controls button</i> in the <i>main toolbar</i>. The provided tools currently allow brightness and contrast adjustments as well as adjusting the image resolution (through interpolation).</li>
</ul>
<h3><a name="Usage_Recognize">Recognizing text</a></h3>
<ul>
<li>Areas to be recognized can be selected by dragging (left click + mouse move) a rectangular area around portions of the image. Multiple selections are possible by pressing the CTRL key while selecting.</li>
<li>Alternatively, the <i>automatic layout detection button</i>, accessible from the <i>main toolbar</i> will attempt to automatically define appropriate recognition areas, as well as adjust the rotation of the image if necessary.</li>
<li>Selections can be deleted and reordered via the context menu which appears when right-clicking on them. It is also possible to resize existing selections by dragging the corners of the selection rectangle.</li>
<li>The recognition language can be selected from the drop-down menu of the <i>recognize button</i> in the <i>main toolbar</i>. If a spelling dictionary is installed for a tesseract language definition, it is possible to choose between available regional flavors of the language. This will only affect the language for spell-checking the recognized text. Unrecognized tesseract language definitions will appear by their filename prefix, one can however teach the program to recognize such files by defining appropriate rules in the program configuration (see below). Multiple recognition languages can be specified at once from the <i>Multilingual</i> submenu of the drop-down menu.</li>
<li>The selected portions of the image (or the entire image, if no selections are defined) can be recognized by pressing on the <i>recognize button</i> in the <i>main toolbar</i>. Alternatively, individual areas can be recognized by right-clicking a selection. From the selection context menu, it is also possible to redirect the recognized text to the clipboard, instead of the output pane.</li>
</ul>
<h3><a name="Usage_PostProc">Post-processing the recognized text</a></h3>
<ul>
<li>Recognized text will appear in the <i>output pane</i> (unless the text was redirected to the clipboard), which appears as soon as some text was recognized.</li>
<li>If a spelling dictionary for the recognition language is available, automatic spell-checking will be enabled for the outputted text. The used spelling dictionary can be changed either from the language menu next to the <i>recognize button</i>, or from the menu which appears when right-clicking in the text area.</li>
<li>When additional text is recognized, it will either get appended, inserted at cursor position, or replace the previous content of the text buffer, depending on the mode selected in the <i>append mode menu</i>, which can be found in the <i>output pane toolbar</i>.</li>
<li>Other post-processing tools include stripping line breaks, collapsing spaces and more (available from the second button in the <i>output pane toolbar</i>), as well as searching and replacing text. A list of search and replace rules can be defined by clicking on the <i>Find and replace</i> button in the <i>output pane toolbar</i> and then clicking on the <i>Substitutions</i> button.</li>
<li>Changes to the text buffer can be undone and redone by clicking on the appropriate buttons in the <i>output pane toolbar</i>.</li>
<li>The contents of the text buffer can be saved to a file by clicking on the <i>save button</i> in the <i>output pane toolbar</i>.</li>
<li>The output pane can be hidden by clicking the right-most button in the <i>output pane toolbar</i>. This will also clear the contents of the text buffer.</li>
</ul>
<h3><a name="Usage_Options">Program options</a></h3>
<ul>
<li>The program options can be accessed from the <i>application menu</i>, which opens when clicking the right-most button of the <i>main toolbar</i>. When running the application within the Gnome 3 desktop environment, the application menu is part of the top bar of the desktop shell.</li>
<li>Options allow setting the font of the output pane, as well as determining whether the application will notify about missing spelling dictionaries and new program versions.</li>
<li>When running the Gtk+ interface, the options also allow setting the orientation of the <i>output pane</i> (if vertical, it will occupy the right portion of the application, if horizontal, it will occupy the lower portion). When running the Qt interface, the position of the output pane can be freely moved around by dragging on the title bar of the output pane.</li>
<li>Additionally, one can define new rules to match tesseract language definitions to a language (unfortunately, the tesseract language definitions do not include this information). The list of predefined rules can be seen in the <i>Predefined language definitions</i> section. Additional definitions can be added clicking on the <i>Add</i> button below. The rules for a language definition, which consists of three fields, are as follows:
<ul>
<li><b>Filename prefix</b>: The filename of tesseract language data files is of the format <i><prefix>.traineddata</i>, i.e. for English, the file is called <i>eng.traineddata</i> and the prefix is <i>eng</i>.</li>
<li><b>ISO code</b>: This is the <a href="http://en.wikipedia.org/wiki/List_of_ISO_639-1_codes">ISO 639-1 language code</a> (i.e. <i>en</i>), optionally combined by an underscore with the <a href="http://en.wikipedia.org/wiki/ISO_3166-2">ISO 3166-2 country code</a> (i.e. <i>en_US</i>). This information is necessary to match spelling dictionaries to the language. The choice of the actual country code is not strictly relevant, but it is necessary for the automatic installation of spelling dictionaries to find a relevant package of dictionaries. This code can also be made up if no appropriate choices exist, the only result being that no relevant spelling-dictionaries will be matched with the language.</li>
<li><b>Native name</b>: The native name of the language simply determines the label of the entry for the language in the <i>language menu</i>.</li>
</ul></li>
</ul>
<h2><a name="InstallTessdata">Installation of tesseract language definitions</a></h2>
<ul>
<li>On <b>Linux</b>, it's sufficient to install the package corresponding to the language definition one wants to install via the package management application (the packages may be called something like <i>tesseract-langpack-<lang></i>).</li>
<li>On <b>Windows</b>, one needs to download the desired language definitions from the <a href="http://code.google.com/p/tesseract-ocr/downloads/list">project download page</a>, the copy the <i><lang>.traineddata</i> files to <tt>Start→All Programs→gImageReader→Tesseract language definitions</tt>.</li>
<li>To re-detect the available languages, one can restart the program, or select <i>Redetect Languages</i> from the <i>application menu</i>.</li>
</ul>
<h2><a name="InstallSpelling">Installation of spelling dictionaries</a></h2>
<ul>
<li>On <b>Linux</b>, if your distribution supports PackageKit, the program will offer to automatically install missing dictionaries when necessary. If automatic installation does not work for some reason, you can install the spelling dictionaries from the package management application (the packages may be called something like <i>hunspell-<lang></i>).</li>
<li>On <b>Windows</b>, the program will attempt to automaticallly download the desired spelling dictionary from <a href="http://cgit.freedesktop.org/libreoffice/dictionaries/tree">http://cgit.freedesktop.org/libreoffice/dictionaries/tree</a>. Dictionaries can also be installed manually: for a desired language (i.e. <tt>it_IT</tt>), download the <b>*.dic and *.aff</b> files and place them in <tt>Start→All Programs→gImageReader→Spelling dictionaries</tt>.</li>
</ul>
<h2><a name="Contributing">Contributing</a></h2>
<p>
For suggestions and contributions of any kind, please file tickets and/or pull-requests on the <a href="https://github.com/manisandro/gImageReader">GitHub project page</a>, or contact me at <a href="mailto:manisandro@gmail.com">manisandro@gmail.com</a>. I'd especially appreciate translations - here are the main steps for creating a translation:
</p>
<ol>
<li>Download the <a href="https://github.com/manisandro/gImageReader/archive/master.zip">latest source archive</a>.</li>
<li>Enter the <tt>po</tt> folder.</li>
<li>To create a new translation, copy the <tt>gimagereader.pot</tt> file to <tt><language>.po</tt> (i.e. <tt>de.po</tt> for German). To edit an existing translation, simply pick the corresponding file.</li>
<li>Translate the strings in <tt><language>.po</tt>.</li>
<li>Send the <b>po</b> file to <a href="mailto:manisandro@gmail.com">manisandro@gmail.com</a>. Thanks!</li>
</ol>
<h2><a name="Support">Debugging and support</a></h2>
<p>
If you find an issue or have a suggestion, please file a ticket to the <a href="https://github.com/manisandro/gImageReader/issues">gImageReader issue tracker</a>, or contact me directly at <a
href="mailto:manisandro@gmail.com">manisandro@gmail.com</a>. If you are experiencing crashes or hangs, please also try to include the following information in the ticket/email:
</p>
<ul>
<li>If the crash handler appears, include the backtrace which is shown there. To make sure that the backtrace is complete, if you are running the application under Linux, make sure that the <tt>gdb</tt> debugger as well as the debugging symbols are installed if your distribution provides them. The package containing the debugging symbols is usually called <b><packagename>-debuginfo</b> or <b><packagename>-dbg</b>. If you are running the application under Windows, debugging symbols are installed by default.</li>
<li>If you are running under Windows, include the <tt>%ProgramFiles%\gImageReader\gimagereader.log</tt> and <tt>%ProgramFiles%\gImageReader\twain.log</tt> log files.</li>
<li>If you are running under Linux, run the application from a terminal and include any output which appears on the terminal.</li>
<li>Try to describe as best as you can what you were doing and whether the problem is reproducible.</li>
</ul>
<hr />
<p>Copyright ©2009-2015 Sandro Mani, revision: Mon, Apr 20 2015</p>
</body>
</html>
|