file:///C|/VisioneerDoc/html/glossary.htm

character format 

Font and style information applied to characters. Character 
format information includes the font name and type size, as well as 
attributes such as underline, bold, italic, or some combination 
of these properties. Compare with page format.

character identification error 

An incorrectly recognized bitmapped character. There are 
two kinds of character identification errors—substitutions 
and rejects. A character substitution occurs when a character 
is incorrectly recognized as another. A reject character results 
from the inability of the OCR application to interpret a 
character image with sufficient confidence. In such cases, 
recognition is not attempted and the character is flagged as 
illegible. Compare with layout analysis error.
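
The distinction between substitutions and rejects can be illustrated with a short Python sketch. It is not taken from Pro OCR: the reject marker "~", the function name, and the assumption that the recognized text lines up one-to-one with a known correct text are all hypothetical and chosen only for illustration.

```python
REJECT_MARK = "~"  # hypothetical placeholder an OCR tool might emit for an illegible character


def count_identification_errors(truth, recognized):
    """Count substitutions and rejects, assuming both strings align one-to-one."""
    if len(truth) != len(recognized):
        raise ValueError("this sketch only handles texts of equal length")

    substitutions = 0
    rejects = 0
    for expected, actual in zip(truth, recognized):
        if actual == REJECT_MARK:
            # Recognition was not attempted: the character was flagged as illegible.
            rejects += 1
        elif actual != expected:
            # The character was recognized, but as the wrong character.
            substitutions += 1
    return {"substitutions": substitutions, "rejects": rejects}


if __name__ == "__main__":
    # "l" read as "1" is a substitution; "a" flagged with "~" is a reject.
    print(count_identification_errors("clear", "c1e~r"))
```

A real comparison would also need to handle inserted or dropped characters, which this sketch deliberately ignores.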

character image 

An arrangement of bits that defines a character in a font. 

character recognition 

The OCR process in which bitmapped character images are 
interpreted and translated into ASCII computer codes. 

character style 

See type style.

clipboard 

In Windows applications, temporary storage for text that is 
cut or copied from a document. Text saved in the clipboard 
may be pasted back into the same or another document. 

column information 

Part of Pro OCR’s page format information. Column 
information includes the location of the column on the page, 
the width of the column, and its left and right margins. 

compression 

Electronic method for reducing the size of a file without 
losing any information in the file. Compressed TIFF files 
take up significantly less disk space than uncompressed files. 
See also TIFF and CCITT.
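
The idea of reducing file size without losing information can be shown with a small Python sketch. It uses the general-purpose zlib module from the Python standard library as a stand-in; it is not the CCITT or TIFF compression that Pro OCR reads, and the sample data and function name are made up for illustration.

```python
import zlib


def round_trip(data):
    """Compress data, decompress it again, and return (compressed_size, restored_bytes)."""
    packed = zlib.compress(data)
    restored = zlib.decompress(packed)
    return len(packed), restored


if __name__ == "__main__":
    # Made-up stand-in for a scanned page: long runs of white (0xFF) with short runs of black (0x00).
    page = (bytes([0xFF]) * 4000 + bytes([0x00]) * 96) * 10
    packed_size, restored = round_trip(page)
    print("original bytes:  ", len(page))
    print("compressed bytes:", packed_size)
    print("identical after decompression:", restored == page)
```

Because the restored bytes match the original exactly, nothing is lost; the saving comes only from describing the long repeated runs more compactly.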

confidence 

In Pro OCR, a measure of the certainty of an unknown 
character’s identity. Above a certain confidence level, a 
character is automatically recognized. At lower confidence 
levels, a character may either be recognized, but flagged as a 
suspect character, or not recognized and flagged as an 
illegible character. 
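
One way to picture the three outcomes described above is as two cut-off levels on a confidence scale. The following Python sketch assumes this reading; the 0-to-1 scale, the two threshold values, and the function name are hypothetical and do not describe how Pro OCR actually computes or compares confidence.

```python
def classify_character(confidence, accept_level=0.90, reject_level=0.50):
    """Map a confidence value (assumed to be 0.0 to 1.0) to one of three outcomes."""
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0.0 and 1.0")
    if confidence >= accept_level:
        return "recognized"   # accepted automatically, no flag
    if confidence >= reject_level:
        return "suspect"      # recognized, but flagged for proofing
    return "illegible"        # not recognized; flagged as illegible


if __name__ == "__main__":
    for value in (0.95, 0.70, 0.20):
        print(value, classify_character(value))
```

In this sketch, raising accept_level would flag more characters as suspect without changing which character each one was identified as.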


Summary of Contents for PRO OCR 100

Page 1: ...Untitled Document Pro OCR User s Guide file C VisioneerDoc Main html 1 20 2003 4 21 09 PM...

Page 2: ...ro OCR Pro OCR is an Optical Character Recognition OCR application An OCR application converts images of text such as those obtained from scanning a document or receiving a fax through your fax modem...

Page 3: ...haracters and transform the image into a plain text file Pro OCR does all of these basic tasks but it can also get the entire page into your word processor or spreadsheet as is retaining the shape for...

Page 4: ...example recognizing at another time Internet readiness supports HTML export format You can convert an image file directly to an HTML page and upload it to the Web site Proofing options Pro OCR has a...

Page 5: ...converts what it sees into an image and stores the image on the computer To transform a scanned text image into something a word processing or spreadsheet application can recognize as characters you...

Page 6: ...recognized as numbers and never mistakenly identified as letters Recognition and retention of fonts characters styles and page formatting Pro OCR recognizes and retains the differences between serif a...

Page 7: ...ive companies Information is subject to change without notice and does not represent a commitment on the part of Visioneer Inc The software described is furnished under a licensing agreement The softw...

Page 8: ...FROM ANY DEFECT IN THE PRODUCT OR FROM ITS USE EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGES All exclusions and limitations in this warranty are made only to the extent permitted by applicable l...

Page 9: ...this equipment Operation with non approved equipment or unshielded cables is likely to result in interference to radio and TV reception The user is cautioned that changes and modifications made to the...

Page 10: ...Documents Chapter 4 Locating Text and Graphics Chapter 5 Setting Recognize Options and Proofing a Recognized Document Chapter 6 Saving and Printing Documents Chapter 7 Creating and Processing Deferred...

Page 11: ...Table of Contents Contents Chapter 1 Introducing Visioneer Pro OCR 100 Why Pro OCR Features and Highlights of Pro OCR Glossary file C VisioneerDoc html toc1 htm 1 20 2003 4 21 11 PM...

Page 12: ...ingle Column locating method Auto OCR Auto brightness automatic document feeder ADF automatic processing background noise backup backwards compatible bit image bitmap bitmapped character bold text bri...

Page 13: ...er image character recognition character style clipboard column information compression confidence consistent document copyrighted document deferred job deferred processing degraded image dialog box d...

Page 14: ...t file extension file formats file type fine resolution flatbed scanner font font family font mapping format retention Gallery Get Page grayscale image hard page breaks heavy character I beam pointer...

Page 15: ...s insertion point italic text justification kerning landscape orientation layout layout analysis error Legal page size Lenient suspect threshold letter quality text line break Locate locate region loc...

Page 16: ...ocating method Normal suspect threshold numeric region OCR On Screen Verifier Optical Character Recognition OCR order of text regions orientation output file formats page controls page format page ima...

Page 17: ...ext portrait orientation printer font Pro OCR Deferred format Pro OCR format Pro OCR process Pro OCR window Proof proportionally spaced font recognition accuracy Recognize recognized text recognizing...

Page 18: ...ner driver scanning screen font scroll bars serif serif font mapping settings file sheetfed scanner side by side columns single bit image single step processing skewed text spell checking standard res...

Page 19: ...pt text superscript text supplementary dictionaries suspect character suspect threshold Tag Image File Format template template matching Template locating method text quality text region text style te...

Page 20: ...ypeface type quality type size type style underline text User Defined page size user dictionary view selector window Windows word wrap zoom controls file C VisioneerDoc html glos htm 9 of 9 1 20 2003...

Page 21: ...d One of Pro OCR s three locating methods Use it when you want Pro OCR to read a page as a single column from left margin to right margin ignoring any column or paragraph spacing Most commonly used fo...

Page 22: ...xtraneous marks dirt or ink bleed Problems with background noise can be reduced by using the brightness setting in Pro OCR to compensate for the type of noise on the page backup n A copy of a disk or...

Page 23: ...to be misrecognized Problems with broken characters can be reduced by using the brightness setting in Pro OCR to darken the image when scanning Compare with heavy character and touching characters bui...

Page 24: ...s in which bitmapped character images are interpreted and translated into ASCII computer codes character style See type style clipboard In Windows applications temporary storage for text that is cut o...

Page 25: ...lly specify Get Page Locate and recognize settings for particular pages when necessary while still being able to automatically process a job at a later time degraded image An image that contains broke...

Page 26: ...ernal format such as a word processor spreadsheet text or standard image file An exported document is created for use outside of Pro OCR export format Pro OCR can save and export documents in a variet...

Page 27: ...le Helvetica is a font family It contains a variety of typefaces including for example Helvetica Helvetica Bold Helvetica Italic Helvetica Bold Italic See also font and typeface font mapping Set in th...

Page 28: ...that resembles an upper case I When the pointer has this shape you can select text See also insertion point icon An image that graphically represents an object a concept or a message Screen icons can...

Page 29: ...gether which can cause letters to touch when the page is scanned See also touching characters landscape orientation When you hold a page of text to read it it is in landscape orientation when the page...

Page 30: ...pplying locate regions on the page according to the current Locate and Pictures settings The current Locate setting may be either Normal As Single Column or Template The current Pictures setting may b...

Page 31: ...A column format where the text flows down the vertical length of the column before moving to the top of the next column As the name suggests this type of column is commonly found in newspaper and mag...

Page 32: ...nter of the next text region in Image View after Locating has been done Text is output to application files in the order in which text regions are specified orientation Determines the angle or rotatio...

Page 33: ...ro OCR can read single PCX files produced by many scanners fax cards and graphics applications A variation of the PCX format is DCX a multi page PCX file Pro OCR can also read DCX files picture elemen...

Page 34: ...format Documents at various stages of processing may be saved in this format and opened later for additional processing Pro OCR process The five stage process that translates printed text or image fi...

Page 35: ...in which bitmapped text images are converted into editable text Recognizes text defined by the text regions on the current page according to the current Recognize setting recognized text The initial...

Page 36: ...ins the I O address of the scanner and specific information about the scanner s characteristics scanning The act of using a scanner to convert or digitize the image of a page into digital form for use...

Page 37: ...olumns is best suited for the As Single Column locate setting in Pro OCR single bit image Also referred to as line art An image format where individual pixels are expressed as a single bit either blac...

Page 38: ...urrent page Stringent suspect threshold Tells Pro OCR to highlight all suspect characters Use it when accuracy is important and when there are many words in the document that are not in the dictionari...

Page 39: ...has a confidence value associated with it Setting the suspect threshold determines the minimum confidence value used to highlight suspect characters A lenient threshold displays only the suspect char...

Page 40: ...criterion commonly used to evaluate OCR performance Compare with recognition accuracy TIFF Tag Image File Format Standard graphic file format for saving high resolution bitmapped images Pro OCR can r...

Page 41: ...le that the user may add words to It is used along with the built in dictionary to assist in recognition and to mark possible misspelled words Compare with built in dictionary and supplementary dictio...

Page 42: ...you change the margins or the type size or the spacing between words in a document lines are often re wrapped When you save documents in any export format text lines are wrapped in the output file Whe...

Page 43: ...in Pro OCR Format Example 2 Opening a File and Saving It in a Word Processor Format Example 3 Scanning a Document of Multi Column Text Example 4 Scanning a Document With Tables and Saving in a Spreads...

Page 44: ...n editable format To complete this conversion you perform the following basic steps 1 Get Page acquire pages either from a scanner or by opening an image file 2 Locate indicate which text on the page...

Page 45: ...nu choose Programs and then choose Visioneer OCR Wizard 2 If you use PaperPort software start PaperPort and then choose the Pro OCR link To start Pro OCR and select processing options 1 From the Windo...

Page 46: ...toolbar Lets you change common settings start Auto OCR or individually perform any of the basic steps required to convert an image to text Several Gallery buttons have drop down lists from which you...

Page 47: ...oftware is installed and the scanner can scan images into your computer Pro OCR works with many TWAIN compliant devices You can select the TWAIN device in the Pro OCR software NOTE If you are using Pr...

Page 48: ...t one You don t have to repeat this procedure unless you want to select a different scanner Learning About the Gallery Toolbar The Gallery toolbar contains buttons for starting the various steps of th...

Page 49: ...ings or scan pages that have mixed orientations portrait and landscape Button Does this Auto OCR Performs Steps 1 2 and 3 Get Locate and Recognize of the OCR process Before you click this button selec...

Page 50: ...rk appears next to the option you selected Tutorial Examples Now that you know the basic steps you can practice them using the sample documents that came with Pro OCR The Pro OCR software comes config...

Page 51: ...ext Only and Single Columns Only 3 From the Recognize drop down list choose Degraded or Fax Quality Starting Auto OCR By clicking the Auto OCR button you can perform the first three steps of the OCR p...

Page 52: ...ogress bar moves down the page When Pro OCR finishes locating it displays text boxes indicating located text regions with arrows connecting each text region to the next Pro OCR outputs text in the ord...

Page 53: ...in a progress bar moves down the page When Pro OCR finishes recognizing the text the Recognition Completed dialog box appears 7 Click OK The document appears in the text view You use the text view to...

Page 54: ...just save it Saving a Document You can save the processed document to disk in different formats For example if you want to open the document again in Pro OCR you select the Pro OCR format To save the...

Page 55: ...the Save As button on the Gallery toolbar The Save As dialog box appears 2 Choose Pro OCR from the Save As drop down list By saving the document in this format you can edit the pages later within Pro...

Page 56: ...rocess a file that was saved on disk You can use this procedure to read TIFF PCX or DCX files produced by Pro OCR or other applications Opening a File For this example use the file SAMPLEB TIF in the...

Page 57: ...gion cannot be recognized but can be saved as an image By specifying the Locate options Pro OCR knows what types of regions are in the document To specify the regions to locate 1 Select Locate Text On...

Page 58: ...ecognize the text in a document Pro OCR reads the text and displays the actual characters Before recognizing the document you should specify the quality of the image text You can do this by using the...

Page 59: ...ognize button drop down list 2 Click the Recognize button in the Gallery toolbar Pro OCR displays a bar that moves through the document as Pro OCR recognizes the text When the process finishes you see...

Page 60: ...1 Click the Proof button in the Gallery toolbar or press the Tab key Pro OCR starts at the current insertion point if there is one Otherwise it starts at the top of the current page Pro OCR highlight...

Page 61: ...formats including Rich Text Format RTF plain text and Microsoft Excel 4 Click Save 5 Choose Close from the File menu Example 3 Scanning a Document of Multi Column Text This example introduces you to p...

Page 62: ...lbar Your scanner software dialog box appears 5 Use the scanner software as you usually do to scan the document After scanning the sample document the document appears in Pro OCR A dialog box appears...

Page 63: ...text region to the next Note that by using Locate Text Only the graphic element in the sample was not located and so a box does not appear around it Pro OCR outputs text in the order in which the arro...

Page 64: ...e for the file in the File Name box 4 Click Save Both the image of the scanned page and the recognized text are saved Always save files in the Pro OCR format when you want to reopen them in Pro OCR NO...

Page 65: ...et format 1 Select Single Columns Only and Locate Text Only from the Locate drop down list in the Gallery toolbar 2 Put Sample Document D in the scanner Make sure to place it in the correct orientatio...

Page 66: ...ng the Single Column locating method you force Pro OCR to ignore columns and tell it to read the page from left to right top to bottom When Pro OCR is finished recognizing the page the Recognition Com...

Page 67: ...ys the document in the text view To save the document 1 Choose Save As from the File menu or click the Save As button in the Gallery toolbar The Save As dialog box appears file C VisioneerDoc html 02l...

Page 68: ...that you just saved in any spreadsheet application that supports the Microsoft Excel format Example 5 Scanning and Saving a Document with Pictures This example shows you how to scan a document with ph...

Page 69: ...page from the scanner When the scanning is done a dialog box appears asking if you want to scan additional pages For this example you won t be scanning any additional pages 5 Click End Automatic proce...

Page 70: ...oose Rich Text Format RTF from the Save as Type drop down list RTF allows you to save the pictures along with the text in the exported file NOTE As an alternative you can save in a format for an appli...

Page 71: ...mple template in this example is designed to create a text region around just the body text during the Locate step The title and copyright in the footer are not recognized saving time during recogniti...

Page 72: ...select the file SAMPLEB TIF and click the Get button The sample file is read in 8 Click the Locate button Notice that text boxes are drawn around just the body text on the page This is the text regio...

Page 73: ...place it in the correct orientation and to align it 2 Select Single Column from the Locate drop down list 3 Click the Get Page button Pro OCR begins getting the page from the scanner and displays your...

Page 74: ...down and to the right until the box following the pointer encloses the first column of the table The box should enclose the items from Gold through Cobalt TIP If you make a mistake select the region a...

Page 75: ...st defined becomes a numeric region To make a table from the selected regions 1 Choose Select All from the Edit menu Pro OCR selects all of the locate regions you defined 2 Choose Make Table from the...

Page 76: ...A message appears asking if you want to save the document 4 Choose Close from the File menu Close the document without saving it Copyright 1998 Visioneer Inc Reach us at www visioneer com file C Visi...

Page 77: ...image file that you want to use to get the page If you select a scanner you also need to select a few other options The following procedure tells you the basic steps to get a page For more detailed in...

Page 78: ...ro OCR may have trouble correctly locating paragraph boundaries Recognition may also be affected resulting in many illegible characters NOTE Processing with the Straighten Skewed Images option selecte...

Page 79: ...page images are read in from the scanner Pages are scanned according to the current page size orientation brightness and scanning settings selected in your scanner s software When you read in additio...

Page 80: ...displayed Go to the page if necessary and then use Get Page to insert the new page after it You can also use single step Get Page to replace a current page To get one page from a scanner using Get Pag...

Page 81: ...rinting Documents Using Auto OCR with Scanners This section tells you how to use Auto OCR with a flatbed scanner or Automatic Document Feed ADF scanner NOTE When scanning pages make sure that pages ar...

Page 82: ...nted correctly for your scanner and the page orientation you have selected in the Gallery 4 To scan more than one page choose Options from the Tools menu and then select the Enable Auto OCR Dialogs pr...

Page 83: ...d scanning is completed Pro OCR begins locating and then recognizing If the Enable Auto OCR Dialogs processing option is selected Pro OCR asks for additional pages to scan after it finishes reading in...

Page 84: ...OCR you need the Pro OCR ISIS upgrade For more information visit Visioneer s Web site at www Visioneer com To automatically process one or more pages with a scanner that has an ADF 1 Make sure you ha...

Page 85: ...for this job click End Scanning is completed Pro OCR displays the first page of the scanned stack in the image view Pro OCR then begins locating and recognizing To scan the second side of double side...

Page 86: ...resolution paint programs Pro OCR can read the following image file formats TIFF Uncompressed PackBits Group 3 Group 3 modified Group 4 PCX DCX Pro OCR can open black and white one bit single page or...

Page 87: ...e most popular scanners directly However if you don t have a scanner that Pro OCR supports directly you may still be able to use Pro OCR with the scanner application you do have Most scanner applicati...

Page 88: ...ead in Pro OCR treats the page as if it had scanned it Getting Fax modem Files Pro OCR can also open fax modem files if they have been saved in one of the supported input file formats Many fax modems...

Page 89: ...atically process from a file 1 Select Open File from the Get Page drop down list in the Gallery toolbar 2 Check the Locate and Recognize options to make sure they are set the way you want them 3 Click...

Page 90: ...each page If the Enable Auto OCR Dialogs processing option is selected when all pages have been read the Get Page dialog box is again displayed 6 To add pages from an additional file or files to the e...

Page 91: ...ever it also means that you have to click Finish to proceed with automatic processing after the Get Page step is done If instead you want the Auto OCR process to continue without interruption you can...

Page 92: ...enable the dialogs select Enable Auto OCR Dialogs To disable the dialog boxes deselect the option Copyright 1998 Visioneer Inc Reach us at www visioneer com file C VisioneerDoc html 03get htm 16 of 1...

Page 93: ...atically You save a document using Save or Save As from the File menu If you close or exit Pro OCR without saving a document a message prompts you to save the current document After you get a document...

Page 94: ...n represented here as XXX according to the document format you select in the Save as Type drop down list 2 Type a new file name if necessary 3 Choose a document format from the Save as Type drop down...

Page 95: ...ctures into the document when it is saved choose Embed in Export File from the Picture Format drop down list The embedding option is only available for the following document formats MS Word for Windo...

Page 96: ...ons dialog box has the following sets of options If page breaks should be inserted between each page If formatting should be preserved or completely discarded or if only certain formatting should be p...

Page 97: ...bout saving multiple documents see Saving Multiple Documents as Separate Files and see Saving Multiple Page Images as Separate Image Files later in this chapter NOTE For Split on Blank Pages to work p...

Page 98: ...eplace the existing document Click No to return to working with the document Click Yes to replace the document NOTE When you want to open a document in an image editing program save it in one of the i...

Page 99: ...page as BOOK001 XLS the next file as BOOK002 XLS and so on 6 Click OK To save multiple single page documents as separate files using the one page option 1 Process the pages as you usually would 2 Whe...

Page 100: ...e Options dialog box appears 4 Select the One Page Per File and click OK When you name the file choose a file name of up to five characters If the file name is longer Pro OCR truncates it to five char...

Page 101: ...o save settings using Save Settings As from the File menu and open those settings later when you need them To save settings 1 Choose Save Settings As from the File menu 2 Enter a file name 3 Select a...

Page 102: ...ed TIFF Group 3 TIFF PackBits TIFF Group 3 Modified PCX TIFF Group 4 Table 6 3 Standard Text File Formats Plain Text Formatted Text Text with Line Breaks Comma Delimited Text Tab Delimited Text Rich T...

Page 103: ...oon as any pages have been recognized you can save the document in all supported file formats NOTE If you save in the Pro OCR Text Only format you won t be able to use the On Screen Verifier during ed...

Page 104: ...rocessing on it using Process Deferred Jobs When you use Open to read in a file that you ve saved in the Pro OCR format or the Pro OCR Deferred format each page is retrieved with the Locate and Recogn...

Page 105: ...image editing programs can only support one image page per file For this reason the Save As command has an option that lets you save a multipage document as a sequence of single page TIFF files PCX f...

Page 106: ...tyle or font information Tab Delimited Text format Preserves text tabs and carriage returns No page formatting character style or font information is preserved When you output a recognized document in...

Page 107: ...zing When you save a document with the Save As dialog box the Save As Options dialog box lets you select a variety of format options Preserve All Formatting Pro OCR saves the current document with all...

Page 108: ...Saves the recognized text including all formatting character styling and font information Yes Pro OCR Deferred Saves the page image any locate regions that have been defined and any recognized text Y...

Page 109: ...er you open the recognized document in a word processor you can print the document There may be times however when you want to print the document directly from within Pro OCR To print a document from...

Page 110: ...Saving and Printing Documents 5 Click OK Copyright 1998 Visioneer Inc Reach us at www visioneer com file C VisioneerDoc html 06save htm 18 of 18 1 20 2003 4 21 18 PM...

Page 111: ...deleting them Kinds of Locate Regions Pro OCR processes three kinds of locate regions Text contains text including letters and numbers Numeric contains only numbers and certain symbols Picture contain...

Page 112: ...etter of the alphabet or a symbol other than one of the numeric symbols in a numeric region Pro OCR converts a letter or symbol to the number or special symbol that it most closely resembles For examp...

Page 113: ...a picture region but can save the image as a picture either embedded within a document file or as a separate image file A picture region is enclosed in a double box You can create Picture regions auto...

Page 114: ...orrections in the text mode You can also create a template to save the locate regions and apply the template automatically to one or more pages in one or more documents The section discusses Pro OCR s...

Page 115: ...ve side by side blocks of text that you want Pro OCR to read from left to right across the page When you use Single Columns Only Pro OCR always creates text regions that go from the left margin to the...

Page 116: ...Locate button in the Gallery toolbar 5 If locate regions are already defined for that page a dialog box appears that asks you if you want to discard previously defined locate regions on the page To p...

Page 117: ...by it and locate the current page with the modified locate regions You can use the modified locate regions on other pages by saving as the same template or with another template name Creating a Templa...

Page 118: ...Using a Template for Locating Regions later in this chapter Using a Template for Locating Regions Often you won t want to recognize all the information on a page Using a template lets you select speci...

Page 119: ...lery toolbar to manually locate and recognize information Order of Locate Regions When a page has more than one locate region Pro OCR automatically orders the locate regions When you manually define l...

Page 120: ...processed and output to a file It is easiest to understand why this is important by seeing what happens to text when it is output to a word processor that has limited support for complex page layouts...

Page 121: ...Start Finish Processing Process Deferred Jobs or single step Locate with the manual locating method Pro OCR automatically orders all locate regions If the assigned order does not correspond with the...

Page 122: ...Usually legal documents such as court papers contain case or document information at the top or top right of the page numbers along the left side of the page and a wide mixture of indented and center...

Page 123: ...ex document layouts while others provide only limited support Defining Locate Regions Manually For most pages you ll locate automatically as part of automatic processing Finish Processing Process Defe...

Page 124: ...other pages in this and other documents You can also locate additional pages of the document and then use Finish Processing or save the document in the Pro OCR Deferred format to be processed later u...

Page 125: ...existing locate region on the page that is last in the sequence to the top center of the new locate region you ve just created Tips When Creating Locate Regions The following tips may help you when c...

Page 126: ...the edges of a text or numeric region you ve defined manually are illegible go back to the image view and check to make sure that the text or numeric region does not cut off any of the edges of the t...

Page 127: ...s are only recognized if fully enclosed by at least one text or numeric region In this example the fifth line is located in the top text region and the sixth line is located in the bottom text region...

Page 128: ...e locate region When the pointer is over a locate region it is the standard arrow pointer 2 Click anywhere in the locate region When you select a locate region when other locate regions are selected t...

Page 129: ...a Locate Region later in this chapter Changing the Kind of a Locate Region You can only redefine a locate region in the image view You redefine a locate region to change it to a different type of loca...

Page 130: ...re more than two locate regions on the page the deleted locate region disappears and the order of the remaining locate regions remains the same Resizing a Locate Region You can only resize a locate re...

Page 131: ...ow pointer 2 Click in the locate region make sure you don t click on a sizing handle 3 Hold down the mouse button and drag the pointer into the locate region to which you want to relink and then relea...

Page 132: ...utton 2 Drag the pointer into locate region 2 then release the mouse button The arrow originally leading into locate region 2 disappears and a new arrow connects locate region 1 to locate region 2 Cop...

Page 133: ...ode View a summary of errors for a recognized document NOTE You can recognize text automatically by using Auto OCR or you can recognize text in a single step For more information about Auto OCR see th...

Page 134: ...options that tell Pro OCR how to recognize a document and display the results In the Options dialog box you can select Display options that Select the fonts with which you want your document displaye...

Page 135: ...threshold doesn t change how Pro OCR decides on or assigns the identity of a character Thus changing the suspect threshold doesn t change how many characters in the document Pro OCR is sure about but...

Page 136: ...critical and when most of the words in the document are likely to be found in the dictionaries Lenient suspect threshold Identifies only suspect characters of which it is very uncertain Typically Pro...

Page 137: ...and in the correct orientation for the scanner and the page orientation you ve selected If necessary make sure the Straighten Skewed Images processing option is selected Also make sure that the brigh...

Page 138: ...serif fonts are mapped to the sans serif font you specify and all monospaced fonts are mapped to the monospaced font you specify NOTE If you are not running Windows with TrueType we recommend that you...

Page 139: ...isplay pictures in text view select the Display Pictures checkbox To prevent pictures from appearing in text view deselect the Display Pictures checkbox 4 Click OK When you return to the document it a...

Page 140: ...dialog box appears that asks you if you want to discard previously recognized text on the page. To proceed with recognition, click Yes. The document window is switched to the image view and the page is...

Page 141: ...xt view. Use them to change between zoom levels. You cannot zoom in closer than the pixel for pixel level in the image view or 400% in the text view, or zoom out farther away than 25% in either view. When y...
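
The text-view zoom limits on page 141 amount to clamping a requested zoom level to a fixed range. The sketch below reads the limits of 400 and 25 as percentages, which is an assumption, and the function and constant names are hypothetical rather than anything Pro OCR exposes. The image-view upper limit (the pixel for pixel level) depends on the image itself and is not modeled.

```python
MIN_ZOOM_PERCENT = 25             # farthest zoom-out, per the limit quoted for either view
MAX_TEXT_VIEW_ZOOM_PERCENT = 400  # closest zoom-in quoted for the text view


def clamp_text_view_zoom(requested_percent):
    """Clamp a requested text-view zoom level to the 25-400 range."""
    return max(MIN_ZOOM_PERCENT, min(MAX_TEXT_VIEW_ZOOM_PERCENT, requested_percent))
```

A request outside the range is pulled back to the nearest limit, and a request inside the range is returned unchanged.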

Page 142: ...rward or backward to a specific page. To move forward or backward one page: If the document has more than one page, click the arrows to change pages. If you're on the first page, the previous page arrow is...

Page 143: ...keeps track of any characters that it couldn't recognize (illegible characters) and of characters that it wasn't certain it had recognized correctly (suspect characters), and highlights them. You can...

Page 144: ...ing options displayed. 2. Select one of these options: Whole Lines: Proofs the entire document one line at a time. Each time you choose Proof, the insertion point moves to the start of the next line and the...

Page 145: ...e installed in the Dictionaries directory. Proofing Numbers and Alphanumeric Words: Pro OCR searches for and selects each number or alphanumeric word as it is encountered. A number is a word consisting o...

Page 146: ...next marked suspect or illegible character, or the next specified word or character, is found and selected. The document is scrolled so that the character or word is in view. If the image of the page exi...

Page 147: ...essage asks if you want to continue from the beginning of the document. 5. Click OK to return to the beginning and check the rest of the document. NOTE: If you've displayed the page with, for example, the L...

Page 148: ...it a different line, repeat steps 1 and 2, or use the arrow keys to move to a new line. NOTE: When you've selected text manually, the On Screen Verifier is not automatically displayed. You can display it by...

Page 149: ...format (for example, in a word processor file format that supports text wrap), the text can reformat; line breaks might not be preserved and might be rewrapped. Carriage returns will be preserved when savin...

Page 150: ...ine is drawn. When a text line falls within the dotted outline, it is highlighted. 3. Release the mouse button. All text lines that were highlighted are selected. To deselect one or more text lines while ke...

Page 151: ...ove the selected text lines. A message appears asking you if you want to delete the text. 3. Click OK. All selected lines are removed from the page. The remaining lines do not close up. To copy one or more...

Page 152: ...step. This automatic spelling verification helps Pro OCR identify suspect characters in the scanned text. Pro OCR does this using its General dictionary and, if you choose one, a user dictionary. There are...

Page 153: ...ly add the word to the current user dictionary by choosing Add to User Dictionary from the Edit menu while the word remains selected. After you've added the word to your user dictionary, the next time P...

Page 154: ...nu. The Select User Dictionary dialog box appears. 2. Find the dictionary you want to open and select it. By default, the user dictionary is stored in the DICT folder. Only the dictionaries that Pro OCR rec...

Page 155: ...d Pro OCR selects the word. Make sure you have the Misspelled Words proofing option selected. OR Double-click a word to select it. 3. Choose Add to User Dictionary from the Tools menu. Pro OCR adds the wor...

Page 156: ...Setting Recognize Options and Proofing a Recognized Document. Copyright 1998 Visioneer Inc. Reach us at www.visioneer.com. file:///C|/VisioneerDoc/html/05recog.htm (24 of 24) 1/20/2003 4:21:21 PM...

Page 157: ...er as the Source Getting a Page Using a Scanner Using Auto OCR with Scanners Getting Pages from an Image File Selecting a File as the Source and Getting Pages Getting Files From Other Scanner Applicat...

Page 158: ...ictures Locating with a Template Order of Locate Regions Examples of Locating Documents Processing Resumes Processing Legal Documents Processing Faxed Documents About Columns Locate Regions and Output...

Page 159: ...ecting and Deselecting Locate Regions Changing the Kind of a Locate Region Deleting a Locate Region Resizing a Locate Region Reordering Locate Regions Glossary file C VisioneerDoc html toc4 htm 2 of 2...

Page 160: ...Symbol Selecting a Display Font Indicating Whether Pictures Appear During Text View Recognizing a Single Page Working with Recognized Pages in Text view Setting the Zoom Levels Selecting a Page to Di...

Page 161: ...Table of Contents Checking Spelling in a Document Adding Words to a User Dictionary Displaying a Summary of Recognized Errors Glossary file C VisioneerDoc html toc5 htm 2 of 2 1 20 2003 4 21 22 PM...

Page 162: ...ported Output File Formats Saving to Proprietary Pro OCR Formats Saving to Standard Image File Formats Saving to Generic Text File Formats Saving to Application Formats Format Suppression and Customiz...

Page 163: ...h Jobs The Advantages of Finish and Deferred Processing Guidelines for Using Finish Processing and Deferred Processing How it Works Setting Up and Processing Deferred Jobs Processing Deferred Jobs Bat...

Page 164: ...utomatically saved in a format that you select to a specified destination directory. The Advantages of Finish and Deferred Processing: When you use automatic processing (Auto OCR), you can efficiently and...

Page 165: ...cess individual pages as necessary and save the document in Pro OCR Deferred format. You can then complete processing automatically with Process Deferred Jobs. How it Works: When you select Finish Proces...

Page 166: ...Page drop-down list in the Gallery toolbar. You can create a deferred job either by scanning pages or by reading them in from a file. If your source is a scanner, don't forget to specify the appropriate...

Page 167: ...If Use Scanner is selected as the source, Pro OCR immediately starts to scan. 6. Scan the documents or, if you are getting a file, select a file in the Auto Get Page dialog box and then click the Get butt...

Page 168: ...by processing individual pages, or you can complete processing now by choosing Finish Processing from the Recognize menu, or later by choosing Process Deferred Jobs from the Recognize menu. Processi...

Page 169: ...t To select multiple files, click the Advanced button, choose a file, and click Add. Repeat this process until you select all files that you want to get, then click the Get button. Pro OCR reads the deferre...

Page 170: ...document is displayed at the last selected zoom level in the text view. 4. Click OK. You can now proof the document (press Tab) and edit it as needed. Batch Processing: Use Batch Process to convert all of t...

Page 171: ...nize options from the drop-down lists in the Gallery toolbar. 2. Select Batch Process from the Recognize menu. The Batch Process dialog box appears. 3. Choose a file type from the Source Information File T...

Page 172: ...ess TXT for Plain Text, Text with Line Breaks, Comma Delimited Text, Formatted Text, and Tab Delimited ASCII; SAM: Lotus Ami Pro; WK1: Lotus 1-2-3; XLS: Microsoft EXCEL; DOC: Microsoft Word; RTF: Rich Text Format; W...

Page 173: ...Processing Documents with Different Character Quality Converting Parts of a Page in a Multipage Document Changing the Gallery Options Using Get Page Again Using Locate Again Using Recognize Again Find...

Page 174: ...hen you're recognizing numeric text. Putting pages in the scanner correctly. Avoiding marks on a page. Fixing Broken and Touching Characters: Pro OCR is good at recognizing characters that are broken ligh...

Page 175: ...r touching characters, decrease the brightness just enough to compensate for the broken characters. Adjusting Brightness for Consistent Documents: For most documents, you'll find that using Auto OCR works...

Page 176: ...ou want to scan the page again with a different brightness setting, delete the page by choosing Delete Page from the Edit menu. 7. Increase or decrease the brightness setting in your scanner software. If...

Page 177: ...f the Gallery as determined in step 1 for the rest of the pages. 4. Choose Finish Processing from the Process menu. OR Save the document in the Pro OCR Deferred format. When you use either Finish Processi...

Page 178: ...cters that are distinct from the background. The background in a good image is light and not fuzzy or dotty. To process a document with pages that vary in character image quality too dark touching too l...

Page 179: ...y 8. Choose Finish Processing from the Process menu. Make sure you set the appropriate Locate and Recognize options. OR Save the document in the Pro OCR Deferred format. If you save in the Pro OCR Deferre...

Page 180: ...t in the Pro OCR Deferred format. If you save the document in the Pro OCR Deferred format, you can choose Process Deferred Jobs later to finish processing it. Using Locate and Recognize on the Document: T...

Page 181: ...OCR Deferred format. If you save the document in the Pro OCR Deferred format, you can use Process Deferred Jobs at a later time to finish processing it. When you do, make sure you set the controls in the...

Page 182: ...ou repeat this step. Using Locate Again: This may be necessary if you decide that a located page has incorrect locate regions on it, or if you change your mind about whether or not to locate picture regi...

Page 183: ...a monthly sales report from the XYZ Company, typed on a typewriter with a broken X that Pro OCR couldn't identify. While you're proofing the document using Proof with the Illegible Characters proofing o...

Page 184: ...The dialog box is displayed with the selected text. 6. Type the correct text in the Replace box. 7. Click the Replace then Find button. The current occurrence is replaced. 8. Continue clicking Replace then F...

Page 185: ...ways to fix this problem: You can adjust the paper so that the text is scanned in straight or is not skewed more than 2°. If text is straight on the page, make sure that the paper is put in the scanner st...

Page 186: ...g a photocopy for scanning before you mark it up. Using whiteout to remove any markings that don't overlap text. Be very careful, however, about using whiteout on text; you may make the text even more ille...

Page 187: ...age dialog box 2 Auto OCR from a file from a scanner with a flatbed with an ADF scanner auto orientation B Batch Process dialog box batch processing explanation selecting brightness adjusting broken c...

Page 188: ...discarding format when saving Display Options command 1 Display Options command 2 Display Options command 3 Display Pictures option E editing all lines applying styles copying deleting text deselecti...

Page 189: ...sheet standard text word processor File menu Process Deferred Job command 1 Process Deferred Job command 2 Save As command File Properties dialog box file getting multiple files from other scanner app...

Page 190: ...ple files 1 getting multiple files 2 one scanned page scanning additional pages setting options 1 setting options 2 setting options 3 single step operation using Auto OCR with files Get Page dialog bo...

Page 191: ...ic order of overlapping regions and skewed text overlapping text and pictures picture redefining reordering resizing resume example selecting and deselecting single or multiple columns tables text tex...

Page 192: ...og box Display options Process options 1 Process options 2 Proof options order of locate regions overlapping text and pictures P Page controls Status bar 1 Page controls Status bar 2 Page Image Rotati...

Page 193: ...d dialog box processing options 1 processing options 2 Proof command proofing combinations of characters and words misspelled words numbers and alphanumeric words punctuation and symbols selecting opt...

Page 194: ...ings multiple documents as separate files pictures pictures example to application formats to generic text file format to MS Word example to Pro OCR example to Pro OCR deferred format to Pro OCR forma...

Page 195: ...g source selecting file selecting scanner source controls Gallery 1 source controls Gallery 2 speed of recognition spellcheck Split Document options Save As Options Split on Blank Pages option Split D...

Page 196: ...e Text Region icon Text Region icon text view editing operations editing text editing within a line selecting Text View icon TIFF tips for locating toolbar tutorial scanning a document using a templat...

Page 197: ...ges zooming in and out view controls 1 view controls 2 Visioneer format W White Out Text option 1 White Out Text option 2 Wizard word processor exporting to unsupported saving to Z Zoom controls Statu...
