TextBridge Professional Edition User's Guide

To maintain the “what-you-see-is-what-you-get” characteristics of
the document, use a fixed-width font such as Courier. This format
is most useful for documents that you do not intend to edit, or for
tables and numeric data.

TextBridge Pro also includes a markup format called XDOC.
XDOC can be used for conversion to third-party formats.

SCANNER SUPPORT

TextBridge Pro supports virtually all popular desktop scanners
using the TWAIN device interface standard, Adobe Photoshop
Import Plug-ins, or ISIS scanner drivers. However, ScanSoft
does not provide any type of scanner driver with TextBridge Pro.
If your scanner does not come with a scanner driver, please
contact the scanner manufacturer.

TWAIN is a non-proprietary standard for acquiring data from a
scanner or modem. ScanSoft supplies the TWAIN source
manager, but not the TWAIN source for a particular scanner.

TextBridge Pro works with any TWAIN-compliant scanner that
connects to a Macintosh and produces binary (black-and-white)
images in a supported size and resolution.
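
To illustrate what "binary (black-and-white)" means for a page image, the following minimal Python sketch checks whether every pixel in a small grid is pure black or pure white. It is an illustration only: the function name `is_binary_image`, the list-of-rows image representation, and the 0 = black / 255 = white convention are assumptions made for this example and are not part of TextBridge Pro or the TWAIN standard.

```python
def is_binary_image(pixels, black=0, white=255):
    """Return True if every pixel is exactly black or exactly white.

    `pixels` is a list of rows, each row a list of integer gray levels.
    The 0/255 convention is an assumption made for this illustration.
    """
    return all(value in (black, white) for row in pixels for value in row)


# A small black-and-white fragment versus one that contains a gray pixel.
binary_page = [[0, 255, 255], [255, 0, 0]]
grayscale_page = [[0, 128, 255], [255, 0, 0]]

print(is_binary_image(binary_page))     # True
print(is_binary_image(grayscale_page))  # False
```

A scanner that delivers only grayscale or color data would fail a check of this kind, which is why the scanner driver should be set to a black-and-white scanning mode where it offers one.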

Many scanners come with an Adobe Photoshop Import Plug-in to
drive the scanner. TextBridge Pro works with any properly installed
Photoshop Import Plug-in.

TextBridge Pro also works with ISIS (Image and Scanner
Interface Standard) drivers from Pixel Translations Inc. However,
ScanSoft does not provide these drivers with TextBridge Pro.

Summary of Contents for TextBridge PRO 8.5

Page 1: ...User s Guide PRO TextBridge 8 5...

Page 2: ...r fitness for a particular purpose Some states or jurisdictions do not allow disclaimer of express or implied warranties in certain transactions therefore this statement may not apply to you ScanSoft...

Page 3: ...Pro 1 3 Other TextBridge Pro features 1 3 Documents TextBridge Pro can recognize 1 6 Supported Text Formats 1 7 Scanner Support 1 8 On Line Help for TextBridge Pro 1 9 Where to Go From Here 1 11 2 IN...

Page 4: ...nize menu 3 22 Scanner menu 3 28 Where to Go From Here 3 31 4 USING TEXTBRIDGE PRO Preparing the Job 4 1 Setting job preferences 4 3 Setting scanning preferences 4 9 Scanning and Converting a Document...

Page 5: ...om Here 5 20 6 TIPS AND TECHNIQUES Getting the Best Document Recognition 6 1 Use and maintain your scanner properly 6 2 Adjust scanner brightness 6 3 Adjust for colors 6 5 Use the fax filter 6 5 Proce...

Page 6: ...scanner driver problems A 26 Expected scan options are grayed out TWAIN or Adobe Photoshop Import Plug in A 26 Scan problems after Clicking GO A 27 Crash Problems A 29 Basic Troubleshooting for Crash...

Page 7: ...Objects C 5 application C 5 docFormatter C 6 imageSource C 7 Recognizer C 10 TBDictionary C 15 TBLanguage C 16 TBTrainingData C 17 TBZones C 18 Zone C 19 TextBridge Pro Commands C 20 CancelPage C 20 C...

Page 8: ...g on to find out more about TextBridge Pro please read this preface as it describes these important items About this manual Documentation conventions Other reading material Customer support ABOUT THIS...

Page 9: ...llation provides step by step instructions to install TextBridge Pro software and link it with your scanner or other input device This chapter also provides information about System Configuration and...

Page 10: ...lists the error messages that can be generated during TextBridge Pro operation and suggests ways for correcting the errors Appendix B Sample Documents describes the online sample documents that are pr...

Page 11: ...in a chapter also sometimes used to denote strong in line emphasis italic Denotes titles of other manuals or books Also used to denote generic representations of file name entries in examples for exam...

Page 12: ...xtBridge Pro please read the online ReadMe document which automatically appears in the TextBridge Pro Folder Simply double click the ReadMe icon to view important up to date information that is not in...

Page 13: ...d ways to correct them If you cannot resolve a problem on your own using the documentation and software refer to the following Web site www scansoft com The ScanSoft web site provides a link to TextBr...

Page 14: ...and graphics to a word processor spreadsheet or web browser format OCR can also recognize on line page images from fax modems scanners and other sources In addition to OCR TextBridge Pro offers advan...

Page 15: ...hat can pro duce a fully editable electronic document that retains the original document layout complete with text and pictures Figure 1 2 Original document Recomposed document in word processor Figur...

Page 16: ...e job progresses Document recomposition TextBridge Pro is the first and only desktop document recognition software product to offer true document recomposition When you specify output to Microsoft Wor...

Page 17: ...ge Pro is fully scriptable and recordable It not only responds to Apple events but also allows you to write your own scripts by recording the events as they occur Output text formats TextBridge Pro su...

Page 18: ...ts scientific terminology proper names acronyms and so on in ASCII files and load them into TextBridge Pro A custom dictionary aids in recognition of documents containing that terminology Two sided do...

Page 19: ...degraded or dirty documents documents with single or multiple column layouts documents containing halftone photos and color artwork on line single or multiple page images from fax modems and other so...

Page 20: ...ker See the documentation for your particular applica tion for more information about importing files in RTF format Two of these formats are text only Note This list is subject to change Refer to the...

Page 21: ...type of scanner driver with TextBridge Pro If your scanner does not come with a scanner driver please contact the scanner manufacturer TWAIN is a non proprietary standard for acquiring data from a sc...

Page 22: ...e Apple Guide on line Help system as well as Balloon Help While running TextBridge Pro you can access the TextBridge Pro Guide by selecting it from the Help menu On the TextBridge Pro Guide window cli...

Page 23: ...er right corner of the window Click the box again to expand the window Click the Huh button at the bottom of the Help window to see related instructions Click any text that appears in boldface within...

Page 24: ...tice with TextBridge Pro before applying it to your documents please see Chapter 5 which provides tutorials and sample documents After you have gained some experience with the program see Chapter 6 wh...

Page 25: ...CE TextBridge Pro operates under System 7 1 or higher It requires a Macintosh with a 68030 68040 PowerPC G3 or iMac CPU and at least twenty one megabytes 21Mb of disk space for full installation or fi...

Page 26: ...Pro works with TWAIN compliant devices that provide a binary black and white image in a supported size and resolution ScanSoft provides the TWAIN Source Manager and installs it as part of TextBridge P...

Page 27: ...rer Make sure your scanner runs independently of TextBridge Pro After the scanner is functioning install TextBridge Pro software INSTALLING TEXTBRIDGE PRO SOFTWARE After you have performed the scanner...

Page 28: ...protection software and remove any previous versions of TextBridge that are on your system Some virus checking software interrupts the installation process This may cause installation of TextBridge Pr...

Page 29: ...aining the TextBridge Pro files and the Installer icon 3 Double click the TextBridge Pro Installer icon The TextBridge Professional Splash Screen Figure 2 2 displays Click Continue to proceed with ins...

Page 30: ...gure 2 3 The Installer s Display of Online Release Notes 5 Read save or print the release notes for the latest information then press Continue 6 Choose an installation option You can install all TextB...

Page 31: ...Custom Install go to Step 8 7 Perform an Easy Install With Easy Install selected click on Install as shown in Figure 2 4 below Go directly to Step 9 Click Easy Install for a full installation Click In...

Page 32: ...Click OK to hide the information dialog box Figure 2 6 TextBridge Professional and an information dialog box displayed 9 Specify the location and name of the folder where you want to install TextBridg...

Page 33: ...or If you plan to use TextBridge Pro to process on line images only you can skip the next section and begin using TextBridge Pro See Chapter 5 of this manual for step by step procedures to use the Tex...

Page 34: ...formation Follow the onscreen instructions to register After registering your software the TextBridge Pro Main window will appear Figure 2 8 Note Unless you register your software there will be a remi...

Page 35: ...ISIS driver Click to complete selection Figure 2 9 Select Source dialog box 4 Select the type of scanner driver If you have installed the selected type of driver correctly it will appear in the list b...

Page 36: ...y selecting the scanner again 7 Begin using TextBridge Pro TextBridge Pro automatically selects Scanner as the input source using the driver selected in Step 5 Note When using an ISIS driver if the sc...

Page 37: ...er Screen refer to Figure 2 4 4 Select Uninstall from the installation menu Click Uninstall to remove TextBridge 5 Select the installation location 6 Click Uninstall to remove all TextBridge Pro files...

Page 38: ...references Source Manager Language packs Text conversions TextBridge Professional Sample AppleScripts System Folder Zone templates TextBridge Pro Folder Xerox fonts Sample Docs AppleGuide Help Fonts A...

Page 39: ...ds MAIN WINDOW The control center for TextBridge Pro operation is the main window With the exception of several dialog boxes all preparation and document recognition activity takes place in the main w...

Page 40: ...ences panel For your convenience a set of pop up menus called the preferences panel provides quick access to the preferences that you will most often change from job to job All items on the main toolb...

Page 41: ...button pops back out automatically State buttons in contrast stay in when you push them in The state or mode the button controls stays in effect until you click on it again to pull it back out When p...

Page 42: ...f operation states and to start continue or cancel part or all of the process commands Stop Processing Image sources States Commands Start Processing Save Page Images Defer OCR Cancel Current Page Inp...

Page 43: ...iew area in turn and a preview toolbar is added to the main window In preview mode you can zoom in on magnify pages and create text image and ignore zones to identify specific areas to capture If you...

Page 44: ...card it or continue processing The Go button starts processing and when you are working in preview mode continues processing At this time the Go button changes to a simple green arrow For many documen...

Page 45: ...in to full resolution hold down the option key while clicking If the image is zoomed in press the Zoom Out button to change the mouse pointer to the Zoom Out icon when you place it in the view window...

Page 46: ...pture from the displayed page image click and drag the mouse diagonally to create the image zone Release the mouse when you are done Use the Create Ignore Zone button to change the cursor to the ignor...

Page 47: ...n TextBridge Pro begins recognition it displays the training toolbar with the first suspect word in it Below in the view area TextBridge Pro magnifies and highlights the word image that corresponds to...

Page 48: ...l the sensitiv ity of the training process how frequently suspect words will be displayed for your input Some Words is the default If you want to achieve the highest level of recognition accuracy whil...

Page 49: ...re 3 5 Figure 3 5 Preferences panel Initially only the four most commonly used controls are displayed Click the preferences view bar below the preference pop up menus hold the mouse button down the cu...

Page 50: ...ing topics File menu Edit menu View menu Process menu Recognize menu Scanner menu File menu The File menu holds four commands Using these commands you can specify where you will get the image you will...

Page 51: ...command when it has a check mark next to it instructs TextBridge Pro to use on line image files as the source of pages to be recognized You identify the image files in the Image Queue dialog box that...

Page 52: ...te Group 3 and Group 4 are compression standards specified by the CCITT Consultative Committee of International Telephone and Telegraph an international standards organization When choosing an output...

Page 53: ...f you want to end the document discard it or continue processing Figure 3 8 Figure 3 8 Discard End or Continue dialog box Edit menu The Edit menu provides eight tools that are useful when you are work...

Page 54: ...election and stores it in the Clipboard Copy The Copy command enables you to copy text from a text box onto the Clipboard The Copy command is dimmed unless you are editing text Paste The Paste command...

Page 55: ...s active only when you are in preview mode and you have a zone selected This command moves the selected zone in front of all other zones in the view area This has the effect simply of processing any i...

Page 56: ...n the following subsections Zoom In Zoom Out Invert Deskew Enhance Display Zoom In The Zoom In command is active when you are in either preview or interactive training mode It magnifies the page image...

Page 57: ...skew the page automatically as part of preprocessing This feature does not effect the quality of the output Enhance Display The Enhance Display command is active when you are in either preview or inte...

Page 58: ...e Pro in preview mode It is the same as pressing the Preview button on the main toolbar You can activate the Preview command at the beginning of or during a job The first or next page to be processed...

Page 59: ...ncel Page The Cancel Page command is functionally equivalent to the Cancel Page button on the main toolbar The Cancel Page command is available when TextBridge Pro is currently processing a page or wh...

Page 60: ...in preview you can select Continue to start recognition of the page Recognize menu The Recognize menu provides commands that let you fine tune the document recognition process From the Recognize menu...

Page 61: ...xtBridge Pro how to compose the output document in your word processor This submenu is equivalent to the Output Layout pop up menu on the main window Refer to Chapter 6 for more information about the...

Page 62: ...n Language pop up menu on the main window Refer to Chapter 6 for more information about the Recognition Language settings and when to use them Custom Dictionary A custom dictionary is a text ASCII fil...

Page 63: ...the character shapes styles and sizes used in a particular document At the start of any later job you can load the training data to improve recognition of similar documents For example if you always s...

Page 64: ...lays the Save Zone Template dialog box Figure 3 9 Specify the name of the new template file Click to save Figure 3 9 Save Zone Template dialog box Here you can save the currently displayed zone set in...

Page 65: ...new training file Click to save Figure 3 10 Save Training Data dialog box The Save Training Data command is active only when you are in preview mode and you have accepted or corrected any suspect wor...

Page 66: ...nner preferences available in TextBridge Pro The following subsections describe in more detail the commands in the Scanner menu namely Select Source Brightness Page Size Resolution Sheet Feeder More S...

Page 67: ...complete the process Note Most TWAIN sources work best when displaying the TWAIN user interface However if you choose to do so or if you choose an Adobe Photoshop Import Plug in which always displays...

Page 68: ...tion pop up menu on the main window Refer to Chapter 6 for more information about the Resolution setting Sheet Feeder The Sheet Feeder command enables you to tell TextBridge Pro whether or not to auto...

Page 69: ...chapter you are ready to use the application for your own documents Chapter 4 Using TextBridge Pro provides step by step procedures for the many tasks you can perform with the program Chapter 5 Tutor...

Page 70: ...xtBridge Instant Access OCR PREPARING THE JOB TextBridge Pro is designed to be easy to use Often you can run document recognition successfully without changing default preferences or using any of the...

Page 71: ...w Initially only the four most commonly used controls are displayed Click the preferences view bar below the preference pop up menus hold the mouse button down and drag the bar down to show all settin...

Page 72: ...preadsheets Select Text and pictures One column for one column documents that contain straight text and pictures TextBridge Pro will perform a pre processing step to detect picture locations and preve...

Page 73: ...mple editable form and you want copies of the halftone photographs from your original document as well Note that TextBridge Pro outputs four bit grayscale versions of the original photos and places th...

Page 74: ...acters from these printers are made up of disconnected dots and could otherwise be difficult for an OCR program With this setting TextBridge Pro pre processes the image before performing OCR Note that...

Page 75: ...e that you may also want to select Automatic if you are recognizing on line image files and are not sure if the page image has the proper orientation in the file With auto orienta tion TextBridge Pro...

Page 76: ...can load a custom dictionary to improve recognition of a particular document The custom dictionary is loaded as soon as you begin OCR Note that you cannot load a custom dictionary once a job is in pr...

Page 77: ...act with and improve the OCR process TextBridge Pro compiles information about the character shapes styles and sizes found in the document being recognized This information is called training data You...

Page 78: ...ow light or dark scanned page images will be Use Normal Image for good quality office documents Use Lighter Image to provide a brighter page image to the TextBridge Pro recognition engine For example...

Page 79: ...st page your scanner can accommodate Note that some of the scanners supported by TextBridge Pro particularly those without a sheet feeder do not support greater than A4 page size Thus for these scanne...

Page 80: ...sheet feeder and will scan from there even if the sheet feeder option is off However TextBridge Pro will display the Add More Pages dialog box for every page unless the sheet feeder option is on If yo...

Page 81: ...ed document Pages of a single sided document are printed only on one side of the paper The reverse sides are blank and are not included in the page numbering Note The following procedure assumes that...

Page 82: ...Figure 4 2 Save dialog box 4 Specify the name location and format of the text output file and click Continue Note If you attempt to save the output text to a locked floppy disk an unnumbered error mes...

Page 83: ...e When scanning is completed TextBridge Pro displays the Add More Pages dialog box Figure 4 3 Figure 4 3 Add More Pages dialog box 5 Proceed to Step 6 to continue the job Go directly to Step 8 to end...

Page 84: ...rder in the output file Note The following procedure assumes that your scanner is properly installed powered on and ready and that the TextBridge Pro main application is active It also assumes that yo...

Page 85: ...ays the Add More Pages dialog box Figure 4 3 5 Turn the stack of pages over and insert the stack back into the scanner s automatic document feeder Pages should now be oriented so that last even number...

Page 86: ...Pro main application is active 1 Insert the page s to be processed into the scanner If you have a scanner with a sheet feeder you can load a stack of pages If you have a flatbed scanner place the fir...

Page 87: ...ck to begin scanning Figure 4 4 Save dialog box to save the image file 6 Define the base name location and format of the page image files to be saved Each scanned image uses the base name plus a three...

Page 88: ...scans the page s in the scanner If you are driving your scanner with a TWAIN source displaying the TWAIN user interface or with an Adobe Photoshop Import Plug in the TWAIN or Plug in user interface wi...

Page 89: ...files can originate from fax modems or other sources TextBridge Pro can process page images stored in PICT or most TIFF formats Page images must be binary black type on a white background and have res...

Page 90: ...ine image files If files contain fax quality 100 by 200 200 by 100 or 200 by 200 dots per inch images choose the Automatic or Fax setting in the Original Quality category Also if you are unsure of the...

Page 91: ...the area on the lower portion of the Image Queue dialog box To add the contents of a folder to the queue select the folder and click Add or click Add All The files in the folder will be added to the q...

Page 92: ...mplete you can go on to use the recognized text by editing the output file in your word processor or other text application PREVIEWING PAGES BEFORE PROCESSING To view or define specific areas of a pag...

Page 93: ...the process The Save dialog box is displayed Figure 4 2 5 Specify the name location and format of the text output file and click Continue If you are scanning TextBridge Pro automatically scans a page...

Page 94: ...Figure 4 6 Preview toolbar is added Page image is displayed Figure 4 6 Main window in preview mode Scroll bars in the view area let you shift the display horizontally and vertically 6 Zoom the page if...

Page 95: ...that you want to capture only part of the page Text is output in galley single column format However you can create image and ignore zones without affecting docu ment recomposition This feature is cal...

Page 96: ...ed or you are otherwise finished previewing the page click the Go button to start the recognition process To process all pages of the job to the current zones in place also click the Preview button on...

Page 97: ...are using a scanner it is properly connected to your Macintosh powered on and ready and that the TextBridge Pro main application is active To work in interactive training mode 1 If you are scanning l...

Page 98: ...ge s from one or more on line image files TextBridge Pro first displays the Image Queue dialog box Figure 4 5 In the Image Queue dialog box queue up the image files to be processed then click Continue...

Page 99: ...ed is restored to its original condition in the Word text box and you can correct the mistake You can control the frequency at which TextBridge Pro will display suspect words Simply pull down the Trai...

Page 100: ...8 End interactive training by clicking the Train OCR button in the main toolbar so that it is no longer pressed in The training toolbar disappears and TextBridge Pro continues OCR automatically for t...

Page 101: ...Menu Items folder TextBridge Instant Access OCR runs from any Macintosh word processing or other text application Instant Access OCR appears in the Apple menu as the Instant Access OCR command When y...

Page 102: ...the TextBridge Pro main window appears in Instant Access mode Figure 4 9 Indicates Instant Access mode Figure 4 9 Main window in Instant Access mode 3 Set up and initiate OCR from the main window exa...

Page 103: ...o clicking Stop stops processing as usual and leaves Instant Access mode as well Instant Access OCR uses the clipboard to copy and paste recog nized text into your application either as formatted RTF...

Page 104: ...his chapter you can run virtually all the capabilities of TextBridge Pro For more advanced information see Chapter 6 Tips and Techniques That chapter takes a closer look at ways to get the highest rec...

Page 105: ...tic operation capturing parts of a document preview interactive training Instant Access OCR running TextBridge Pro from within a text application document recomposition requires a word processor sprea...

Page 106: ...ion title bar is the main window which provides a main toolbar and a preferences panel These tools let you set up start and control the document recognition process Initially only one row of preferenc...

Page 107: ...e Pro s powerful document recognition tools SAMPLE DOCUMENTS For use with the tutorial sessions provided in this chapter five sample documents are located in the Sample Documents folder in the TextBri...

Page 108: ...ick the Go button TextBridge Pro now displays the Image Queue dialog box Figure 5 3 Double click a file on the list or highlight a file and click Add After you select the files click Proceed Files you...

Page 109: ...ESSION 1 AUTOMATIC OPERATION TextBridge Pro provides a range of features designed to be very easy to use For most documents you can use default settings and simply press the Go button to start the doc...

Page 110: ...n the Save Output As text box type a file name In the Text pop up menu select the output format for your word processor spreadsheet or web browser application Click Proceed to start processing TextBri...

Page 111: ...lly editable text TUTORIAL SESSION 2 CAPTURING PARTS OF A DOCUMENT TextBridge Professional Edition also enables you to capture selected parts text and graphics of a document For this purpose TextBridg...

Page 112: ...zonepic then click Proceed TextBridge Pro displays the Save dialog box Figure 5 4 5 Define the output text file then click Continue in the Save dialog box TextBridge Pro reads the on line image and in...

Page 113: ...a text zone Select the Text Zone tool Position the mouse inside the view area at the upper left corner of the page image Holding down the mouse button drag the mouse diagonally downward until the tex...

Page 114: ...ure 5 6 Text zone on previewed page 9 Now zoom in on the page Select the Zoom In tool Click once on the line art at the bottom right of the page image to magnify the area 10 Create an image zone Click...

Page 115: ...Figure 5 7 Image zone on the previewed page 11 Click the Go button again to process the zoned text and image When processing is complete TextBridge Pro converts and saves the recognized data and retu...

Page 116: ...t TextBridge Pro s recognition decisions for a page or two you also train the program to improve its own accuracy rate for later pages of the document In addition you can save and later reload trainin...

Page 117: ...Continue in the Save dialog box After beginning recognition TextBridge Pro adds the training toolbar to the main window When it finds the first suspect word it displays the suspect word in the Word te...

Page 118: ...t is recommended that you complete at least one full page of a multi page document to sufficiently train TextBridge Pro about the character shapes and sizes for that document For the purposes of this...

Page 119: ...pplication s open document This capability is referred to as Instant Access OCR When you install TextBridge Pro documented in Chapter 2 the Installer automatically places a copy of the Instant Access...

Page 120: ...roceed TextBridge Pro reads the on line image and automatically performs OCR on it The program converts the recognized text to two formats RTF and ASCII and copies it to the clipboard It then automati...

Page 121: ...hat includes a picture Assuming your text application supports these elements TextBridge Pro not only can recognize the text it can correctly recompose the column layout and output a copy of the pictu...

Page 122: ...ith your word processor Display the document so that its full layout is shown For example in Word you must select the Page Layout command from the View menu Notice that the document is composed in thr...

Page 123: ...t appropriately Smart Zones can improve recomposition in cases where pictures in a document include text or some element that TextBridge Pro mistakes for text Smart Zones aids the recomposition proces...

Page 124: ...tutorial sessions in this chapter were designed to give you a solid basis on which to use TextBridge Pro for your own documents For complete information about TextBridge Pro please refer to the User...

Page 125: ...o achieves a consistently high level of character recognition accuracy over a wide range of documents However there are some actions you can take to help the program do the best possible recognition o...

Page 126: ...ks that might be captured during scanning Load the scanner correctly Make sure your document is not scanned at an angle This can make character recognition more difficult When using the document feede...

Page 127: ...htness Type on dark background increase brightness Figure 6 2 Document originals and brightness Darkness of text the lightness of the background and the amount of noise dirt smeared ink fingerprints h...

Page 128: ...or very thin If your scanner supports the Auto brightness setting select this to achieve the best level of brightness for each page of a document To adjust Brightness in fine increments click Manual a...

Page 129: ...drop out color examine the color of the scanner light as is moves across the flatbed The color of the light determines the drop out color Many scanners have a light green scanner light for example th...

Page 130: ...er when the image is less than 225 dpi Note Do not use the Fax filter on non fax documents either scanned or on line If you do OCR accuracy can degrade Also if you notice that recognition is poor on s...

Page 131: ...totally different document with different typefaces and point sizes the knowledge that TextBridge Pro gained for the first page becomes invalid TextBridge Pro must begin the learning process over aga...

Page 132: ...be proper names professional or technical terms acronyms and so on Before you process a document with TextBridge Pro you can load the custom dictionary listing special terms contained in the document...

Page 133: ...does not exceed 10 000 words Use interactive training tools If you find that TextBridge Pro is giving less than satisfactory character recognition results on a particular document you can improve rec...

Page 134: ...ltiple page document train the program on one or two pages End training by clicking the Train OCR icon on the main toolbar so it is no longer pressed in TextBridge Pro will use your input to make bett...

Page 135: ...king in preview mode you can use the Image Zone tool to identify the graphic areas on page images TextBridge Pro will then ignore any image zones for recognition purposes If you specified one of the O...

Page 136: ...fine your document zone to capture only the data you want save and load zone templates save and load training data use deferred processing for long documents Using these features you can assure that T...

Page 137: ...Input layout The Input layout setting controls how much analysis of the page image layout TextBridge Pro is to perform The Text One column setting is appropriate for simple documents such as text only...

Page 138: ...etting The Recompose Text setting outputs text in its original column layout If TextBridge Pro detects any halftones or manually zoned line art in the original document it outputs empty frames in plac...

Page 139: ...requires some additional processing time but is more efficient than the Automatic setting Use the Dot matrix setting if you know your document originated from a draft quality dot matrix printer This...

Page 140: ...With the Automatic setting however TextBridge Pro runs an analysis on the page image s to determine their orientation then rotates them in memory if necessary before beginning OCR Figure 6 9 Portrait...

Page 141: ...nition process With the zoning tools in TextBridge Pro s preview mode you can specify which areas to ignore or you can specify only the text and images that you want to capture Figure 6 10 Note Creati...

Page 142: ...s avoiding having to re create the zones After creating a set of zones click the Save Zone Template button on the preview toolbar This displays the Save Zone Template dialog box where you can specify...

Page 143: ...ew mode refer to Chapter 4 Save and load training data At the end of a job during which you interactively trained TextBridge Pro the program displays the Save Training dialog box Figure 6 13 Specify t...

Page 144: ...n is a two phase process capturing the page images and recognizing the data from the images The amount of time each phase takes can vary depending on your system configuration and whether you are usin...

Page 145: ...gure 6 15 and attend to other business or go home while TextBridge Pro performs recognition For complete information about deferred processing refer to Scanning pages for deferred processing and Recog...

Page 146: ...extBridge Pro first consult this appendix to try to resolve the problem yourself TextBridge Pro error messages appear in a standard Macintosh alert box as shown in Figure A 1 Click OK then correct the...

Page 147: ...o generate the message This information will be useful later if you cannot solve the problem and must contact us If you get an error message that you cannot locate in this appendix and or you cannot r...

Page 148: ...free telephone number when you register your software The disk is locked You are attempting to save a file to a locked floppy disk Eject the disk and unlock it then try again 34 The disk is full You a...

Page 149: ...try to replace a document that is currently open in your word processor Check the file name and folder and try again or check to see if another user or application is working with the same file 48 A f...

Page 150: ...extBridge CD ROM 1701 A required parameter is missing from the AppleScript command Refer to Appendix C AppleScript Interface of the User's Guide 1002 An unexpected MacOS error has occurred Quit TextBr...

Page 151: ...o with incompatible system software TextBridge requires System 7 1 or higher Contact your Apple dealer for information on upgrading your system software 1006 TextBridge Pro requires a Macintosh with a...

Page 152: ...transient memory allocation error occurred This message indicates an internal error where a managed area of memory a heap is being freed while some of that memory is still being used Restart TextBrid...

Page 153: ...Bridge Pro CD ROM 1022 TextBridge Pro could not find any language packs in the TextBridge Pouch Quit TextBridge Pro and install at least one language pack from the TextBridge Pro CD ROM The language p...

Page 154: ...nceled processing from the TextBridge Pro user interface Try running the script again 1032 Scanning has been canceled from the TWAIN source This message occurs if you press Cancel from the TWAIN sourc...

Page 155: ...resolution image without sufficient RAM available Try quitting all other applications and try again Or if possible reduce the image resolution You may need to install additional RAM on your system 103...

Page 156: ...lready 999 zones which is the limit Only 999 zones can be defined for each page 1054 TextBridge Pro cannot add a zone Please refer to Appendix A of the TextBridge Pro User's Guide for troubleshooting...

Page 157: ...ad is invalid or corrupted You have directed TextBridge Pro to read a damaged TIFF file Check the file and try again 1068 TextBridge Pro cannot add another image file The Image Queue is full This erro...

Page 158: ...white mode at a resolution between 72 and 600 dpi can be read TIFF Uncompressed Intel header TIFF CCITT 3 Intel header TIFF CCITT 4 Intel header TIFF Uncompressed Motorola header TIFF CCITT 3 Motorola...
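
The Intel and Motorola header variants listed above differ in the byte order declared in the first bytes of the TIFF file. As a minimal sketch (not part of TextBridge Pro), the check below reads that header in Python. The function names are hypothetical, and treating the 72 to 600 dpi range as inclusive is an assumption.

```python
import struct


def tiff_byte_order(header: bytes) -> str:
    """Return 'Intel' or 'Motorola' for the first 4 bytes of a TIFF file.

    Per the TIFF format, 'II' marks little-endian (Intel) data and 'MM'
    marks big-endian (Motorola) data; both are followed by the number 42.
    """
    if len(header) < 4:
        raise ValueError("need at least 4 header bytes")
    mark = header[:2]
    if mark == b"II":
        fmt, name = "<H", "Intel"
    elif mark == b"MM":
        fmt, name = ">H", "Motorola"
    else:
        raise ValueError("not a TIFF byte-order mark")
    (magic,) = struct.unpack(fmt, header[2:4])
    if magic != 42:
        raise ValueError("bad TIFF magic number")
    return name


def resolution_readable(dpi: int) -> bool:
    """Hypothetical helper: assumes the 72-600 dpi range is inclusive."""
    return 72 <= dpi <= 600
```

Passing the first four bytes of a file (for example `open(path, "rb").read(4)`) is enough to tell the two header types apart before handing the image to an OCR tool.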

Page 159: ...canner driver If you are using a TWAIN driver and if the error was a result of an attempt to open the Select Source dialog box Make sure the TWAIN folder is installed in the Preferences folder in the...

Page 160: ...attaching a scanner but TextBridge preferences had previously stored the information that a scanner was there Be sure to select Select Source from the TextBridge Scanner pull down menu and make sure t...

Page 161: ...mory than others to acquire an image 1083 The scanner driver is out of memory Quit TextBridge Pro and use the Get Info command to make sure that enough memory is allocated to TextBridge Pro Some scann...

Page 162: ...others to acquire an image 1089 The scanner returned an image that is not the correct size The scanner has transferred an image with a size that is different than expected Set the page size to a diff...

Page 163: ...e selected TWAIN source could not be enabled Refer to error 1093 1101 The selected TWAIN source could not be found TextBridge Pro cannot find the selected TWAIN data source Make sure that the TWAIN fo...

Page 164: ...on of 200 300 or 400 DPI The selected TWAIN source is configured to transfer an image with an invalid resolution In the TWAIN dialog box make sure to specify a resolution of 200 300 or 400 dpi 1106 Th...
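
The resolution rule above can be illustrated with a small sketch. This is not TextBridge Pro code: the constant and function names are hypothetical, and only the three resolution values come from the guide.

```python
# Resolutions the guide says to specify in the TWAIN dialog box.
TWAIN_RESOLUTIONS_DPI = (200, 300, 400)


def is_supported_resolution(dpi: int) -> bool:
    """True if dpi is one of the resolutions named in the guide."""
    return dpi in TWAIN_RESOLUTIONS_DPI


def nearest_supported_resolution(dpi: int) -> int:
    """Hypothetical convenience: pick the closest supported resolution."""
    return min(TWAIN_RESOLUTIONS_DPI, key=lambda r: abs(r - dpi))
```

A scanning front end could use a check like this to warn before a scan starts, rather than waiting for the error message.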

Page 165: ...TextBridge Pro and install the language pack again from the TextBridge Pro CD ROM The selected language pack has been inadvertently damaged Quit TextBridge Pro and install the language pack from the...

Page 166: ...TextBridge CD ROM 1118 A file error occurred during conversion This error may indicate insufficient free disk space Delete some file to free disk space and try again If the problem persists quit Text...

Page 167: ...cannot read the CD ROM at all contact one of the following to get a replacement CD ROM If you purchased TextBridge Pro from an authorized ScanSoft reseller contact the reseller If TextBridge Pro came...

Page 168: ...sure the scanner and all other connected SCSI devices are turned on Turn the scanner power off then turn it on again Make sure no two SCSI devices have the same ID The following IDs are typically used...
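
The rule that no two SCSI devices may share an ID can be checked mechanically. The sketch below is illustrative only: the device names and ID values in the usage note are made up, and the function name is hypothetical.

```python
from collections import defaultdict
from typing import Dict, List


def find_scsi_id_conflicts(devices: Dict[str, int]) -> Dict[int, List[str]]:
    """Map each SCSI ID used by more than one device to those device names."""
    by_id = defaultdict(list)
    for name, scsi_id in devices.items():
        by_id[scsi_id].append(name)
    # Keep only IDs that are shared, with names sorted for stable output.
    return {i: sorted(names) for i, names in by_id.items() if len(names) > 1}
```

For example, `find_scsi_id_conflicts({"scanner": 5, "external disk": 5, "CD-ROM": 3})` reports that ID 5 is shared by the external disk and the scanner, so one of them needs a different ID.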

Page 169: ...he value in the Minimum Size field of the Memory Requirements section of the window Click on the close box in the upper left corner to close the window and save the changes Start TextBridge Pro Tools...

Page 170: ...g with TWAIN driver selected results in error message 1082 TextBridge Pro could not open the scanner See the explanation and possible solution for error message 1082 Attempt to set input from scanner...

Page 171: ...the Select Source dialog box or choose an Adobe Photoshop Import Plug in which always displays a user interface for scanning the scanner settings options on the TextBridge main window will be grayed...

Page 172: ...not with the SCSI cabling or hardware Scanner uses sheet feeder even when check box is not checked Power down scanner and disconnect ADF cable Manual Brightness setting gives incorrect result Try a di...

Page 173: ...Pro hangs or crashes when attempting to scan Refer to Basic Troubleshooting Steps for crashes in this chapter Doesn't scan but TextBridge Pro says it did Ofoto is installed Restart Mac and Scanner St...

Page 174: ...Minimum Size field of the Memory Requirements section of the window e Click on the close box in the upper left corner to close the window and save the changes f Start TextBridge Pro CRASH PROBLEMS Bas...

Page 175: ...v2 07 Just installed video board possible conflict with board SuperMac Thunder with the 1 601 ROM is known to cause TextBridge to crash Processing image files created by other applications TextBridge...

Page 176: ...E 1095 says there are many problems with early versions of this library Apple recommends version 1 2 for all users It is supposed to fix all known crashing bugs Install the update found at the Apple W...

Page 177: ...Dokey PaperPort PaperPort Extension Penworks Super Cache Not sure if you are using software with a conflict Restart with non system extensions by restarting the computer while holding down the shift...

Page 178: ...and rebooting the system Problems with TextBridge Pro or Instant Access OCR The Macintosh freezes randomly scanners are not recognized or applications do not retain their setup parameters Possible cor...

Page 179: ...gnostic software Some of these come bundled with the Macintosh others may come bundled with scanners or other SCSI devices Final Check for Unresolved Problem Check third party known problems from thir...

Page 180: ...r 2 the default installation folder is the TextBridge Pro Folder Within this folder are a number of others including the Sample Documents folder This folder contains eight sample documents stored in T...

Page 181: ...s a typical one column office document named markplan The document uses serif fonts in several different sizes and styles and includes bullet characters all of which TextBridge Pro can recognize and o...

Page 182: ...mn newsletter style document zonepic The document is designed to illustrate TextBridge Pro's manual zoning features with which you can identify specific areas text and graphics of pages to capture Fig...

Page 183: ...Session 3 in Chapter 5 uses a fax quality document named plexis The degraded image quality of fax documents is ideal to illustrate the interactive training feature of TextBridge Pro Figure B 3 shows a...

Page 184: ...sume The document is typical of the type of data you might like to pour directly into your word processor for immediate editing purposes It is designed to illustrate TextBridge Pro's Instant Access OC...

Page 185: ...n 4 in Chapter 5 uses a three column document with a picture in the middle column It is named 3col The document is designed to illustrate TextBridge Pro's powerful recomposition capabilities Figure B...

Page 186: ...essor or other applications TextBridge Pro is fully scriptable and recordable It not only responds to Apple events but also allows you to write your own scripts by recording the events as they occur T...

Page 187: ...tBridge Pro from the Finder type a script from scratch or modify the sample scripts provided with TextBridge The following subsections discuss recording a script editing recorded scripts and describe...

Page 188: ...n Load the scanner if necessary and then select preferences and click Go as you would normally As TextBridge Pro processing occurs the corresponding Apple events appear in the Script Editor window Fig...

Page 189: ...ate the TextBridge Pro AppleScript interface These files are installed with TextBridge Pro in the Sample AppleScripts folder Sample scripts are the following In Basket Watcher and In Basket Watcher sc...

Page 190: ...docFormatter imageSource recognizer TBDictionary TBLanguage TBTrainingData TBZones Zone application The application Properties None Element classes window by numeric index by name before after another...

Page 191: ...Get Set Example tell application TextBridge Professional activate set useFileInput of recognizer 1 to true docFormatter This is the conversion for text output from OCR to a document Specify by name I...

Page 192: ...by TextBridge Pro Properties name The name of the image source Only available for scanners Object class string Modifiable No IsScanner If true specifies that the image source is a scanner if false th...

Page 193: ...eetFeeder Indicates whether a sheet feeder is available Object class Boolean Modifiable No useSheetFeeder Indicates whether the scanner should take sheets from the sheet feeder if one is small integer...

Page 194: ...AppleScript Interface C 9 Element classes None Commands handled Get Set Example set resolution of imageSource HP Scan 2 to 200...

Page 195: ...usPreviewing statusPreprocessing statusDeskewingImage statusOrientingImage statusCheckingSource statusLoadingTraining statusLoadingLanguage statusRecognizing statusVerifying statusFormatting statusCr...

Page 196: ...nstant Access OCR has loaded the clipboard in the current or latest job Object class Boolean Modifiable No doImageOutput Instructs TextBridge Pro to write input images to files and defer OCR Object cl...

Page 197: ...e anticipated orientation of input images portrait landscape automaticOrient Object class Enumerated Modifiable Yes inputLayout Specifies the anticipated layout of input images oneColTextIn oneColText...

Page 198: ...the current or next job Object class docFormatter Modifiable Yes curLanguage The language of the document being recognized Object class TBLanguage Modifiable Yes curDictionary The custom dictionary t...

Page 199: ...display a dialog box asking if you want to save training data at end of job if training has taken place Object class Boolean Modifiable Yes verifyThreshold The confidence threshold for word verificat...

Page 200: ...y numeric index Commands handled Set Synchronize StartJob ContinueJob StopJob CancelPage SaveZones SaveTraining InvertImage DeskewImage Rescan Example Synchronize recognizer 1 desiredStatus statusPrev...

Page 201: ...nizer 1 to TBDictionary Lancet Dictionary TBLanguage The language of the document to be recognized Specify by name ID or index Properties name The name of the language Refer to the language pack names...

Page 202: ...Data A record of training data Specify by name or index Properties name The name of a training data file Refer to the training data file names in the TextBridge Pouch Object class string Modifiable No...

Page 203: ...pecify by name or index Properties name The name of the zone template Refer to the TextBridge Pouch for zone template names Object class string Modifiable No Element classes None Commands handled Get...

Page 204: ...tput to the document 1 first Object class integer Modifiable Yes zoneType The type of zone TextZone ImageZone IgnoreZone Object class Enumerated Modifiable No Element classes None Commands handled Get...

Page 205: ...ction describes the commands that the TextBridge Pro recognizer understands CancelPage Cancel recognition of current page but continue current OCR job Command syntax CancelPage recognizer Parameters N...

Page 206: ...during preview or training Command syntax ContinueJob recognizer Parameters None Result None Example ContinueJob recognizer 1 DeskewImage Deskew the current page image This command is only available d...

Page 207: ...s Guide Result None Example DeskewImage recognizer 1 InvertImage Invert the current page image This command is only available during preview Command syntax InvertImage recognizer Parameters None Resu...

Page 208: ...ommand syntax Rescan recognizer Parameters None Result None Example Rescan recognizer 1 SaveTraining Save current training data to file in TextBridge Pouch Command syntax SaveTraining recognizer fileN...

Page 209: ...Training recognizer 1 fileName Caboose SaveZones Save current zone list to zone template file in TextBridge Pouch Command syntax SaveZones recognizer fileName Parameters fileName Name of the new zone...

Page 210: ...Parameters None Result None Example StartJob recognizer 1 StopJob Stop the current job Command syntax StopJob recognizer discardData Boolean Parameters discardData Indicates whether to discard any pag...
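
The commands shown in the examples above (`activate`, `set useFileInput of recognizer 1 to true`, `StartJob recognizer 1`) can be combined into one script. As a rough sketch, the Python helper below only assembles the AppleScript text; it does not run it. The helper name is hypothetical, and the quoting of the application name and the closing `end tell` follow standard AppleScript syntax rather than anything visible in the truncated examples.

```python
def build_file_input_script(app_name: str = "TextBridge Professional",
                            recognizer_index: int = 1) -> str:
    """Return AppleScript text that starts a job using file input.

    Assumption: the commands may be issued together in one tell block;
    the guide's excerpts show them only in separate examples.
    """
    lines = [
        f'tell application "{app_name}"',
        "    activate",
        f"    set useFileInput of recognizer {recognizer_index} to true",
        f"    StartJob recognizer {recognizer_index}",
        "end tell",
    ]
    return "\n".join(lines)
```

The returned text could be pasted into the Script Editor and adjusted from there, which mirrors the guide's suggestion to modify sample scripts rather than write them from scratch.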

Page 211: ...ange statusInitializing statusReady statusGettingImage statusPreviewing statusPreprocessing statusDeskewingImage statusOrientingImage statusCheckingSource statusLoadingTraining statusLoadingLanguage...

Page 212: ...Pro works with Photoshop Import Plug ins as installed according to scanner manufacturer's instructions AppleScript Apple Computer's system software level scripting language for the Macintosh With App...

Page 213: ...em Folder which controls a device such as a printer or scanner To make a device available to an application you must first choose it in the Chooser CCITT The acronym for Consultative Committee for Int...

Page 214: ...processes it to another format the output format In TextBridge Pro recognized text in an internal format can be converted to say WordPerfect or any of a number of other supported formats The file tha...

Page 215: ...as more than one bit of data For example four bit grayscale data stores each pixel of an image as four bits of data This enables a finer representation of the original data than simple binary black a...

Page 216: ...it accepts format analyzed text a Xerox proprietary format as input to the conversion process Input layout In TextBridge Pro preferences a setting that informs the program about the number of text co...

Page 217: ...recognition task O object In AppleScript the part of an application that responds to events and which contains elements and properties Some TextBridge Pro examples are window zone and recognizer opti...

Page 218: ...nt flow across the more narrow dimension or width of the page preferences In TextBridge Pro the settings that you can specify to control the document recognition process preview In TextBridge Pro a mo...

Page 219: ...nguages are to be installed on your system and made available to TextBridge Pro for recognition purposes resolution The degree of detail measured in dots per inch dpi with which a scanner or fax machi...

Page 220: ...tBridge Pro scanner preferences a setting that informs the program about how light or dark the resulting image from a scanned page will be This is similar to the Brightness or Contrast setting on a ph...

Page 221: ...ations of binary black and white TIFF Motorola Header Intel Header TIFF Uncompressed TIFF CCITT 3 TIFF CCITT 4 TIFF Packbits TIFF Uncompressed TIFF CCITT 3 TIFF CCITT 4 Training An interactive capabil...

Page 222: ...reas of the page image zone template A set of zones created in preview mode and saved to a template file Later when processing the same type of document you can reload the template file to process the...

Page 223: ...9 3 24 3 29 Adobe Photoshop Import Plug ins 1 3 1 8 2 2 location of 2 11 Apple events CancelPage C 20 ContinueJob C 21 DeskewImage C 21 InvertImage C 22 Rescan C 23 SaveTraining C 23 SaveZones C 24 St...

Page 224: ...prove recognition accuracy 6 3 when to increase or decrease 6 4 Brightness command 3 29 Brightness auto 6 4 Button behavior 3 3 C Cancel Page button 3 6 21 Cancel Page command 3 21 Cell tables 5 19 Ch...

Page 225: ...ut Layout 3 23 Instant Access OCR 4 32 4 33 Invert 3 18 More 3 30 Move To Back 3 17 4 27 Move To Front 3 17 4 27 Original Quality 3 23 Output Layout 3 23 Page Orientation 3 24 Page Size 3 30 Paste 3 1...

Page 226: ...ons A 29 Create Ignore Zone button 3 8 Create Image Zone button 3 8 Create Text Zone button 3 8 Custom Dictionary creating and loading 6 8 specifying 4 7 Custom Dictionary command 3 24 Custom dictiona...

Page 227: ...ject C 6 Document double sided scanning 4 15 scanning and converting 4 12 single sided scanning 4 12 Document recomposition 1 3 5 17 displaying the recomposed document in your word processor 5 18 feat...

Page 228: ...WAIN or Adobe Photoshop Import Plug in A 26 F Fax documents 1 6 4 5 Fax image 6 6 4 21 synthesized 6 6 Fax modem 4 20 2 3 Fax setting 6 6 6 15 when not to use 6 6 File menu 3 12 Final Check for Unreso...

Page 229: ...g 4 17 Input From File button 3 5 3 13 4 21 4 24 4 28 5 4 5 5 5 7 5 12 5 16 5 17 Input From File command 3 13 Input From Scanner button 3 4 3 13 4 17 4 24 4 28 Input From Scanner command 3 13 Input La...

Page 230: ...ctive training 5 12 how much is enough 4 30 improving OCR accuracy with 6 9 level of specifying 4 30 noise 4 31 Undo command 4 30 Interactive training mode 3 18 4 28 5 14 specifying 4 28 turning off 4...

Page 231: ...2 Menus Edit 3 15 File 3 12 Process 3 20 Recognize 3 22 Scanner 3 28 View 3 18 Menus and commands 3 12 Microsoft Word RTF and other applications 1 7 More command 3 30 Move To Back command 3 17 4 27 M...

Page 232: ...tips for using 6 14 Output Layout command 3 23 Output layout setting 5 17 P Page image 5 13 specifying format of 4 19 zooming 4 24 Page images 3 13 4 17 5 3 naming files 4 18 origin of 4 20 performing...

Page 233: ...displaying 3 11 scanner settings 6 3 specifying 4 1 when to set 4 2 Preferences panel 3 2 3 11 5 2 5 7 5 12 5 17 expanding 4 2 expanding 5 2 Preferences view bar 3 11 4 2 5 2 Preview button 3 5 Previe...

Page 234: ...created by other applications A 30 Q Quit command 3 15 R ReadMe 1 7 ReadMe Support 3 2 Recognition Language command 3 24 Recognition language 3 11 Recognition language 6 8 Recognition language specif...

Page 235: ...ossible solutions A 23 Scanner 1 3 drop out color 6 5 loading pages in 6 2 maintenance and proper use 6 2 Scanner brightness 3 11 4 9 using to improve recognition accuracy 6 3 Scanner drivers Adobe Ph...

Page 236: ...ition 6 11 when to use 6 14 Starting TextBridge Pro 2 10 State buttons 3 3 Stop button 3 6 3 21 and Instant Access OCR 4 34 Stop command 3 21 Submenus Custom Dictionary 3 24 Input Layout 3 23 Original...

Page 237: ...Text Zone tool 5 9 Text zone 4 26 6 17 creating 5 9 TextBridge Instant Access OCR host application limitations 4 34 problems A 31 procedure for using 4 32 TextBridge Pro Apple events C 20 Apple Guide...

Page 238: ...processing 1 4 Instant Access OCR 1 3 4 32 integrating with other applications 1 4 C 1 interactive training mode 3 18 main toolbar 3 2 3 4 main window 1 1 3 1 4 25 menus and commands 3 12 pointsize r...

Page 239: ...3 text formats supported 1 7 text output formats 1 4 tips for efficient processing 6 12 toolbars 3 3 training data 1 5 training during OCR 4 28 training toolbar 3 9 TWAIN requirements for 1 8 two side...

Page 240: ...nt Access OCR 5 15 interactive training mode 5 14 main toolbar 5 2 memory requirements 2 2 preferences panel 5 2 preview mode 5 9 preview toolbar 5 2 5 3 program icon 5 1 running from within another a...

Page 241: ...l options 3 10 Training Level pop up menu 4 30 adjusting 6 10 Training toolbar buttons table of 3 9 Training toolbar 3 9 3 21 4 29 5 2 5 3 5 13 Troubleshooting A 1 Tutorials sample documents 5 3 B 1 T...

Page 242: ...eleting 4 27 moving 4 27 resizing 4 26 selecting 4 26 to capture parts of a document 6 17 Zone object C 19 Zone template 1 5 specifying 2 8 tips for using 6 18 Zone Template command 3 25 Zone template...

Page 243: ...Zoom page image in preview mode 4 25 Zoom In button 3 7 3 9 4 31 Zoom In command 3 18 Zoom In tool 5 10 Zoom Out button 3 7 4 31 Zoom Out command 3 18 Zoom Out tool 5 9 Zooming in and out on the page...
