background image

 11

Chapter 2

Introduction to

OmniPage Web

You probably have documents lying on your desk that you would like 
to share with the rest of your company, or, perhaps, the rest of the world. 
You could photocopy the information and mail it to anyone who might 
be interested, or you could retype it and hand-code it in HTML format. 
Neither of these is an appealing option. 

OmniPage Web offers a smart solution to increase your productivity 
and the visibility of your documents. OmniPage Web’s 

optical character 

recognition (OCR)

 technology accurately and easily converts scanned 

paper documents and image files into editable text. Once you have 
converted your paper documents into electronic ones, OmniPage Web 

outlines

 the document structure and creates a complete, dynamic Web 

site with separate Web pages for each chapter or section. OmniPage Web 
even creates hypertext links, navigation tools, and a hyperlinked table of 
contents. 

Within minutes you have an HTML file that can be published to the 
World Wide Web, or your company’s intranet, making your documents 
instantly available to anyone in the world.

Please continue reading this chapter for information on these topics:

• What Is Optical Character Recognition (OCR)?

• What Is Outlining?

• Basic Steps of Creating a Web Page

• The  OmniPage  Web  Desktop

Summary of Contents for OMNIPAGE WEB

Page 1: ...OmniPage Web User s Manual...

Page 2: ...you should know how to work in the Microsoft Windows environment Please refer to Windows documentation if you have questions about how to use menu commands dialog boxes scroll bars edit boxes and so o...

Page 3: ...on to OmniPage Web What Is Optical Character Recognition OCR 12 What Is Outlining 12 Basic Steps of Creating a Web Page 13 The OmniPage Web Desktop 14 AutoWeb Toolbar 16 Standard Toolbar 17 Zone Toolb...

Page 4: ...etting AutoWeb Toolbar Commands 44 AUTO Button Commands 45 Image Button Commands 46 Zone Button Commands 47 OCR Button Commands 48 Outline Button Commands 49 Export Button Commands 50 Selecting Option...

Page 5: ...ues 78 Scanner Drivers Supplied by the Manufacturer 78 Scanner Drivers Supplied by Caere 79 Scan Manager is Needed with OmniPage Web 79 Problems Connecting OmniPage Web to Your Scanner 80 Missing Scan...

Page 6: ...vi...

Page 7: ...been designed for quick and easy information retrieval Please see Getting Online Help on page 3 for more information OmniPage Web Readme File The OmniPage Web Readme file contains last minute informa...

Page 8: ...and so on The following conventions are used in this manual Convention Purpose Italicized text Emphasizes menu commands dialog box options labeled buttons and file names For example Choose Open in the...

Page 9: ...t a particular OmniPage Web command toolbar button or dialog box option in the following ways Click the Help button in the Standard toolbar and then click any toolbar button menu command or area of th...

Page 10: ...or common questions and answers updates patches and troubleshooting procedures choose Caere on the Web Product Support in the Help menu OmniPage Web Readme file Read the OmniPage Web Readme file for l...

Page 11: ...des information on installing and starting OmniPage Web Please continue reading for information on these topics Minimum System Requirements Installing OmniPage Web Setting Up Your Scanner with OmniPag...

Page 12: ...if you plan to scan documents Please see the Scanner Setup Notes for a list of tested scanners A Web browser to view your HTML documents You may also want to use an HTML editor to make changes to the...

Page 13: ...r scanner or another input device such as a digital camera so you can use it with OmniPage Web During installation Caere Scan Manager prompts you to select your scanner manufacturer and model or other...

Page 14: ...our Windows desktop OmniPage Web s desktop appears when you open OmniPage Web See The OmniPage Web Desktop on page 14 for an introduction to OmniPage Web s user interface The image view displays the c...

Page 15: ...reen OmniPage Web will decide on the best method of registration according to your country and computer system It may try using modem FTP or HTTP connections to transmit your registration information...

Page 16: ...10 Chapter 1...

Page 17: ...easily converts scanned paper documents and image files into editable text Once you have converted your paper documents into electronic ones OmniPage Web outlines the document structure and creates a...

Page 18: ...OCR you can convert the resulting text to HTML format using OmniPage Web s outlining feature What Is Outlining Outlining is the process of examining the structure of a document detecting original docu...

Page 19: ...erprets text characters in an image After OCR you can check and correct errors in the text using the OCR Proofreader See Performing OCR on a Document on page 28 for more information 4 Outline the orig...

Page 20: ...ars to perform various tasks on the document The thumbnail view displays a picture of each page in the document The image view displays the current page s original image The text view displays the cur...

Page 21: ...in image view and a preview of the HTML document in HTML view The outline view displays an outline of the original document objects The image view displays the current page s original image The HTML v...

Page 22: ...The Zone button allows you to automatically create zones on images based on their original page layouts or predefined templates The OCR button allows you to perform OCR and check OCR results The Outl...

Page 23: ...n a page image See Customizing Zones on page 67 for more information New Open Save Print Proofread OCR Copy Undo View Image Editor Options Rotate Image Straighten Image Zoom Help HTML Option Draw Rect...

Page 24: ...by promoting demoting changing or deleting objects Options Dialog Box You can select settings for processing in the Options dialog box To open it click the Options button or choose Options in the Too...

Page 25: ...settings for HTML components in the HTML Options dialog box To open it click the HTML Options button or choose HTML Options in the Tools menu See Chapter 4 OmniPage Web Settings for more information...

Page 26: ...20 Chapter 2...

Page 27: ...ally or you can start each step individually You can even do different tasks at the same time Please continue reading this chapter for information on these topics Ways to Process Documents Bringing Do...

Page 28: ...r more information 4 Outline the document to detect structural elements such as headings body text headers and footers and to link cross references e mail addresses and URLs to their destinations See...

Page 29: ...desired Image Zone OCR Outline and Export commands See Setting AutoWeb Toolbar Commands on page 44 for more information 3 Choose Options in the Tools menu and check that settings are appropriate for...

Page 30: ...b of the Options dialog box before loading or scanning a color or grayscale image You cannot change the resolution after the image has been added to OmniPage Web Scanning Pages You can scan paper docu...

Page 31: ...loaded image files are inserted as new pages The following procedure is for loading image files only To open an OmniPage Web Document wmt use the Open command in the File menu To load image files int...

Page 32: ...click Add to put it in the Selected Files list Click Add All to add all files from the current folder 6 Click Open when you have selected all the files you want to load Image files are loaded in the o...

Page 33: ...ifying zones deleting unwanted zones and using zone templates please see Customizing Zones on page 67 Creating Zones Automatically OmniPage Web can analyze a page and create zones automatically for yo...

Page 34: ...ten text However it can retain handwritten text such as a signature as a graphic To perform OCR 1 Choose Options in the Tools menu and click the Page Format tab 2 Select an Original Page Layout settin...

Page 35: ...nd a picture of how it originally looked in the image 2 Select one of these options for the word Click Ignore to allow the word to remain as is Click Ignore All to ignore all instances of the word in...

Page 36: ...has been loaded zoned recognized and proofread OmniPage Web then examines the document structure and creates an outline of the structural elements called objects Before you proceed to the outlining st...

Page 37: ...a preview of how the object will appear in the HTML document If your document is large you may want to filter which objects appear in the outline to make it easier to read and edit You can select whi...

Page 38: ...change the outline hierarchy 1 Highlight the object that you want to change in outline view 2 Click the appropriate button in the Outline toolbar to demote promote demote to body text change to heade...

Page 39: ...e components you want included in your Web page choose the location of the components on the page and set basic component settings 3 Click the Component Styles tab to select additional components and...

Page 40: ...line and HTML views allow you to look at and work with pages in the current document Once pages are recognized the image text and thumbnail views are visible Drag this splitter to the left or right to...

Page 41: ...e text or HTML view to enlarge or reduce the view To resize a page view 1 Click in the view you want to resize to make it active 2 Choose a size option in the Zoom drop down list in the Standard toolb...

Page 42: ...Before outlining the thumbnail view image view and text view all display the same page of a document After outlining the image and HTML views display the same section of the document that is currently...

Page 43: ...g Pages You can reorder pages in a document by dragging their thumbnails to different positions in the thumbnail view Hold down the Ctrl key while you click thumbnails if you want to select multiple t...

Page 44: ...r page deletions cannot be undone Printing a Document You can print the current document s original page images or recognized text To print a document 1 Choose Print Image in the File menu to print or...

Page 45: ...box appears 2 Select a folder location and file type for your document To use your document on the World Wide Web save it as an HTML file type Be sure to view your document on as many browsers as poss...

Page 46: ...saved with the file To save your document as you work Click the Save button in the Standard toolbar or choose Save in the File menu to save changes to the current document as you work The Save As dia...

Page 47: ...n a particular browser is to view your document in that browser Some component settings are only supported by the most recent versions of Web browsers If you selected Use style sheets in the Component...

Page 48: ...Testing Your HTML Document 42 Chapter 3...

Page 49: ...mniPage Web s online Help for more detailed information on settings The settings you select for processing documents can greatly affect HTML results You may have to experiment with different settings...

Page 50: ...tions in the Tools menu Click the Options button and select process commands in the Options dialog box The pictures in the AutoWeb toolbar buttons change as you set different process commands The comm...

Page 51: ...s drop down list contains the AutoWeb and Web Wizard commands AutoWeb Select AutoWeb to finish processing a new or open document according to the selected process commands See Automatic Processing on...

Page 52: ...e to load existing image files such as TIFF DCX BMP JPG or PCX files Scan Image Select Scan Image to scan paper documents in your scanner This command only appears in the drop down list if you have in...

Page 53: ...er zones on single column document images such as letters or memos Multiple Column Pages Select Multiple Column Pages to have OmniPage Web automatically draw and order zones on multiple column documen...

Page 54: ...mands Perform OCR Select Perform OCR to recognize text on document images During OCR OmniPage Web analyzes the image and identifies characters to produce editable text See Performing OCR on a Document...

Page 55: ...e and Defer Outlining commands Outline Select Outline to outline the recognized document structure During outlining OmniPage Web detects original objects such as headings body text headers and footers...

Page 56: ...as an OmniPage Web document wmt or an HTML file Save and Launch Select Save and Launch to automatically launch your Web browser or HTML editor whenever you save your HTML document You can change the...

Page 57: ...ples that follow However documents require different settings depending on their input attributes and your output goals To get the best results learn how to identify document characteristics and make...

Page 58: ...b to select settings that affect OCR accuracy Select the type of characters that are in your document Usually these settings should be selected for optimal accuracy The Language Analyst evaluates and...

Page 59: ...r you might need to have your scanner connected and turned on for the Scanner tab to appear Use these settings if your scanner has an automatic document feeder This is recommended for pages with color...

Page 60: ...tting of a page is handled during OCR and outlining Select a setting that best describes how your original page looks The resolution is the number of dots or pixels that make up an image A higher reso...

Page 61: ...eign language document it may be difficult for OmniPage Web to accurately determine the document structure and your outline results may be incorrect This is the character used in place of unknown char...

Page 62: ...nge the browser or editor that automatically launches when you select Save and Launch The Web Wizard will guide you through the HTML conversion process when you click the AUTO button on the AutoWeb to...

Page 63: ...TML Options Click the HTML Options button or choose HTML Options in the Tools menu to open the HTML Options dialog box This is the central location for HTML settings Click for a description of each se...

Page 64: ...not want your HTML documentformatted Select this to haveOmniPage Web create a link to the original page image in your HTML document Specifies what you want OmniPage Web to use as the title of your HT...

Page 65: ...tab to select which components you want included in your HTML document and where you want the components to appear on the final Web page Select the order in which you want the components to appear on...

Page 66: ...es tab to select formatting options for each component in your HTML document Select this for more formatting options if you know your visitors have browsers that support cascading style sheets Select...

Page 67: ...r describes how to use these features Please continue reading this chapter for information on these topics Making Your Web Page More Effective Using Themes Making Your Web Page More Effective Customiz...

Page 68: ...lude more than one HTML document on a page Include a navigation panel at the top and bottom of each page Provide author and contact information to allow visitors to follow up on anything they see or d...

Page 69: ...oad the text will be unreadable Include pictures to illustrate your text Add images Add image maps Add links to original images Remember that some of these interesting effects take longer to download...

Page 70: ...on of fun and professional themes for you to use or you can create and save one of your own To select a theme 1 Click the HTML Options button in the Standard toolbar or choose HTML Options in the Tool...

Page 71: ...g box and select one of the provided themes or begin selecting your own settings 2 Click Save Themes to open the Save Themes dialog box 3 Type in a file name for the new theme All the current settings...

Page 72: ...mage button to rotate the image 90 degrees clockwise at a time Or choose Rotate in the View menu and select 90 180 or 270 degrees To straighten a page image 1 Click on the page image to make the image...

Page 73: ...customize zones including Reordering Zones Modifying Zones Deleting Zones Changing Zone Properties For information on creating zones automatically please see Creating Zones for OCR on page 27 Zone to...

Page 74: ...you do not number all the zones they are automatically numbered for you when you start OCR Modifying Zones You can modify zones by moving resizing extending subtracting connecting or dividing them Pl...

Page 75: ...in the AutoOCR toolbar You will be prompted to replace the current zones To delete zones 1 Select the zone you want to delete by clicking inside the zone Shift click to select additional zones Choose...

Page 76: ...ne content setting This specifies the characters OmniPage Web looks for within a zone during OCR You can select Alphanumeric or Numeric as the zone content setting For example if a particular zone onl...

Page 77: ...ly encloses the irregular area 4 Select a zone content for the selected zones You can select a zone content setting for any zone type except Graphic 5 Click the Close button when you are done You can...

Page 78: ...click Edit to edit an existing user dictionary Click New to create a new user dictionary Enter a name in the dialog box that appears and click OK The User Dictionary dialog box appears 3 Add or delete...

Page 79: ...d scanners and any connection or software driver issues The Readme file contains last minute information relating to OmniPage Web To open these documents click Start in the Windows taskbar and choose...

Page 80: ...requirements listed under Minimum System Requirements on page 6 Make sure that your scanner is plugged in and that all cable connections are secure Turn off your computer and your scanner turn your sc...

Page 81: ...ng image file such as the Sample tif file If OmniPage Web does not launch or run properly in safe mode then there may be a problem with the installation Uninstall and reinstall OmniPage Web and then r...

Page 82: ...ory optimizes OCR performance See Minimum System Requirements on page 6 for more information Low Disk Space Problems Problems may occur if your system runs low on free disk space Try these solutions f...

Page 83: ...age image separately If you select Save all pages in the Save Image dialog box Page where is the four digit page number is appended to file names to distinguish separately saved pages If you select Sa...

Page 84: ...y Caere Problems Connecting OmniPage Web to Your Scanner Missing Scan Image Command Scanner Message on Launch System Crash Occurs While Scanning Scanner Drivers Supplied by the Manufacturer Many scann...

Page 85: ...Panel 2 Look for the Caere Scan Manager icon The icon does not appear if Caere Scan Manager is not installed Use the following procedure to install Caere Scan Manager if it has not been installed To...

Page 86: ...propriate scanner driver See the Scanner Setup Notes for more information Make sure your scanner is connected turned on compatible with your system and runs with the software provided by the manufactu...

Page 87: ...urn the scanner to its default state Then restart your computer Check your scanner setup See Scanner Setup Issues on page 78 for more information Check the Scanner Settings tab in the Caere Scan Manag...

Page 88: ...scan an original document instead of a photocopy If you are going to use FAX copies to OCR ask your FAX sender to send them to you using their machine s Best or Fine mode Make sure the page is properl...

Page 89: ...ge images lots of text and graphics or elaborate formatting into smaller jobs Draw zones manually or modify automatically created zones and perform OCR on one page area at a time See Customizing Zones...

Page 90: ...ts likely errors during OCR See Accuracy Settings on page 52 for more information Check the glass mirrors and lenses on your scanner for dust smudges or scratches Clean if necessary OmniPage Web only...

Page 91: ...1 Clearing zones 69 Closing documents 38 Colored text turning off color markers 29 Comparing text with images 30 Component Styles 33 Components 33 Conventions in this manual 2 Creating user dictionari...

Page 92: ...online help Installing OmniPage Web 6 Scan Manager 79 L Language Analyst using for poor quality documents 84 Language settings 54 Large Buttons 44 Load Image command 46 Loading image files 25 Low dis...

Page 93: ...Scanner drivers supplied by Caere 79 supplied by the manufacturer 78 Scanner settings 53 Scanner setup issues 78 changing the current scanner 79 installing the Scan Manager 79 missing Scan Image comma...

Page 94: ...Web browser 6 changing the default 56 saving and launching 40 testing your document on 41 Web page creating 13 making it more effective 62 testing 13 Web Wizard using 22 Windows NT memory requirement...

Reviews: