Author

Jianjun Sa

Publication Date

1992

Document Type

Dissertation/Thesis

First Advisor

Bow, Sing-Tze, 1924-

Degree Name

M.S. (Master of Science)

Legacy Department

Department of Electrical Engineering

LCSH

Data compression (Computer science); Documentation

Abstract

A generalized computer-based automated documentation system that processes engineering documents is extremely desirable. Since document archiving is memory intensive, data compression algorithms are becoming increasingly important. A revolutionary technique that separates text from mixed text/graphic documents and succinctly describes graphics has been introduced. This thesis introduces two new main algorithms. The first focuses on the separation of text from mixed text/graphic documents (Chapter 3). It includes an Edge Expanding Search (EES) algorithm for finding character-shaped objects, a Neighborhood Checking (NC) algorithm for checking the neighborhood of each object, and a Touching Character Recognition (TCR) algorithm for identifying characters that touch a graphic. The second algorithm relates to the description of graphics (Chapter 4). The performance of these algorithms, in terms of both effectiveness and efficiency, is evaluated on fifteen mixed text/graphic engineering documents. The evaluation results clearly show the superior performance of these algorithms compared with the other techniques described in Chapter 2.

Comments

Includes bibliographical references (pages [75]-80)

Extent

x, 121 pages

Language

eng

Publisher

Northern Illinois University

Rights Statement

In Copyright

Rights Statement 2

NIU theses are protected by copyright. They may be viewed from Huskie Commons for any purpose, but reproduction or distribution in any format is prohibited without the written permission of the authors.

Media Type

Text
