|
Forms are composed of fields and items, and special dependence relationships exist among the fields and items of a data form. Improving the efficiency of form processing requires not only good form segmentation and recognition but also an understanding of these dependence relationships. In this study, new approaches are proposed for segmenting blank data forms, recognizing form formats, and understanding the dependence relationships among fields and items. To segment a blank data form image, the fields are extracted by alternating vertical and horizontal projections, and a logical tree structure of the form is constructed. After segmentation, the form is encoded by traversing the tree structure: the form is transformed into an attributed string, which is proven to be a unique code for the form and can therefore be used to recognize forms by attributed string matching. For understanding a blank data form, various types of dependence relationships are identified, and methods for detecting them are proposed. The proposed form processing system also provides a window-based interface for these tasks. Experimental results demonstrating the feasibility of the proposed methods are included.
|
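The segmentation and encoding steps summarized above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a binary image stored as a list of rows (1 = ink, 0 = background), treats a row or column that is entirely ink within the current region as a ruling line found by projection, and alternates horizontal and vertical cuts to build the tree. The names `Node`, `separator_lines`, `split`, `segment`, and `encode`, and the bracketed string format, are hypothetical; the attributed string in the study carries attributes that are not reproduced here.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (top, left, bottom, right), bottom/right exclusive


@dataclass
class Node:
    box: Box
    cut: str = "F"  # "H": cut by horizontal lines, "V": by vertical lines, "F": leaf field
    children: List["Node"] = field(default_factory=list)


def separator_lines(img: List[List[int]], box: Box, axis: str) -> List[int]:
    """Rows (axis "H") or columns (axis "V") of the box that are entirely ink."""
    top, left, bottom, right = box
    if axis == "H":
        return [r for r in range(top, bottom)
                if all(img[r][c] for c in range(left, right))]
    return [c for c in range(left, right)
            if all(img[r][c] for r in range(top, bottom))]


def split(box: Box, lines: List[int], axis: str) -> List[Box]:
    """Sub-boxes that lie strictly between consecutive separator lines."""
    top, left, bottom, right = box
    lo, hi = (top, bottom) if axis == "H" else (left, right)
    spans, start = [], lo
    for pos in lines + [hi]:
        if pos > start:
            spans.append((start, pos))
        start = pos + 1
    if axis == "H":
        return [(a, left, b, right) for a, b in spans]
    return [(top, a, bottom, b) for a, b in spans]


def segment(img: List[List[int]], box: Box, axis: str = "H",
            tried_other: bool = False) -> Node:
    """Alternate horizontal and vertical projection cuts to build the form tree."""
    other = "V" if axis == "H" else "H"
    lines = separator_lines(img, box, axis)
    if not lines:
        # No ruling line in this direction: try the other one once, else it is a field.
        return Node(box) if tried_other else segment(img, box, other, True)
    parts = split(box, lines, axis)
    if not parts:  # region consists only of line pixels
        return Node(box)
    return Node(box, axis, [segment(img, p, other) for p in parts])


def encode(node: Node) -> str:
    """Preorder traversal of the tree into a bracketed string."""
    if not node.children:
        return "F"
    return node.cut + "(" + "".join(encode(c) for c in node.children) + ")"


# A 7x7 blank form: outer border, one horizontal divider, and a vertical
# divider that splits only the upper half into two fields.
form = [
    [1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 0, 0, 1, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1],
    [1, 0, 0, 0, 0, 0, 1],
    [1, 0, 0, 0, 0, 0, 1],
    [1, 1, 1, 1, 1, 1, 1],
]
tree = segment(form, (0, 0, 7, 7))
code = encode(tree)
print(code)  # H(V(FF)V(F))
```

In this simplified setting, two blank forms with the same layout produce the same string, so recognition reduces to comparing codes. Real scanned forms would need tolerance for noise, skew, and broken lines, which the all-ink test above does not provide.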