臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.62) 您好！臺灣時間：2025/11/15 13:22

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
紙本論文
QR Code

本論文永久網址:

研究生:

葉嘉霖

研究生(外文):

Chia-Lin Yeh

論文名稱:

印刷不良名片英文與數字之辨識

論文名稱(外文):

Recognition of English Alphabets and Numerals in Ill-Printed Name Cards

指導教授:

李錫堅

指導教授(外文):

Hsi-Jian Lee

學位類別:

碩士

校院名稱:

國立交通大學

系所名稱:

資訊工程系

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

2000

畢業學年度:

語文別:

英文

論文頁數:

中文關鍵詞:

文字辨識、名片辨識、英數字辨識

外文關鍵詞:

Character Recognition、Recognition of Engition Alphabets and Numerals

相關次數:

被引用:0
點閱:422
評分:
下載:0
書目收藏:0

在這篇論文裡我們設計一種方法用來辨識名片上的英文與數字。此方法包括三個部分，即文字辨識核心、連字切割與破碎字合併模組和辨識結果修正模組。首先，我們將單行的二值化影像做水平校正、斜體字偵測、連通單元(connected components)抽取、適當的合併與去雜訊、斜體字推正與平滑化。將框住連通單元的矩形由左向右水平投影，找出最靠近上下兩側的最大投影量，決定一行文字中的四條基準線。抽取出來的連通單元，由一個統計式方法設計的辨識核心程式辨識。我們用三種判斷連字的特徵，即寬高比(aspect ratio)，辨識結果的可靠度(recognition cost)及水平方向的筆畫變異度(horizontal crossing-count)，以區別單一文字與連字。接著以兩階段的切字模組進行切字。第一階段以分析連字影像的垂直投影決定可能的切點，如果第一階段中的切點導致切字的失敗，第二階段再分析連字的上下輪廓決定其他可能的切點。我們試著將相鄰且靠得很近,同時又是較細的字元(aspect ratio小)或碎點(例如:逗號和句號)合併，分析合併後字元之辨識結果，將可能的破碎字(broken characters)辨識出來。在雙語言的辨識系統中，例如:中英文名片辨識系統，我們討論三種信心度(confidence values)用來決定連通單元是中文或英數字，再分別以不同的辨識核心辨識。最後，根據連通單元的垂直位置和字元高度、一行文字的四條基準線、前後文的資訊與字元影像結構分析，調整辨識結果，使得辨識核心容易混淆的字元也能辨識正確。
我們從300張英文名片上擷取約10,000個英數字影像，作為測試辨識核心的樣本，測試的結果，辨識率達到了99.47%；我們擷取479個相連文字，其中有92.90%被正確地切割出來; 另外，我們也擷取336行的單行文字，找四條基準線的正確率達到了98.51%。

In this thesis, we design a procedure for recognizing characters in name cards. The procedure consists of three main operations: character recognition kernel, character segmentation, and correction of recognition results. In the first phase, we get possible isolated characters by performing the pre-processing, which includes binarization, skewing angle detection and straighten, italicness detection, smoothing, and connected-components extraction. We project the bounding rectangles of connected-components from left to right on the horizontal direction. Four typographical lines of single text lines are determined by projection analysis. Then, the components are recognized by a statistical multi-font recognition kernel. To detect touched characters, we use three measurements include aspect ratio, recognition cost and horizontal crossing-count. In the character segmentation phase, we develop a two-stage character segmentation module to segment the touched characters. In the first stage, we find the breaking points by projection analysis. If all possible segmentations have bad recognition results in the first stage, we use outline analysis to find other breaking points in the second stage. In order to solve the problems of broken characters, the merging process are give in our system. We try to merge consecutive and close characters that are either thin characters (small aspect ratio) or fragments (such as "," and "."). We analysis the recognition results of merging characters to decide whether accept or not. In some dual-lingual systems, such as Chinese-English name card understanding systems, we propose three measurements of confidence values that determine which language connected-components belong to. Then, either the Chinese OCR kernel or the English OCR kernel will recognize connected-components. Finally, the geometric properties, contextual information and disambiguation by structural difference are used to correct the recognition results. By rejecting impossibilities, the correct classes may eventually be promoted to the first candidate. Moreover, characters that have the same shape in capital and lower cases are justified according to their geometric properties.
In our experiments, we extracted about 10,000 single characters and 479 touched characters from more than 300 English business name cards as test samples. Besides, we also collected 336 images of text lines to test the four typographical lines determination module. The recognition rate for single characters was 99.47%. 92.90% of touched characters were correctly segmented. The accuracy of determining four typographical lines was 98.51%.

ABSTRACT IN CHINESEi
ABSTRACT IN ENGLISHii
ACKNOWLEDGEMENTSiii
TABLE OF CONTENTSiv
LIST OF FIGURESviii
CHAPTER 1. INTRODUCTION1
1.1 Motivation1
1.2 Problem Definition2
1.2.1 Maintaining High Recognition Rate2
1.2.2 Touched Character Segmentation3
1.2.3 Broken Character Merging3
1.3 Survey of related Research5
1.4 System Description and Assumptions6
1.4.1 System Description7
1.4.2 Assumptions8
1.5 Thesis Organization9
CHAPTER 2. MULTI-FONT CHARACTER RECOGNITION10
2.1 Introduction10
2.2 Training of Recognition Kernel12
2.2.1 Samples used12
2.2.2 Crossing-count Features (CCFs)12
2.2.3 Contour-direction Features (CDFs)13
2.2.3 Extraction of Peripheral-background-area Features (PBAFs)14
CHAPTER 3. DIFFERENTIATIONS OF ENGLISH AND CHINESE CHARACTERS16
3.1 Introduction16
3.2 Dual-lingual systems16
3.3 Confidence Values17
3.3.1 Measurement 128
3.3.2 Measurement 219
3.3.3 Measurement 320
3.5.5 Discussions20
CHAPTER 4. CHARACTER SEGMENTATION22
4.1 Introduction22
4.2 Assumptions22
4.3 Touched Character Determination24
4.3.1 Width-to-height aspect ratio24
4.3.2 Recognition Cost (Distance)24
4.3.3 Horizontal-crossing-count (HCC)25
4.3.4 Discussion25
4.4 Touched Character Segmentation26
4.4.1 Vertical Projection26
4.4.2 The Ratio of The Second Difference of Vertical Projection27
4.4.3 Full Projection Analysis38
4.4.4 Outline analysis30
4.4.5 Two-Stage Characters Segmentation31
4.5 Merging Process34
CHAPTER 5. CORRECTION OF RECOGNITION RESULTS36
5.1 Introduction36
5.2 Confusion set36
5.3 Geometric Properties37
5.3.1 The Procedure of Correction of Recognition Results with Geometric Propeties
48
5.3.2 Determining the four typographical lines40
5.3.2 Discriminating "i" from "j"41
5.3.4 Drop-caption42
5.3.5 Mixed-item43
5.4 Contextual Information44
5.5 Disambiguation by structural difference45
5.5.1 Confusion sets discrimination45
5.5.2 Touched Characters Considerations46
CHAPTER 6. EXPERIMENTAL RESULTS AND ANALYSIS49
6.1 Introduction49
6.2 Mixed-font Character Recognition Rate49
6.3 Results of Touched Character Segmentation51
6.4 Accuracy of Determining Four Typographical Lines53
6.5 Results By Using Geometric Properties and Contextual Information55
CHAPTER 7. CONCLUSIONS AND FUTRUE WORK58
References60

[1]T. M. Cheng, Character Recognition of English Alphabets and Numerals, Master thesis, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1999.
[2]C. H. Tung, A Study of Hand-written Chinese Text Recognition, Ph.D. Dissertation, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1994.
[3]Y. H. Chiou, Recognition of Chinese Business Cards, Master thesis, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1996.
[4]S. Kahan and T. Pavlidis, "On the recognition of printed characters of any font and size," IEEE Trans. Pattern anal. Mach. Intell. PAM-9. 274-287, March, 1987.
[5]S. Tsujimoto and H. Asada, "Major Components of a Complete Text Reading System," Proc. IEEE, Vol. 80, No. 7, July 1992, pp.1133-1149.
[6]R. G. Casey and G. Nagy, "Recursive Segmentation and Classification of Composite Character Patterns," Proc. 6th Int. Conf. Pattern Recognition (Munich, Germany), 1982, pp.1023-1026.
[7]Y. H. Tseng, K. C. Kuo and H. J. Lee, "Speeding Up Chinese Character Recognition in an Automatic Document Reading System," Proc. Fourth Int. Conf. Document Analysis and Recognition, pp. 629-632, Ulm, Germany, 1997.
[8]S. H. Lee, Design of a Business Card Understanding System, Master thesis, Institute of Computer Science and Information Engineering, National Chiao-Tung university, Taiwan, R.O.C., 1998.
[9]W. Niblack, An Introduction to Digital Image Processing, pp.115-116, Prentice Hall, 1986.
[10]H. Fujisawa, Y. Nakano, and K. Kurino, "Segmentation Methods for Character Recognition: From Segmentation to Documeent Structure Analysis," Proc. IEEE, Vol. 80, No. 7, July 1992, pp.1079-1091,
[11]Y.O. Graham Leedham, "Segmentation and recogntion of handwritten pitman shorthand outlines using an interactive heuristic search," Pattern Recognition, Vol. 26, no. 3, pp.277-294, 1994.
[12]Y. Lu and M. Shridhar, "Character Segmentation in Handwritten Words - an Overview," Pattern Recognition, vol. 29, No. 1, pp. 77-96, 1996.
[13]G. Seni and E. Cohan, "External word segmentation of off-line handwritten text lines," Pattern Recogntion, Vol. 27, No. 1, pp.41-52, 1994.
[14]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addision-Wesley publishing company, 1993.
[15]S. S. You, P. C. Chang, N. J. Cheng and Y. J. Tsay, "Automatic Knowledge Acquisition for Chinese Archive Document," CVGIP 1995.
[16]C.C. Kuo, "Evaluation and Speeding Up of Statistical Chinese Character Recognition," Master Thesis, Department of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1996
[17]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addison-Wesley, 1992.
[18]B. Aldefeld, S. E. Levinson, and T. G. Szymanski, "A minimum-distance search technique and its application to automatic directory assistance," Bell Syst. Tech. J., vol. 59, pp. 1343-1356, 1980.
[19]Y. O. Graham Leedham, "Segmentation and Recogntion of Handwritten pitman shorthand outlines using an interactive heuristic search," Pattern Recogntion, Vol. 26, no. 3, pp. 277-294,1994.
[20]J. Wang and J. Jean, "Segmentation of Merged Character by Neural Networks and Shortest Path," Pattern Recognition, Vol. 29, No. 5, pp.649-658, 1994.
[21]G. Seni and E. Cohan, "External word segmentation of off-line handwritten text lines," Pattern Recognition, Vol. 27, No. 1, pp.41-52, 1994
[22]Richard G. Casey and Eric Lecolinet, "A Survey of Methods and Strategies in Character Segmentation", IEEE Trans. Pattern Anal. Mach. Intell., vol. 18, no. 7, pp. 690-706, 1996
[23]C. Y. Suen, C. Nadal, R. Legault, T.A Mai, and L. Lam, "Computer recognition of Unconstrained Handwritten numerals," Proceeding of the IEEE, Vol. 80, no. 70, July 1996.
[24]F. H. Cheng and W. H. Hsu, "Research on Chinese OCR in Taiwan," International Hournal of Pattern Recognition and Artificial Intelligence, Vol. 5, nos. 1ž, pp. 139-164, June 1991.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

無相關論文

無相關期刊

1.	中文字帖書法字之描述
2.	現代網路搜尋引擎中轉置檔快取系統之設計
3.	演進支援用於以代理人為基礎之電子商務程序環境
4.	Web環境下需求規格再利用系統之設計
5.	高效能微處理機之全域訊號繞線與加入緩衝器
6.	短期憑證--一種新型態的金鑰認證中心
7.	多層分支預測之研究
8.	書法影像內中文字的二值化和非等方向性擴散
9.	應用LDAP目錄服務於短期簽證認證中心
10.	軌跡快取記憶體上之啟發式軌跡選擇機制
11.	中文報紙影像中之文章分析與文字切割
12.	MPEG-4音訊壓縮中TwinVQ之探討
13.	在超純量微處理器上的高頻寬指令抓取機制
14.	日光動態模擬的熱輻射法運算
15.	中文文件處理系統中使用之多核心辨識方法與簡化型語言模式

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室