跳到主要內容

臺灣博碩士論文加值系統

(216.73.216.62) 您好!臺灣時間:2025/11/15 13:22
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:葉嘉霖
研究生(外文):Chia-Lin Yeh
論文名稱:印刷不良名片英文與數字之辨識
論文名稱(外文):Recognition of English Alphabets and Numerals in Ill-Printed Name Cards
指導教授:李錫堅李錫堅引用關係
指導教授(外文):Hsi-Jian Lee
學位類別:碩士
校院名稱:國立交通大學
系所名稱:資訊工程系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2000
畢業學年度:88
語文別:英文
論文頁數:61
中文關鍵詞:文字辨識名片辨識英數字辨識
外文關鍵詞:Character RecognitionRecognition of Engition Alphabets and Numerals
相關次數:
  • 被引用被引用:0
  • 點閱點閱:422
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
在這篇論文裡我們設計一種方法用來辨識名片上的英文與數字。此方法包括三個部分,即文字辨識核心、連字切割與破碎字合併模組和辨識結果修正模組。首先,我們將單行的二值化影像做水平校正、斜體字偵測、連通單元(connected components)抽取、適當的合併與去雜訊、斜體字推正與平滑化。將框住連通單元的矩形由左向右水平投影,找出最靠近上下兩側的最大投影量,決定一行文字中的四條基準線。抽取出來的連通單元,由一個統計式方法設計的辨識核心程式辨識。我們用三種判斷連字的特徵,即寬高比(aspect ratio),辨識結果的可靠度(recognition cost)及水平方向的筆畫變異度(horizontal crossing-count),以區別單一文字與連字。接著以兩階段的切字模組進行切字。第一階段以分析連字影像的垂直投影決定可能的切點,如果第一階段中的切點導致切字的失敗,第二階段再分析連字的上下輪廓決定其他可能的切點。我們試著將相鄰且靠得很近,同時又是較細的字元(aspect ratio小)或碎點(例如:逗號和句號)合併,分析合併後字元之辨識結果,將可能的破碎字(broken characters)辨識出來。在雙語言的辨識系統中,例如:中英文名片辨識系統,我們討論三種信心度(confidence values)用來決定連通單元是中文或英數字,再分別以不同的辨識核心辨識。最後,根據連通單元的垂直位置和字元高度、一行文字的四條基準線、前後文的資訊與字元影像結構分析,調整辨識結果,使得辨識核心容易混淆的字元也能辨識正確。
我們從300張英文名片上擷取約10,000個英數字影像,作為測試辨識核心的樣本,測試的結果,辨識率達到了99.47%;我們擷取479個相連文字,其中有92.90%被正確地切割出來; 另外,我們也擷取336行的單行文字,找四條基準線的正確率達到了98.51%。

In this thesis, we design a procedure for recognizing characters in name cards. The procedure consists of three main operations: character recognition kernel, character segmentation, and correction of recognition results. In the first phase, we get possible isolated characters by performing the pre-processing, which includes binarization, skewing angle detection and straighten, italicness detection, smoothing, and connected-components extraction. We project the bounding rectangles of connected-components from left to right on the horizontal direction. Four typographical lines of single text lines are determined by projection analysis. Then, the components are recognized by a statistical multi-font recognition kernel. To detect touched characters, we use three measurements include aspect ratio, recognition cost and horizontal crossing-count. In the character segmentation phase, we develop a two-stage character segmentation module to segment the touched characters. In the first stage, we find the breaking points by projection analysis. If all possible segmentations have bad recognition results in the first stage, we use outline analysis to find other breaking points in the second stage. In order to solve the problems of broken characters, the merging process are give in our system. We try to merge consecutive and close characters that are either thin characters (small aspect ratio) or fragments (such as "," and "."). We analysis the recognition results of merging characters to decide whether accept or not. In some dual-lingual systems, such as Chinese-English name card understanding systems, we propose three measurements of confidence values that determine which language connected-components belong to. Then, either the Chinese OCR kernel or the English OCR kernel will recognize connected-components. Finally, the geometric properties, contextual information and disambiguation by structural difference are used to correct the recognition results. By rejecting impossibilities, the correct classes may eventually be promoted to the first candidate. Moreover, characters that have the same shape in capital and lower cases are justified according to their geometric properties.
In our experiments, we extracted about 10,000 single characters and 479 touched characters from more than 300 English business name cards as test samples. Besides, we also collected 336 images of text lines to test the four typographical lines determination module. The recognition rate for single characters was 99.47%. 92.90% of touched characters were correctly segmented. The accuracy of determining four typographical lines was 98.51%.

ABSTRACT IN CHINESEi
ABSTRACT IN ENGLISHii
ACKNOWLEDGEMENTSiii
TABLE OF CONTENTSiv
LIST OF FIGURESviii
CHAPTER 1. INTRODUCTION1
1.1 Motivation1
1.2 Problem Definition2
1.2.1 Maintaining High Recognition Rate2
1.2.2 Touched Character Segmentation3
1.2.3 Broken Character Merging3
1.3 Survey of related Research5
1.4 System Description and Assumptions6
1.4.1 System Description7
1.4.2 Assumptions8
1.5 Thesis Organization9
CHAPTER 2. MULTI-FONT CHARACTER RECOGNITION10
2.1 Introduction10
2.2 Training of Recognition Kernel12
2.2.1 Samples used12
2.2.2 Crossing-count Features (CCFs)12
2.2.3 Contour-direction Features (CDFs)13
2.2.3 Extraction of Peripheral-background-area Features (PBAFs)14
CHAPTER 3. DIFFERENTIATIONS OF ENGLISH AND CHINESE CHARACTERS16
3.1 Introduction16
3.2 Dual-lingual systems16
3.3 Confidence Values17
3.3.1 Measurement 128
3.3.2 Measurement 219
3.3.3 Measurement 320
3.5.5 Discussions20
CHAPTER 4. CHARACTER SEGMENTATION22
4.1 Introduction22
4.2 Assumptions22
4.3 Touched Character Determination24
4.3.1 Width-to-height aspect ratio24
4.3.2 Recognition Cost (Distance)24
4.3.3 Horizontal-crossing-count (HCC)25
4.3.4 Discussion25
4.4 Touched Character Segmentation26
4.4.1 Vertical Projection26
4.4.2 The Ratio of The Second Difference of Vertical Projection27
4.4.3 Full Projection Analysis38
4.4.4 Outline analysis30
4.4.5 Two-Stage Characters Segmentation31
4.5 Merging Process34
CHAPTER 5. CORRECTION OF RECOGNITION RESULTS36
5.1 Introduction36
5.2 Confusion set36
5.3 Geometric Properties37
5.3.1 The Procedure of Correction of Recognition Results with Geometric Propeties
48
5.3.2 Determining the four typographical lines40
5.3.2 Discriminating "i" from "j"41
5.3.4 Drop-caption42
5.3.5 Mixed-item43
5.4 Contextual Information44
5.5 Disambiguation by structural difference45
5.5.1 Confusion sets discrimination45
5.5.2 Touched Characters Considerations46
CHAPTER 6. EXPERIMENTAL RESULTS AND ANALYSIS49
6.1 Introduction49
6.2 Mixed-font Character Recognition Rate49
6.3 Results of Touched Character Segmentation51
6.4 Accuracy of Determining Four Typographical Lines53
6.5 Results By Using Geometric Properties and Contextual Information55
CHAPTER 7. CONCLUSIONS AND FUTRUE WORK58
References60

[1]T. M. Cheng, Character Recognition of English Alphabets and Numerals, Master thesis, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1999.
[2]C. H. Tung, A Study of Hand-written Chinese Text Recognition, Ph.D. Dissertation, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1994.
[3]Y. H. Chiou, Recognition of Chinese Business Cards, Master thesis, Institute of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1996.
[4]S. Kahan and T. Pavlidis, "On the recognition of printed characters of any font and size," IEEE Trans. Pattern anal. Mach. Intell. PAM-9. 274-287, March, 1987.
[5]S. Tsujimoto and H. Asada, "Major Components of a Complete Text Reading System," Proc. IEEE, Vol. 80, No. 7, July 1992, pp.1133-1149.
[6]R. G. Casey and G. Nagy, "Recursive Segmentation and Classification of Composite Character Patterns," Proc. 6th Int. Conf. Pattern Recognition (Munich, Germany), 1982, pp.1023-1026.
[7]Y. H. Tseng, K. C. Kuo and H. J. Lee, "Speeding Up Chinese Character Recognition in an Automatic Document Reading System," Proc. Fourth Int. Conf. Document Analysis and Recognition, pp. 629-632, Ulm, Germany, 1997.
[8]S. H. Lee, Design of a Business Card Understanding System, Master thesis, Institute of Computer Science and Information Engineering, National Chiao-Tung university, Taiwan, R.O.C., 1998.
[9]W. Niblack, An Introduction to Digital Image Processing, pp.115-116, Prentice Hall, 1986.
[10]H. Fujisawa, Y. Nakano, and K. Kurino, "Segmentation Methods for Character Recognition: From Segmentation to Documeent Structure Analysis," Proc. IEEE, Vol. 80, No. 7, July 1992, pp.1079-1091,
[11]Y.O. Graham Leedham, "Segmentation and recogntion of handwritten pitman shorthand outlines using an interactive heuristic search," Pattern Recognition, Vol. 26, no. 3, pp.277-294, 1994.
[12]Y. Lu and M. Shridhar, "Character Segmentation in Handwritten Words - an Overview," Pattern Recognition, vol. 29, No. 1, pp. 77-96, 1996.
[13]G. Seni and E. Cohan, "External word segmentation of off-line handwritten text lines," Pattern Recogntion, Vol. 27, No. 1, pp.41-52, 1994.
[14]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addision-Wesley publishing company, 1993.
[15]S. S. You, P. C. Chang, N. J. Cheng and Y. J. Tsay, "Automatic Knowledge Acquisition for Chinese Archive Document," CVGIP 1995.
[16]C.C. Kuo, "Evaluation and Speeding Up of Statistical Chinese Character Recognition," Master Thesis, Department of Computer Science and Information Engineering, National Chiao Tung University, Taiwan, R.O.C., 1996
[17]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addison-Wesley, 1992.
[18]B. Aldefeld, S. E. Levinson, and T. G. Szymanski, "A minimum-distance search technique and its application to automatic directory assistance," Bell Syst. Tech. J., vol. 59, pp. 1343-1356, 1980.
[19]Y. O. Graham Leedham, "Segmentation and Recogntion of Handwritten pitman shorthand outlines using an interactive heuristic search," Pattern Recogntion, Vol. 26, no. 3, pp. 277-294,1994.
[20]J. Wang and J. Jean, "Segmentation of Merged Character by Neural Networks and Shortest Path," Pattern Recognition, Vol. 29, No. 5, pp.649-658, 1994.
[21]G. Seni and E. Cohan, "External word segmentation of off-line handwritten text lines," Pattern Recognition, Vol. 27, No. 1, pp.41-52, 1994
[22]Richard G. Casey and Eric Lecolinet, "A Survey of Methods and Strategies in Character Segmentation", IEEE Trans. Pattern Anal. Mach. Intell., vol. 18, no. 7, pp. 690-706, 1996
[23]C. Y. Suen, C. Nadal, R. Legault, T.A Mai, and L. Lam, "Computer recognition of Unconstrained Handwritten numerals," Proceeding of the IEEE, Vol. 80, no. 70, July 1996.
[24]F. H. Cheng and W. H. Hsu, "Research on Chinese OCR in Taiwan," International Hournal of Pattern Recognition and Artificial Intelligence, Vol. 5, nos. 1ž, pp. 139-164, June 1991.

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top