臺灣博碩士論文加值系統

English |FB 專頁 |Mobile

免費會員登入| 註冊

功能切換導覽列

(216.73.216.124) 您好！臺灣時間：2026/06/04 22:22

字體大小：

:::

詳目顯示

第 1 筆 / 共 1 筆

/1頁

論文基本資料
摘要
外文摘要
目次
參考文獻
紙本論文
QR Code

本論文永久網址:

研究生:

詹前裕

研究生(外文):

Chian-Yu Chan

論文名稱:

以由下往上的方式作彩色文件分析與重排

論文名稱(外文):

A Bottom-Up Approach to Color Image Document Analysis and Rearrangement

指導教授:

蔡文祥

指導教授(外文):

Wen-Hsiang Tsai

學位類別:

碩士

校院名稱:

國立交通大學

系所名稱:

資訊科學系

學門:

工程學門

學類:

電資工程學類

論文種類:

學術論文

論文出版年:

1999

畢業學年度:

語文別:

英文

論文頁數:

中文關鍵詞:

文件分析、文件重排、彩色文件、區塊抽取、文字切割

外文關鍵詞:

documnet analysis、document rearrangement、color document、block extraction、character segmentation

相關次數:

被引用:0
點閱:353
評分:
下載:0
書目收藏:0

本論文提出一以下往上方式作文件分析與重排的方法。首先，提出用邊緣偵測及區塊生長演算法找基本區塊的方法。利用這種方法有幾個好處，第一可以避免彩色圖片經過印刷、掃描後的雜訊干擾。第二，可以快速地取得基本區塊。接下來利用一新提出的區塊辨識演算法，利用區塊的結構以及統計特徵，判斷這些區塊為文字或是圖像區塊，再將相同屬性的區塊合併成大的區塊。在合併文字區塊的過程中，我們可以找到文字區塊的文字走向以及文字列印方向，還可以利用文字大小的特徵將文字區塊作簡單的分析。在文件重排方面，我們根據先前的結果，找到文章區塊的每一行文字，利用二值化後水平與垂直投影的特性，將每行文字作切字處理，進而找出文字區塊中的文字閱讀順序，將文章重排到新的文件中。良好的實驗結果，證明了所提出的方法是可行而且實用的。

A bottom-up approach to image document analysis and rearrangement is proposed in this study. First, an edge detection algorithm and a region-growing algorithm are proposed to extract the basic blocks. Two advantages can be obtained by employing these algorithms. First, the distortion effect caused by printing or scanning can be avoided. Second, these techniques are faster than color-quantization techniques. After basic block extraction, several features are used to classify extracted blocks into text blocks and graphic blocks. Some improved methods for merging text blocks into text areas and graphic blocks into graphic areas are also proposed. In the stage of text block merging, the direction of article reading is obtained. Some information is used to analyze text areas. For article rearrangement, we can use the segmented result to obtain the text lines in the text area, then segment the characters by characteristics of vertical and horizontal projections. After the reading orders of the characters in the text area are analyzed, article rearrangement is performed. Good experimental results prove the feasibility and practicability of the proposed approaches.

Chapter 1 Introduction
1.1 Motivation
1.2 Survey of Related Studies
1.3 Brief Description of Proposed Approach
1.3.1 Definition of terminologies
1.3.2 Assumption
1.3.3 Overview of proposed approach
1.4 Thesis Organization
Chapter 2 Segmentation of Basic Blocks
2.1 Introduction
2.2 Segmentation of Basic Blocks
2.2.1 Edge detection operation
2.2.2 Modified Region growing algorithm
2.3 Experimental Results
Chapter 3 Basic Block Recogntion
3.1 Introduction
3.2 Features for Recognition of Basic Blocks
3.2.1 Size feature
3.2.2 Statistical features
3.3 Recognition of Basic Text and Graphic Blocks
Chapter 4 Text and Graphic Area Construction
4.1 Introduction
4.2 Merge of False Blocks
4.2.1 Merge of overlapping false blocks
4.2.2 Merge of neighboring false blocks
4.3 Merge of Text Line Blocks
4.4 Merge of Punctuation Marks
4.5 Merge of Text Blocks into Text Areas
4.6 Merge of Graphic Areas
4.7 Experimental Results
Chapter 5 Character Segmentation
5.1 Introduction
5.2 Text Line Segmentation in Text Blocks
5.2.1 Text line segmentation in vertical text blocks
5.2.2 Text line segmentation in horizontal text blocks
5.3 Character Segmentation in Text Line Blocks
5.3.1 Character segmentation in vertical text line blocks
5.3.2 Character segmentation in horizontal text line blocks
5.4 Experimental Results
Chapter 6 Rearrangement of Articles
6.1 Introduction
6.2 Rearrangement of Articles
6.2.1 Rearrangement for vertical direction
6.2.2 Rearrangement for horizontal direction
6.3 Experimental Results
Chapter 7 Experimental Results and Discussions
7.1 Experimental Results
7.2 Discussions
Chapter 8 Conclusions and Suggestions for Future Research
8.1 Conclusions
8.2 Suggestions for Future Works
References

[1] F.M Wahl, K. Y. Wong, and R. G. Casey. “Block Segmentation and Text Extraction in Mixed Text/Image Documents,” Computer Graphics and Image Processing 20, pp. 375-390, 1982.
[2] H.Wang, S.Z Li, and S. Ragupathi. “Document Segmentation and Classification with Top-Down Approach,” in 1997 1st Int. Conf. On Knowledge-Based Intelligent Electronic System, p243-247, 1997.
[3] D. Wang and S. N. Srihari, “ Classification of newspaper image blocks using texture analysis,“ Computer Vision, Graphics, and Image Processing, Vol. 47, pp 327-352, 1989.
[4] L.A Fletcher and R. Kasturi, “ An Robust Algorithm For Text String Separation from Mixed Text/Graphics Images,” IEEE trans. Pattern Analysis and Machine Intelligence, Vol. 10, pp 910-918, 1998.
[5] L. Cinque, L. Lombardi, and G. Manzini, “ A Multiresolution Approach for Page Segmentation,” Pattern Recognition Letters, pp. 217-225, 1998.
[6] C.L. Tan, amd P. O. NG, “ Text Extraction Using Pyramid”, Pattern Recognition, Vol. 31, No. 1, pp. 63-72, 1998.
[7] Y. H. Chuang and W.H. Tsai, ” Segmentation of Texts, Graphics, and Special Components for Color Document Image Analysis,” in Proc. Int. Conf. Computer Vision, Graphics, and Image Processing, Taoyuan, Taiwan, Republic of China, pp. 471-478, August 1995.
[8] Y. S. Lin and W. H. Tsai, “ Image Segmentation for Color Document Analysis,” in Proc. Int. Conf. Computer Vision, Graphics, and Image Processing, Taoyuan, Taiwan, Republic of China, pp. 471-478, August 1995.
[9] A. K. Jain and B. Yu, “Automatic Text Location in Images and Video Frames,“ Pattern Recognition, vol.31, no.12, pp.2055-76, Dec. 1998.
[10] H. M Suen and J. F. Wang, ”Preprocessing of Color-Printed Document Images for Automatic Character Recognition,” Ph.D. Dissertation, Department of Computer Science and Information Engineer, National Cheng Kung University, 1998.
[11] W. L. Hwang and F. Chang, “ Character Extraction From Documents Using Wavelet Maxima,” Image and Vision Computing, 16, pp. 307-315, 1998.
[12] J. H. Bae, L. C. Jung, J. W. Kim, H. J. Kim, “ Segmentation of Touching Character Using an MLP,” Pattern Recognition Letters, 19, pp. 701-709, 1998.
[13] Y. C. Yang and C. S. Fuh, “ Chinese Character Segmentation in Machine Printed Documents,” in 7th Optical Character Recognition and Document Analysis Workshop, November 1997, pp. 2.20-2.23.
[14] R. C. Gonzalez and R. E. Woods, Digital Image Processing, Addision Wesley, pp197-201, 1992.
[15] W. H. Tsai, “Moment-Preserving Thresholding: A New Approach,” Computer Vision, Graphics and Image Processing, Vol.29, No 3, pp. 377-393, 1985.
[16] K. C. Fan and L. S. Wang, “Document Segmentation and Classification,” Proc. of 1997 IPPR Conf. On CVGIP, Taichung, Taiwan, ROC, pp 273-283, 1997.
[17] M. Sawaki and N. Hagita, “ Text line Extraction and Character Recognition of Document Headline With Graphical Designs Using Complementary Similarity Measure,” IEEE trans. Pattern Analysis and Machine Intelligence, Vol. 20, No 10, 1998, pp 1103-1109.

國圖紙本論文

推文
網路書籤
推薦
評分
引用網址
轉寄

top

相關論文
相關期刊
熱門點閱論文

1.	表格公文處理系統之設計
2.	光學文件處理之究研

無相關期刊

1.	數位相機影像分析中的幾何校正、景物疊合及物體分離
2.	對手填文件影像之內容特徵作抽取,分類與應用
3.	用小波轉換作彩色影像中之資料隱藏及浮水印
4.	影像處理之網紋分析及其應用
5.	針對資料探勘技術的資料型態整合
6.	一個找尋異常群集的快速分群演算法
7.	物件導向知識庫管理系統
8.	合作式網路文件分級系統之研究
9.	以電腦視覺為基礎利用特徵分群法及形狀比對技術在室外環境中作自動車之導航
10.	應用序列對法之平面規劃
11.	特殊無線網路環境下之行動管理與路由方法
12.	適用於多階無線區域網路之MobileIP
13.	多重群播重疊問題之最佳化
14.	新一代網際網路規約下Mobile-IP
15.	利用位能場做三維物體之路徑規劃

簡易查詢 | 進階查詢 | 熱門排行 | 我的研究室