National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)

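Graduate student: Shen-Hsien Lin (林昇賢)
Thesis title: Web Query Interface Parsing with Structure and Visual Information (使用結構與視覺化資訊分析網頁查詢界面之研究)
Advisor: Jyh-Jong Tsay (蔡志忠)
Degree: Master's
Institution: National Chung Cheng University (國立中正大學)
Department: Graduate Institute of Computer Science and Information Engineering (資訊工程所)
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Year of publication: 2009
Academic year of graduation: 97 (ROC calendar)
Language: English
Number of pages: 42
Keywords (Chinese): 結構與視覺化資訊; 查詢界面分析
Keywords (English): structure and visual information; web query interface parsing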
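Abstract (translated from the Chinese):
In recent years, a growing number of e-commerce sites have offered query interfaces that let users search for the information they care about. To integrate these query interfaces, we must understand and parse them.
From our observation of a large number of Web sources, query interfaces appear to be composed of certain semantic modules, which represent the capabilities for accessing data in the back-end database.
Our goal is to extract these semantic modules from query interfaces.
In this thesis, we define a set of parsing rules that describe the relationships among semantic modules, and we apply these rules to parse query interfaces in different phases.
The parsing rules are built from visual information and analyze the spatial relationships between neighboring items on the interface.
Our experiments show that our approach can accurately parse query interfaces with differing characteristics.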
Recently, more and more e-commerce sites have begun to provide query interfaces that let users search for the information they want.
To integrate these interfaces, we must understand and parse them.
From observing a large number of sources, query interfaces appear to be built from certain semantic models, which
represent the capabilities for accessing data in the underlying database.
Our purpose is to extract the semantic models from an interface.
In this thesis, we define parsing rules that represent the relationships among semantic models, and we
apply these rules to parse query interfaces in different phases.
The parsing rules are built from visual information and analyze the spatial proximity of adjacent elements on the interface.
Our experiments show that our approach accurately parses Web query interfaces across heterogeneous sources.
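To make the idea of rules based on spatial proximity more concrete, here is a minimal Python sketch. It is not the thesis's rule set: the `Box` type, the `is_left_of` and `is_above` relations, the pixel thresholds, and the nearest-label pairing strategy are all assumptions chosen for illustration. The sketch assumes each interface element has already been reduced to a rendered bounding box, and it attaches every input field to the closest label that sits directly to its left or directly above it.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Box:
    """Bounding box of a rendered element (pixels, origin at top-left).

    Hypothetical type for this sketch; not taken from the thesis.
    """
    name: str
    kind: str  # "label" or "field"
    x: float
    y: float
    w: float
    h: float


def is_left_of(a: Box, b: Box, max_gap: float = 40.0) -> bool:
    """True if `a` ends left of `b` within `max_gap` and they share a row."""
    gap = b.x - (a.x + a.w)
    vertical_overlap = min(a.y + a.h, b.y + b.h) - max(a.y, b.y)
    return 0 <= gap <= max_gap and vertical_overlap > 0


def is_above(a: Box, b: Box, max_gap: float = 25.0) -> bool:
    """True if `a` ends above `b` within `max_gap` and they share a column."""
    gap = b.y - (a.y + a.h)
    horizontal_overlap = min(a.x + a.w, b.x + b.w) - max(a.x, b.x)
    return 0 <= gap <= max_gap and horizontal_overlap > 0


def pair_labels_with_fields(boxes: List[Box]) -> Dict[str, Optional[str]]:
    """Map each field name to its nearest left/above label, or None."""
    labels = [b for b in boxes if b.kind == "label"]
    fields = [b for b in boxes if b.kind == "field"]
    pairs: Dict[str, Optional[str]] = {}
    for field in fields:
        best_name: Optional[str] = None
        best_gap = float("inf")
        for label in labels:
            if is_left_of(label, field):
                gap = field.x - (label.x + label.w)
            elif is_above(label, field):
                gap = field.y - (label.y + label.h)
            else:
                continue  # neither spatial relation holds
            if gap < best_gap:
                best_name, best_gap = label.name, gap
        pairs[field.name] = best_name
    return pairs


# Made-up layout: two label/field rows, one stacked pair, one lone button.
boxes = [
    Box("Title", "label", 10, 10, 50, 20),
    Box("title_input", "field", 70, 10, 150, 20),
    Box("Author", "label", 10, 40, 50, 20),
    Box("author_input", "field", 70, 40, 150, 20),
    Box("Price range", "label", 10, 80, 100, 20),
    Box("price_select", "field", 10, 105, 120, 20),
    Box("submit", "field", 300, 200, 60, 20),
]
result = pair_labels_with_fields(boxes)
```

In this made-up layout, the two text inputs pair with the labels to their left, the select box pairs with the label stacked above it, and the lone button stays unpaired. A real parser would presumably need more relations and some way to resolve conflicts between competing rules, which this sketch leaves out.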
1 Introduction
1.1 Motivation
1.2 Related Work
1.3 Contribution
1.4 Organization
2 Term Definition
3 Data Preprocessing
3.1 Document Object Model (DOM)
3.2 Element Extraction with DOM
3.3 Element Division by Path
4 Web Query Interface Parsing
4.1 Relationships about Semantic Models
4.2 Visual Relations Description
4.3 Parsing Rules Definition
4.4 Interface Parsing Phases
5 Experimental Result
5.1 Metric
5.2 Experimental Discussion
6 Conclusion