跳到主要內容

臺灣博碩士論文加值系統

(18.97.14.80) 您好!臺灣時間:2024/12/04 04:46
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:莊博允
研究生(外文):Po-Yun Chuang
論文名稱:設計一個新的文件定義搜尋引擎
論文名稱(外文):Design of a new Search Engine for XML Document Schemata
指導教授:呂瑞麟呂瑞麟引用關係
指導教授(外文):Eric Jui-Lin Lu
學位類別:碩士
校院名稱:朝陽科技大學
系所名稱:資訊管理系碩士班
學門:電算機學門
學類:電算機一般學類
論文種類:學術論文
論文出版年:2004
畢業學年度:92
語文別:中文
論文頁數:42
中文關鍵詞:XML儲存庫DTD樹的校正距離搜尋引擎DTD樹
外文關鍵詞:DTDTree edit distanceDTD treeXML RepositorySearch engine
相關次數:
  • 被引用被引用:1
  • 點閱點閱:311
  • 評分評分:
  • 下載下載:31
  • 收藏至我的研究室書目清單書目收藏:0
近年來,有愈來愈多的商業交易透過網際網路達成,為了能夠在網路上順利的進行交易,企業必須使用良好的訊息交換格式。目前XML(Extensible Markup Language)為公認最好的資料呈現及交換的格式,這是因為XML允許使用者定義符合自己的標籤及屬性。但是允許使用者定義自己的標籤及屬性將會造成同義異名及同名異義的問題。所以,有許多知名的國際組織,建立XML儲存庫來解決此問題。然而,儲存庫被建立之後,將會有愈來愈多的文件定義被註冊,因此能夠正確的找到最接近使用者所想要的文件定義將是一個很重要的問題。目前主要有二種方法用來搜尋文件定義,為Kotsakis和Bohm提出的XSD(XML Schema Directory)和Lu和Jung提出的XDSearch,但此兩種方法在搜尋時並沒有考慮到meta-character及提供子結構的搜尋。因此,本研究將提出一個新的文件定義搜尋引擎,使用Zhang和Shasha樹與樹之間的校正距離的演算法測量兩文件定義之間的相似度,並利用此演算法所算出的距離矩陣來判斷文件定義是否有使用者需求的子結構。最後,針對XDSearch和提出的方法使用真實世界的文件定義做實驗比較及分析。
Electronic commerce is an emerging trade model under dramatically rapid development. It is believed that XML is one of the best layout formats for exchanging messages over the Internet. Although XML allow developers to define elements and attributes to fit their own needs, this feature makes it difficult to exchange XML documents between companies. To solve this problem, it is believed that XML repositories have to be established. XML repositories store and manage re-usable objects such as document schemata and entities such as DTDs for developers. Therefore, it is imperative to develop an efficient search engine so that developers can locate objects they need. However, to be able to search for the right document schemata, a powerful document schema search engine is required. In this paper, we propose a new document schema search engine to search similarity for XML schemata.
中文摘要 I
ABSTRACT II
誌謝 III
目 錄 IV
圖 目 錄 V
表目錄 VI
第一章 緒論 1
1.1研究背景 1
1.2研究動機 3
1.3研究目標 7
第二章 文獻探討 8
2.1關鍵字搜尋 8
2.2目錄搜尋 9
2.3 XSD 9
2.4 XDSEARCH 10
第三章 文件定義搜尋引擎的架構 12
3.1資訊元件 13
3.1.1文件定義儲存庫 13
3.1.2專有名訽表格 13
3.1.3文件定義表格 15
3.1.4索引表格 16
3.2搜尋元件 16
3.2.1建立DTD樹模組 16
3.2.2排序模組 20
3.2.3測量相似度模組 23
3.2.4索引模組 29
3.3介面元件 30
第四章 實作及實驗 31
4.1實作 31
4.2實驗 34
4.3分析 38
第五章 結論與未來工作 41
5.1結論 41
5.2未來工作 42
參考文獻 43
1.Ron Bourret, Christof Bornhovd, and Alejandro P. Buchmann, “A generic load/extract utility for data transfer between XML documents and relational databases,” in Proceedings of the Second International Workshop on Advanced Issues of E-Commerce and Web-based Information Systems, pp. 134–143, 2000.
2.B. Chandrasekaran and John R. Josephson, “What are ontologies, and why do we need them?,” IEEE Intelligent Systems, Vol. 14, No. 1, pp. 20-26, 1999.
3.Tae-Sun Chung and Hyoung-Joo Kim, “Techniques for the evaluation of XML queries: a survey,” Data & Knowledge Engineering, vol. 46, no. 2, pp. 225-246, August 2003.
4.Paolo Ciancarini, Fabio Vitali, and Cecilia Mascolo, “Managing complex documents over the WWW: A case study for XML,“ Knowledge and Data Engineering, vol. 11, no. 4, pp. 629–638, 1999.
5.Stefan Decker, Sergey Melnik, Frank van Harmelen, Dieter Fensel, Michel C. A. Klein, Jeen Broekstra, Michael Ermann, and Ian Horrocks, “The semantic web: The roles of XML and RDF,” IEEE Internet Computing, vol. 4, no. 5, pp. 63-74, 2000.
6.Mary Fernandez, Wang-Chiew Tan, and Dan Suciu, “Silkroute: trading between relations and XML,” Computer Networks, vol. 33, pp. 723–745, 2000.
7.Robert E. Filman and Sangam Pant, “Search the Internet,” IEEE Internet Computing, vol. 2, pp. 21–23, 1998.
8.Minos Garofalakis, Aristides Gionis, Rajeev Rastogi, S. Seshadri, and Kyuseok Shim, “XTRACT: a system for extracting document type descriptors from XML documents,” in Proceedings of ACM SIGMOD Conference on Management of Data, vol. 29, pp. 165–176, 2000.
9.Venkat N. Gudivada, Vijay V. Raghavan, William I. Grosky, and Rajesh Kasanagottu, “Information retrieval on the world wide web,” IEEE Internet Computing, vol. 1, pp. 58-68, 1997.
10.Charalambos. L. Iacovou, Izak Benbasat, and Albert S. Dexter, “Electronic data interchange and small organizations: Adoption and impact of technology,” Journal of the Association for Computing Machinery, vol. 19, no. 4, pp. 465–485, 1995.
11.Mei Kobayashi and Koichi Takeda, “Information retrieval on the web,” ACM Computing Surveys, vol. 32, no. 2, pp. 144-173, 2000.
12.Alan Kotok, “White Paper on Global XML Repositories for XML/EDI,” XML/EDI Group, 1999.
13.Alan Kotok, “Government and finance industry urge caution on XML,” Technical report, XML.com, 2002.
14.Evangelos Kotsakis, “XSD: A hierarchical access method for indexing XML schemata,” Knowledge and Information Systems, 4:168–201, 2002.
15.Evangelos Kotsakis and Klemens Bohm, “XML schema directory: A data structure for XML data processing,” in Proceedings of the First International Conferece on Web Information Systems Engineering, pp. 62–99, 2000.
16.Mong Li Lee, Liang Huai Yang, Wynne Hsu, and Xia Yang, “Xclust::Clustering XML Schemas for Effective Integration,” in Proceedings of 11th ACM International Conference on Information and Knowledge Management (CIKM), McLean,. Virginia, USA, pp. 292-299, November 2002.
17.Eric Jui-Lin Lu, Shihyu Chou, and Ru-Hui Tsai, “An empirical study of XML/EDI,” The Journal of Systems and Software, vol. 58, no. 3, pp. 269–277, September 2001.
18.Eric Jui-Lin Lu and Rong-Ji Hwang, “A distributed EDI model,” Journal of Systems and Software, vol. 56, no. 1, pp. 1–7, 2001.
19.Eric Jui-Lin Lu and Yu-Ming Jung, “XDSearch: An efficient search engine for XML document schemata,” Expert Systems with Applications, vol. 24, no. 2, pp. 213–224, Feburary 2003.
20.S. Macroibeaird, A.T. Manes, S. Hinkelman, and B. McKee, “Using UDDI to Find ebXML Reg/Reps,” ebXML, 2001.
21.Chuang-Hue Moh, Ee-Peng Lim, and Wee-Keong Ng, “DTD-Miner: A tool for mining DTD from XML documents,” in Proceedings of Second International Workshop on Advance Issues of E-Commerce and Web-Based Information Systems, pp. 144–151, 2000.
22.J. Rissanen, “Modeling by shortest data description,” Automatica, vol. 14, pp. 465–471, 1978.
23.Jayavel Shanmugasundaram, Kristin Tufte, Chun Zhang, Gang He, David J. DeWitt, and Jeffrey F. Naughton, “Relational databases for querying XML documents: Limitations and opportunities,” in Proceedings of the 25th VLDB Conference, pp. 302–314, September 1999.
24.Bo Shu and Subhash C. Kak, “A Neural Network-based Intelligent Metasearch Engine engine,” Information Sciences, vol. 120, pp. 1–11, 1999.
25.Hong Su, Harumi Kuno, and Elke Rundensteiner, “Automating the transformation of XML documents,” Workshop on Web Information and Data Management (WIDM''01), Atlanta, GA, USA, Nov. 9, 2001.
26.B. O Szuprowicz, "Search engine technologies for the world wide web and Interne," Computer Technology Research Corp, USA, 1997.
27.Kuo-Chung Tai, “The tree-to-tree correction problem,” Journal of the Association for Computing Machinery, vol. 26, pp. 422-433, 1979.
28.Kun-Won Tang and Subhash C. Kak, “A new corner classification approachto neural network training,” Circuits Systems Signal, vol. 17, no. 4, pp. 459–469, 1998.
29.Kuo-Chung Tai, “The Tree-to-Tree Correction Problem,” Journal of the Association for Computing Machinery, vol. 26, no. 3, pp. 422–433, July 1979.
30.Mark Wutka, “A Java DTD Parser,” http://www.wutka.com/dtdparserdownload.html, 2004
31.W3C Recommendation, “Extensible Markup Language (XML) 1.0 (Third Edition),” http://www.w3.org/TR/REC-xml, 2004
32.W3C Recommendation, “XML Schema Part 0: Primer,” http://www.w3.org/TR/xmlschema-0/, 2001
33.XML.Org, “XML.Org Registry Schemas/DTDs,” http://www.xml.org/xml/register.jsp, 2004.
34.Kaizhong Zhang, Rick Statman, and Dennis Shasha, “On the editing distance between unordered labeled trees,” Information processing Letters, vol. 42, pp. 133-139, 1992.
35.Kaizhong Zhang and Dennis Shasha, “Simple fast algorithms for the editing distance between trees and related problems,” SIAM Journal on Computing, vol. 18, no. 6, pp. 1245–1262, December 1989.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top