跳到主要內容

臺灣博碩士論文加值系統

(54.172.135.8) 您好!臺灣時間:2022/01/18 16:17
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:王正峰
研究生(外文):Jeng-Feng Wang
論文名稱:一個可延伸且易於維護的XML文件資料庫系統
論文名稱(外文):An Extensible and Easy-to-maintain Database System for XML Documents
指導教授:楊武楊武引用關係
指導教授(外文):Wuu Yang
學位類別:碩士
校院名稱:國立交通大學
系所名稱:資訊科學系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2002
畢業學年度:90
語文別:英文
論文頁數:75
中文關鍵詞:XML資料庫XML文件查詢
外文關鍵詞:XML DatabaseXML DocumentXML Query
相關次數:
  • 被引用被引用:2
  • 點閱點閱:300
  • 評分評分:
  • 下載下載:74
  • 收藏至我的研究室書目清單書目收藏:2
  隨著XML逐漸成為網際網路上資料交換的標準,XML文件的管理也成為一項重要的課題。欲管理大量的資料,使用資料庫來處理應該是無庸置疑的。目前使用關連式資料庫來處理XML文件的研究中,都存在著資料庫裡所儲存的XML文件不易更新的問題,然而實際上,一份XML文件的內容往往是經常在變動的。
  在這篇論文中,我們提出一個新的方法,能夠很容易地更新資料庫中的XML文件。這個方法首先定義了一個與XML文件結構無關的關連式綱目(relational schema),利用這個綱目跟一個維護演算法(maintenance algorithm)來保持XML文件與資料庫內容的一致性。此外,由於XML的各項相關技術大多尚未標準化,未來隨實有可能會出現新的XML技術,因此,為了保持我們XML資料庫系統的彈性,我們也定義了一套延伸系統功能的架構,可以在不修改原始系統的原則下,利用各種plug-in讓我們的XML資料庫系統增加新的功能。根據我們實驗的結果,我們的方法在查詢XML文件有不錯的表現,並且在維護經常有小幅度變更的XML文件有特別好的效果。

XML is becoming a standard for data exchange over the Internet. XML’s proliferation raises the question of how XML documents are managed. It is natural to use a DBMS due to the vast quantity of XML documents. There have been several approaches to handling XML documents with a relational database. However, they share the common question that it is difficult to update the XML documents. Unfortunately, an XML document is changing all the time.
In this thesis, we propose a new approach to easily updating XML documents in a relational database. Firstly, the approach defines a relational schema which is independent of the structure of the XML documents. The consistency between the XML documents and the database contents is preserved with this schema and a maintenance algorithm. Furthermore, most of the XML related technologies have not been standardized yet. It is quite possible that new technologies will emerge in the future. Therefore, our database system proposes an extensible architecture for keeping the flexibility of the system. It uses plug-ins to extend its functionalities without modifying the existing system. According to our experiment, our approach has a good performance in the query retrieval. In particular, it has an obvious improvement in maintaining frequent, small updates of XML documents.

Abstract (in Chinese) i
Abstract (in English) ii
Acknowledgements iii
Contents iv
List of Figures vi
List of Tables viii
1 Introduction 1
1.1 Motivation……………………………………………………………1
1.2 Goals…………………………………………………………………2
1.3 Organization…………………………………………………………3
2 Related Work 4
2.1 Classifying XML Documents………………………………………4
2.1.1 Data-Centric XML Document Management………………………6
2.1.2 Document-Centric XML Document Management…………………8
2.2 Existing Approaches………………………………………………9
2.2.1 Designed Database Schemas Based on DTDs…………………9
2.2.2 Storing XML Documents Without DTD…………………………12
3 System Design and Implementation 15
3.1 System Architecture………………………………………………15
3.1.1 XML Database System Architecture…………………………16
3.1.1.1 Parser…………………………………………………………17
3.1.1.2 Reconstructor…………………………………………………19
3.1.1.3 Query Mapper…………………………………………………22
3.1.1.4 Maintainer……………………………………………………25
3.1.1.5 Database Connection Interface……………………………27
3.1.2 General Extenders………………………………………………29
3.2 System Implementation……………………………………………35
3.2.1 The Default Database Schema…………………………………35
3.2.2 The Default Parser Algorithm………………………………41
3.2.3 The Default Reconstructor Algorithm………………………41
3.2.4 The Default Query Mapper Algorithm………………………46
3.2.5 The Default Maintainer Algorithm…………………………50
4 Performance Evaluation and Analysis 57
4.1 Experimental Setup………………………………………………57
4.2 Experimental Results……………………………………………58
4.2.1 Database Size……………………………………………………59
4.2.2 Query Retrieval…………………………………………………59
4.2.3 Maintenance………………………………………………………61
5 Conclusion and Future Work 67
5.1 Conclusion…………………………………………………………67
5.2 Future Work…………………………………………………………67
References 69
Appendix 72

[1] S. Abiteboul, S. Cluet, V. Christophides, T. Milo, G. Moerkotte and J. Simeon, “Querying documents in object databases”, in International Journal on Digital Libraries, Vol. 1, No. 1, Pages 5-19, April 1997.
[2] A. V. Aho , D. S. Hirschberg and J. D. Ullman, “Bounds on the Complexity of the Longest Common Subsequence Problem”, in Journal of the ACM, Vol. 23, No. 1, Pages 1-12, January 1976.
[3] E. Bertino and B. Catania, “Integrating XML and Databases”, in IEEE Internet Computing, Vol. 5, No. 4, Pages 84-88, July-August 2001.
[4] R. Bourret, “XML Database Products”, Tech. Report, Last Update: May 14, 2002, http://www.rpbourret.com/xml/XMLDatabaseProds.htm.
[5] R. Bourret, “XML and Databases”, Tech. Report, Last Update: February 2002, http://www.rpbourret.com/xml/XMLAndDatabases.htm.
[6] T. S. Chung, S. Park, S. Y. Han and H. J. Kim, “Extracting Object-Oriented Database Schemas from XML DTDs Using Inheritance”, in Procedings of Second International Conference on Electronic Commerce and Web Technologies (EC-Web), Pages 49-59, September 2001.
[7] D. Florescu and D. Kossmann, “Storing and Querying XML Data using an RDBMS”, in Bulletin of the IEEE Computer Society Technical Committee on Data Engineering, Vol. 22, No. 3, Pages 27-34, September 1999.
[8] D. S. Hirschberg, “A Linear Space Algorithm for Computing Maximal Common Subsequences”, in Communications of the ACM, Vol. 18, No. 6, Pages 341-343, June 1975.
[9] S. C. Lin, “A Database System for XML Documents”, M.S. Thesis, Department of Computer Science and Information Engineering, National Taiwan University, Taiwan, June 2001.
[10] J. A. Miller and S. Sheth, “Querying XML documents”, IEEE Potentials, Vol. 19, No. 1, Pages 24-26, February-March 2000.
[11] J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. Dewitt and J. Naughton, “Relational Databases for Querying XML documents: Limitations and Opportunities”, in Proceedings of 25th International Conference on Very Large Data Bases (VLDB), Edinburgh, Scotland, UK, Pages 302-314, September 1999.
[12] J. Shanmugasundaram, E. Shekita, J. Kiernan, R. Krishnamurthy, E. Viglas, J. Naughton and I. Tatarinov, “A General Technique for Querying XML Documents using a Relational Database System”, SIGMOD Record, Vol. 30, No. 3, Pages 20-26, September 2001.
[13] D. Shasha and K. Zhang, “Fast Algorithms for the Unit Cost Editing Distance Between Trees”, in Journal of Algorithms, Vol. 11, No. 4, Pages 581-621, December 1990.
[14] T. Shimura, M. Yoshikawa and S. Uemura, “Storage and Retrieval of XML Documents using Object-Relational Databases”, in Proceedings of the 10th International Conference on Database and Expert Systems Applications (DEXA), Florence, Italy, Pages 206-217, August-September 1999.
[15] W. Yang, “A Fast Tree-Matching Algorithm for Similar Ordered Trees”, Tech. Report, Computer and Information Science Department, National Chiao-Tung Unicersity, HsinChu, Taiwan, 1993.
[16] M. Yoshikawa and T. Amagasa, T. Shimura and S. Uemura, “XRel: A Path-Based Approach to Storage and Retrieval of XML Documents Using Relational Databases”, in ACM Transactions on Internet Technology, Vol. 1, No. 1, Pages 110-141, August 2001.
[17] “Eclipse Project”, Release 1.0: November 7, 2001, http://www.eclipse.org/
[18] “NetBeans”, Release 3.3.1: February 1, 2002, http://www.netbeans.org/
[19] “Simple Object Access Protocol (SOAP) 1.1”, W3C Note, Latest Version: May 8, 2000, http://www.w3.org/TR/SOAP/
[20] “XQuery 1.0”, W3C Working Draft, Latest Version: April 30, 2002, http://www.w3.org/TR/xquery/
[21] “XML-QL”, W3C Submission, Latest Version: August 19, 1998, http://www.w3.org/TR/NOTE-xml-ql/
[22] “Extensible Makeup Language (XML) 1.0”, W3C Recommendation, Latest Version: October 6, 2000, http://www.w3.org/TR/REC-xml
[23] “XML Path Language (XPath) 1.0”, W3C Recommendation, Latest Version: November 16, 1999, http://www.w3.org/TR/xpath

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top