National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)

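Author: 王義評
Author (English name): Yi-ping Wang
Thesis title: 選擇性剪裁資料庫的設計與實作
Thesis title (English): The Design and Implementation of an Alternative Splicing Database
Advisor: 許芳榮
Advisor (English name): F. R. Hsu
Degree: Master's
Institution: Feng Chia University (逢甲大學)
Department: Graduate Institute of Information Engineering (資訊工程所)
Discipline: Engineering
Field: Electrical Engineering and Computer Science
Thesis type: Academic thesis
Year of publication: 2006
Graduation academic year: 94 (ROC calendar)
Language: English
Number of pages: 64
Keywords (Chinese, translated): transcriptome; gene expression; alternative splicing; alternative splicing database
Keywords (English): transcriptome; gene expression; alternative splicing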
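Alternative splicing is an important regulatory mechanism of gene expression in eukaryotes. Through alternative splicing, a single gene can yield different protein products at different developmental stages, which increases the diversity of its products. However, biologists have not yet worked out a complete account of which regions of an organism's genes undergo alternative splicing, and under what conditions. Identifying as many alternative splicing events as possible therefore gives biologists better research data. The large volume of expressed sequence tags (ESTs) now being sequenced makes it feasible to identify alternative splicing information for each species automatically by computer. An alternative splicing database is a powerful tool for organizing, storing, and presenting the alternative splicing information identified in this way. A value added transcriptome database (AVATAR) is one such alternative splicing database. However, for some of the alternative splicing events identified in AVATAR, the underlying sequence alignments are less satisfactory than those produced by other alignment tools. We therefore redesign and build a database that uses two alignment tools together to address this problem. We also integrate related research results at the transcriptome level, such as alternative splicing events conserved across species, and we identify, annotate, and provide NMD-related alternative splicing events. In addition, we design an easy-to-use web interface that offers a more complete visual presentation of each type of alternative splicing event.
Keywords: alternative splicing, gene expression, transcriptome, alternative splicing database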
Alternative splicing is an important regulatory mechanism of gene expression in eukaryotic cells. It increases the diversity of the proteins produced from a single gene, and variant splice isoforms are often specific to particular stages of development. However, the complete mechanisms at the transcriptome level are still poorly understood. The rapid growth in the number of available ESTs now makes it possible to identify alternative splicing events systematically, which helps biologists carry out more in-depth research. Alternative splicing databases are powerful tools for systematically collecting, annotating, storing, and providing alternative splicing information. However, almost every alternative splicing database uses its own alignment methods and its own criteria for filtering alignment results. As a result, the alternative splicing events reported by different databases largely differ. We therefore build a new alternative splicing database that addresses this problem by combining three different alignment tools. Beyond addressing this defect, the new database integrates other research results (such as cross-species conservation), offers a more visual display of six kinds of alternative splicing events, annotates each alternative splicing event with the tissue information recorded in its ESTs, and detects NMD-related alternative splicing events. The new database was built on the latest genome and EST releases available from NCBI in May 2006.
Keywords: alternative splicing, gene expression, transcriptome.
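The abstract does not say how NMD-related events are flagged. One commonly used heuristic is the "50-nucleotide rule": a transcript is treated as an NMD candidate when its stop codon lies more than about 50 nt upstream of the last exon-exon junction. The Python sketch below illustrates that heuristic only; the function names, the 50 nt threshold, and the use of mRNA coordinates are assumptions for illustration, not the thesis's actual NMD identifier.

```python
from typing import List, Optional

STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop_end(mrna: str, cds_start: int) -> Optional[int]:
    """Return the mRNA index just past the first in-frame stop codon.

    `cds_start` is the 0-based index of the start codon. Returns None
    if no in-frame stop codon is found. (Hypothetical helper.)
    """
    for i in range(cds_start, len(mrna) - 2, 3):
        if mrna[i:i + 3].upper() in STOP_CODONS:
            return i + 3
    return None


def is_nmd_candidate(exon_lengths: List[int], stop_end: int,
                     threshold: int = 50) -> bool:
    """Apply the 50-nt heuristic in mRNA coordinates.

    `exon_lengths` lists exon lengths in transcript order, so the last
    exon-exon junction sits at the summed length of all exons but the
    last. The threshold of 50 nt is an assumed default.
    """
    if len(exon_lengths) < 2:
        return False  # a single-exon transcript has no junction
    last_junction = sum(exon_lengths[:-1])
    return last_junction - stop_end > threshold


# Toy transcript: ATG AAA AAA TAA followed by 100 nt of 3' sequence.
mrna = "ATG" + "AAA" * 2 + "TAA" + "C" * 100
stop_end = first_stop_end(mrna, cds_start=0)
```

With this toy transcript, splitting the 112 nt into exons of 40, 60, and 12 nt puts the stop codon far upstream of the last junction, so `is_nmd_candidate([40, 60, 12], stop_end)` flags it. Splitting it into 40, 20, and 52 nt does not.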
Acknowledgements
Chinese Abstract
Abstract
Chapter 1 Introduction
1.1 Motivation
1.2 Objective
Chapter 2 Related Works
2.1 Alternative splicing database
2.1.1 The Mechanism of Alternative Splicing
2.1.2 Alternative splicing databases
2.1.3 The methods to build A.S. databases
2.1.4 The characteristics of the development about A.S. databases
2.1.5 A Value Added TrAnscRiptome Database (AVATAR)
2.2 The mechanism of NMD
Chapter 3 Materials and Methods
3.1 Materials
3.2 Methods
3.2.1 AS detector
3.2.2 Cross-species information Integration
3.2.3 NMD identifier
Chapter 4 Experimental results
4.1 Hardware and Software
4.1.1 Hardware
4.1.2 Software
4.2 Experimental results
4.2.1 Time cost
4.2.2 Comparison
4.3 The analysis about NMD
4.4 The introduction of the website
Chapter 5 Conclusion
5.1 Discussion
5.2 Conclusion
5.3 The future works
References