跳到主要內容

臺灣博碩士論文加值系統

(54.173.214.227) 您好!臺灣時間:2022/01/29 16:28
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:陳儷云
論文名稱:最大連續串長度的正確及近似分配
指導教授:連怡斌連怡斌引用關係
學位類別:碩士
校院名稱:國立彰化師範大學
系所名稱:數學系
學門:數學及統計學門
學類:數學學類
論文種類:學術論文
論文出版年:2002
畢業學年度:90
語文別:中文
中文關鍵詞:吻合片段近似分配完整連續串二元序列正確分配聚類分析矩陣最大
相關次數:
  • 被引用被引用:0
  • 點閱點閱:105
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
DNA序列間,吻合片段的最大連續串長度L (longest matching run),經常被用來衡量這些序列,或這些序列所代表之物種間的相似度(similarity),相似度越大,其來自相同祖先的可能性就越大。所以,求L之分配,自1950年代以來,有很多近似分配的成果,直到1994年, Fu和Koutras(1994)提出了完整連續串的exact分配,周婉琪(2001)更討論了非完整連續串的exact分配,但卻有計算上的困難,因此,我們利用Karlin的公式來求得L之近似分配,以解決計算上的困難,且將之與exact分配作比較,並利用它建構一實際資料之演化樹。

Abstract I
摘要II
目次III
第一章 緒論1
第二章 獨立序列最大完整連續串長之分配
2.1 Karlin的近似分配6
2.2 Exact Distribution7
2.3 模擬(simulation)9
2.4 Karlin的近似分配與exact分配及經驗分配之比較9
第三章 獨立序列最大非完整連續串長之分配
3.1 Karlin的近似分配11
3.2 Exact Distribution12
3.3 Karlin的近似分配與exact分配及經驗分配之比較13
第四章 Markov-dependent序列最大well-matching run的分配
4.1 Karlin的近似分配16
4.2 Karlin的近似分配與exact分配及經驗分配之比較19
4.3 近似機率與exact機率之統整21
第五章 實例應用與結果分析
5.1 相似度(similarity)矩陣23
5.2 結果比較25
第六章 結論29
參考文獻30
附錄
附錄一33
附錄二41
附錄三49
附錄四55

G. Bateman.(1948).
On the power function of the longest run as a test for randomness in a sequence of
alternatives.
it Biometrika. 35,97-112.
A. Dembo, S. Karlin and O. Zeitouni. (1992).
Critical phenomena for sequence matching with scoring.
it Statistics and Probability Latters. 4,1993-2021.
W.J. Ewens, Gregory R. Grant.
Statistical Methods in Bioinformatics.
385-422.
W.J. Ewens. (1999)
Statistical methods in human genetics.
it Statistics in genetics, 1999,Springer Verlag,New York,Inc.147-164.
David E. Fousler, Samuel Karlin. (1987).
Maximal success durations for a semi-markov process.
it Stochastic Processes and their Applications 24,203-224.
J.C. Fu and M.V. Kourtas. (1994).
Distribution theory of runs:A Markov Chain approach.
it Journal of American Statistical Association vol.39,no.427. 1050-1058.
Samuel Karlin and S.F. Altschul.(1990).
Methods for assessing the statistical significance of molecular sequence feature
by using general scaring schemes.
it Proc. Natl. Acad. Sci. U.S.A. Vol.87,2264-2268.
Samuel Karlin, Ghassan Ghandour. (1985).
Comparative statistics for DNA and portein sequences: Single sequence analysis.
it Proc. Natl. Acad. Sci. U.S.Avol.82,5800-5804.
Samuel Karlin, Friedemann Ost (1987).
Counts of long aligned word matches among random letter sequences.
it Adv. Appl. Prob. 19,293-351.
Samuel Karlin, Friedemann Ost (1988).
Maximal length of common words among random letter sequences.
it Annals of Probability, vol.16. Issue 2 (Apr.,1988),535-563.
Samuel Karlin, Friedemann Ost and B. Edwin Blaisdell. (1989).
it Patterns in DNA and Amino Acid Sequence and Their Statistical significance.
Boca Raton, FL: CRC Press. 133-156.
R.F. Mott, T.B.L. Kirkwood and R.N. Curnow. (1989).
A test for the statistical significance of DNA sequence similarities for application
in databank searches.
it CABIOS vol.5,no.2. 123-131.
R.F. Mott, T.B.L. Kirkwood and R.N. Curnow. (1990).
An accurate approximation to the distribution of the length of the longest matching
word between two random DNA sequences.
it Bulletin of Mathematical Biology vol.52,~no.6. 773-784.
W.Y Wendy Lou. (1996).
On runs and longest run tests:A method finite Morkov Chain imbedding.
it Journal of American Statistical Association vol.91,no.436. 1595-1601.
Marco Muselli (2000).
Useful inequalities for the longest run distribution.
it Statistics and Probability Letters.46,239-249.
Andreas N. Philippou, Frosso S. Makri (1986).
Successes,runs and longest runs.
it Statistics and Probability Letters. 4,101-105.
Eugene F. Schuster. (1996).
The conditional distribution of the longest run in a sample from a multiletter
alphabet.
it Commun. Statist. Simula. 25(1),215-224.
Temple F. Smith, Michael S. Waterman and Christian Burk. (1985).
The statistical distribution of nucleic acid similarities.
it Nucleic Acids Research vol.13,no.2. 645-656.
Serguei Yu. Novak (1992).
Longest runs in a sequence of m-dependent random variables.
it Probab. Theory Relat. Fields 91,269-281.
Serguei Yu. Novak (1998).
On the joint limiting distribution of the first and the second maxima.
it Commun. Statist.-Stochastic Models, 14(1 and 2),311-318.
Wan Qi Zhou (2001).
Statistical significance of long matching word between two DNA sequences.
http://www.kepu.com.cn/big5/lives/paleontology/mammal/mml30101.html

QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top