【正文】
hobic core may be provided by cystines and metal ions, respectively. Homologous domains with mon functions usually show sequence similarities.結構域(structure domain)是在蛋白質三級結構中介于二級和三級結構之間的可以明顯區(qū)分但又相對獨立的折疊單元,每個結構域自身形成緊實的三維結構,可以獨立存在或折疊,但結構域與結構域之間關系較為松散。結構功能域通常由25~300個氨基酸殘基組成,不同蛋白質分子中結構域的數目不同,同一個蛋白質分子中的幾個結構域彼此相似或者不盡相同。結構域是蛋白質的功能、結構和進化單位,結構功能域分析對于蛋白質結構的分類和預測有著重要的作用。Bits scoresAlignment scores are reported by HMMer and BLAST as bits scores. The likelihood that the query sequence is a bona fide homologue of the database sequence is pared to the likelihood that the sequence was instead generated by a “random” model. Taking the logarithm (to base 2) of this likelihood ratio gives the bits score.PvalueThis represents a probability that, given a database of a particular size, random sequences score higher than a value X. Pvalues are generated by the BLAST algorithm that has been integrated into SMART.EvalueThis represents the number of sequences with a score greaterthan, or equal to, X, expected absolutely by chance. The Evalue connects the score (“X”) of an alignment between a usersupplied sequence and a database sequence, generated by any algorithm, with how many alignments with similar or greater scores that would be expected from a search of a random sequence database of equivalent size. Since version Evalues are calculated using Hidden Markov Models, leading to more accurate estimates than before.Motif模體Sequence motifs are short conserved regions of polypeptides. Sets of sequence motifs need not necessarily represent homologues.motif又稱模體,是序列中局部的保守區(qū)域,或者是一組序列中共有的一小段序列模式。一般指構成任何一種特征序列的基本結構,但是多數情況下是指可能具有分子功能、結構性質或家族成員相關的任何序列模式。motif作為結構域中的亞單位,表現結構域的各種生物學功能。常見的蛋白質結構motif,種類超過28類。常見的motif搜索方法主要基于兩種,一種是序列模式(Pattern),另外一種是序列特征譜(Profile)。Pattern序列模式方法直接搜索關鍵的幾個保守殘基,忽略其他位置的氨基酸多態(tài)性。例如,“Lx(6)Lx(6)Lx(6)L”(x表示任意氨基酸)為亮氨酸拉鏈結構的序列模式,這樣一段序列多處于蛋白質的活性區(qū)域或重要結構區(qū),較為保守,是motif搜索的目標之一。由于序列模式方法搜索的不是完整的結構域或整個蛋白的特征,故其適用于識別保守的功能區(qū)域,對于序列變異大的功能區(qū)域,則無法準確識別。此外,隨機的氨基酸序列也可能出現短小的序列模式,故易產生假陽性,對于此類搜索需要搜索多個不同的數據庫,得到盡可能多得同源序列,從而才能更好的說明序列中包含的信息。ProfileA profile is a table of positionspecific scores and gap penalties, representing an homologous family, that may be used to search sequence databases (Ref.:[1],[2],[3]).In CLUSTALWderived profiles those sequences that are more distantly related are assigned higher weights ([4],[5],[6]). Issues in profilebased database searching are discussed in Bork amp。 Gibson (1996)[7].序列特征譜搜索是基于蛋白質序列多重比對結果中的保守序列區(qū)域進行搜索,由于考慮了不同保守度的氨基酸在相應位置的權重,可以更為敏感的檢測到進化距離較遠的蛋白質相關性,得到比序列模式方法更為靈敏的結果,但可靠的序列特征譜數目往往有限,因此該方法在進行新基因功能預測時受到了較大的障礙。Alignment 多重比對,序列比較Representation of a prediction of the amino acids in tertiary structures of homologues that overlay in three dimensions.3個月前下一篇上一篇投稿/私信/存檔/RSS關注窗體頂端窗體底端Powered bydiandian.返回頂部寧可累死在路上,也不能閑死在家里!寧可去碰壁,也不能面壁。是狼就要練好牙,是羊就要練好腿。什么是奮斗?奮斗就是每天很難,可一年一年卻越來越容易。不奮斗就是每天都很容易,可一年一年越來越難。能干的人,不在情緒上計較,只在做事上認真;無能的人!不在做事上認真,只在情緒上計較。拼一個春夏秋冬!贏一個無悔人生!早安!—————獻給所有努力的人.學習參考