【正文】
,并且與56%的人類基因組編碼基因相關(guān)。由于DNA甲基化與人類發(fā)育和腫瘤疾病的密切關(guān)系,特別是CpG島甲基化所致抑癌基因轉(zhuǎn)錄失活問題,DNA甲基化已經(jīng)成為表觀遺傳學(xué)和表觀基因組學(xué)的重要研究內(nèi)容。基因組注釋的研究內(nèi)容包括基因識(shí)別和基因功能注釋兩個(gè)方面。以上轉(zhuǎn)自,稍加修改。EXPECTE期望值(Evalue)這個(gè)數(shù)值表示你僅僅因?yàn)殡S機(jī)性造成獲得這一alignment結(jié)果的可能次數(shù)。從搜索的角度看,E值越小,alignment結(jié)果越顯著。這一設(shè)置則表示聯(lián)配結(jié)果中將有10個(gè)匹配序列是由隨機(jī)產(chǎn)生,如果聯(lián)配的統(tǒng)計(jì)顯著性值(E值)小于該值(10),則該alignment將被檢出,換句話說,比較低的閥值將使搜索的匹配要求更嚴(yán)格,結(jié)果報(bào)告中隨機(jī)產(chǎn)生的匹配序列減少。其中,串聯(lián)重復(fù)序列是由相關(guān)的重復(fù)單位首尾相連、成串排列而成的。微衛(wèi)星DNA又叫簡單重復(fù)序列(Simple Sequence Repeat,SSR),指的是基因組中由16個(gè)核苷酸組成的基本單位重復(fù)多次構(gòu)成的一段DNA,廣泛分布于基因組的不同位置,長度一般在200 bp以下。根據(jù)SSR核心序列排列方式的不同,可分為3種類型:完全型(perfect)。如: ATATATATATATATATATATATATATATATATAT不完全型(imperfect)。如:ATATATATGGATATATATATCGATATATATATATATATGGATATATATAT復(fù)合型(pound)。如:ATATATATATATATGGGATATATATATATA3種類型中完全型是SSR標(biāo)記中應(yīng)用較多的一種類型。Conserved structural entities with distinctive secondary structure content and an hydrophobic core. In small disulphiderich and Zn2+binding or Ca2+ binding domains the hydrophobic core may be provided by cystines and metal ions, respectively. Homologous domains with mon functions usually show sequence similarities.結(jié)構(gòu)域(structure domain)是在蛋白質(zhì)三級(jí)結(jié)構(gòu)中介于二級(jí)和三級(jí)結(jié)構(gòu)之間的可以明顯區(qū)分但又相對(duì)獨(dú)立的折疊單元,每個(gè)結(jié)構(gòu)域自身形成緊實(shí)的三維結(jié)構(gòu),可以獨(dú)立存在或折疊,但結(jié)構(gòu)域與結(jié)構(gòu)域之間關(guān)系較為松散。結(jié)構(gòu)域是蛋白質(zhì)的功能、結(jié)構(gòu)和進(jìn)化單位,結(jié)構(gòu)功能域分析對(duì)于蛋白質(zhì)結(jié)構(gòu)的分類和預(yù)測(cè)有著重要的作用。Alignment scores are reported by HMMer and BLAST as bits scores. The likelihood that the query sequence is a bona fide homologue of the database sequence is pared to the likelihood that the sequence was instead generated by a “random” model. Taking the logarithm (to base 2) of this likelihood ratio gives the bits score.This represents a probability that, given a database of a particular size, random sequences score higher than a value X. Pvalues are generated by the BLAST algorithm that has been integrated into SMART.This represents the number of sequences with a score greaterthan, or equal to, X, expected absolutely by chance. The Evalue connects the score (“X”) of an alignment between a usersupplied sequence and a database sequence, generated by any algorithm, with how many alignments with similar or greater scores that would be expected from a search of a random sequence database of equivalent size. Since version Evalues are calculated using Hidden Markov Models, leading to more accurate estimates than before.Sequence motifs are short conserved regions of polypeptides. Sets of sequence motifs need not necessarily represent homologues.motif又稱模體,是序列中局部的保守區(qū)域,或者是一組序列中共有的一小段序列模式。motif作為結(jié)構(gòu)域中的亞單位,表現(xiàn)結(jié)構(gòu)域的各種生物學(xué)功能。常見的motif搜索方法主要基于兩種,一種是序列模式(Pattern),另外一種是序列特征譜(Profile)。序列模式方法直接搜索關(guān)鍵的幾個(gè)保守殘基,忽略其他位置的氨基酸多態(tài)性。由于序列模式方法搜索的不是完整的結(jié)構(gòu)域或整個(gè)蛋白的特征,故其適用于識(shí)別保守的功能區(qū)域,對(duì)于序列變異大的功能區(qū)域,則無法準(zhǔn)確識(shí)別。A pro[1][2][3][4][5][6]filebased database searching are discussed in Bork amp。[7]序列特征譜搜索是基于蛋白質(zhì)序列多重比對(duì)結(jié)果中的保守序列區(qū)域進(jìn)行搜索,由于考慮了不同保守度的氨基酸在相應(yīng)位置的權(quán)重,可以更為敏感的檢測(cè)到進(jìn)化距離較遠(yuǎn)的蛋白質(zhì)相關(guān)性,得到比序列模式方法更為靈敏的結(jié)果,但可靠的序列特征譜數(shù)目往往有限,因此該方法在進(jìn)行新基因功能預(yù)測(cè)時(shí)受到了較大的障礙。Alignment 多重比對(duì),序列比較Representation of a prediction of the amino acids in tertiary structures of homologues that overlay in three dimensions.3個(gè)月前下一篇上一篇投稿私信存檔RSS關(guān)diandian.返回頂部寧可累死在路上,也不能閑死在家里!寧可去碰壁,也不能面壁。什么是奮斗?奮斗就是每天很難,可一年一年卻越來越容易。能干的人,不在情緒上計(jì)較,只在做事上認(rèn)真;無能的人!不在做事上認(rèn)真,只在情