[Article summary]
(12 clones), column pools (8 clones). Pooling greatly reduces the screening workload, lowers cost, and gives accurate, reliable screening results: 28 vs. 768.

Sheet of superpools, plate pools, row pools, column pools

BAC Screening
Figure captions:
First 48 samples: superpool (sp) screening results for the primer.
Last 48 samples: superpool (sp) screening results for the primer.
Screening results for the plate, row, and column pools of superpools sp27, sp34, and sp45 for the primer.
BAC clone determination (+ marks a positive clone).
Primer colony PCR.

Screening of Extension Clones
The density of STSs is not yet high enough to draw a high-resolution physical map, and STSs are unevenly distributed across the genome, so many regions are not covered by any positive clone and remain as gaps. Seed clones therefore have to be extended, either by fingerprinting (the FPC method) or by end-sequence walking (Walking by End Sequence), until they form contiguous clone groups (contigs). Clones obtained by these extension methods are called extension clones.
Figure labels: Contig 1, Contig 2, overlapping sequence, overlapping sequence, extension primer, extension clone identified by screening, 20 kb, ~300 bp.

Fingerprinting workflow: BAC clones are cultured in 96-deep-well plates, digested to completion with HindIII, and run on a 1% agarose gel, with a molecular weight marker in every 5th lane.

Fingerprinting Method (Walking by Fingerprinting Database)
Pick a seed clone close to the gap, digest it to build its fingerprint, and compare the fingerprint against the FPC database to find the contigs that contain this clone. From those contigs, identify the clones that cover the gap region; this achieves the extension.
Figure labels: complete HindIII digestion, comparison in the FPC database, Clone A, Clone B, Clone C, order C A B, misplacement of clones during contig building.

End-Sequence Walking Method (Walking by End Sequence)
Pick a seed clone close to the gap and sequence its ends, then compare the end sequences against the genome database to identify unique sequence fragments that can serve as new STS markers. Finally, design PCR primers for the new markers and screen for new clones following the STS-PCR "reaction pool" scheme; this achieves the extension.
Figure caption: query result for the sequence of clone 350A18 in the end sequence database.

IV. Clone Identification
Methods: STS-PCR, BAC end sequencing, fingerprinting, FISH.
Gel lane labels: CK2, CK1, CK2, CK1, 13f06, 267l16, 481o07, 250a15, 204c23, 340j13.
Figure caption: electrophoresis of 15 clones after HindIII digestion.

Drawing the "working framework map"
Clones are positioned according to the blastn comparison of their sequences against the STS database. Comparing the end sequences shows which end of a clone extends beyond the contig, so walking can be started promptly to screen for new clones.

Shotgun Sequencing, Assembly, and Finishing: Workflow

Shotgun Sequencing I: RANDOM PHASE
BAC clone: 100-200 kb
Sheared DNA: kb
Sequencing templates
Random reads

Shotgun Sequencing II: ASSEMBLY
Figure labels: consensus sequence, gap, low base quality, single-stranded region, misassembly (inverted).

Shotgun Sequencing III: FINISHING
Figure labels: consensus sequence, gap, single-stranded region, misassembly (inverted). Over successive frames the labels drop out in this order: single-stranded region, gap, misassembly (inverted).
High accuracy sequence: 1 error / 10,000 bases
Figure caption: Consed interface showing the sequence assembly result.

Filling "intra-clone gaps": finishing of BAC 453F3
Starting data: 2,600 reads from 453F3 in 12 contigs, plus 200 reads from the overlapping BAC 454F24.
First 4 primers (1, 2, 3, 4): all the contigs walked several hundred bp toward the gaps.
Second 3 primers (a, b, c).
A 1,200 bp AT-rich region, a (CATATATA)n repeat, was finally filled using the ET sequencing kit.
A 1,240 bp GC-rich region (GC content is %; the BAC's is %) was filled using the dGTP kit.
Figure labels: Sp6, T7, completed sequence.

Filling "inter-clone gaps": gap filling by end sequences
Legend: sequenced clone; BAC selected by end sequence.
Clones: 113L10, 324K11, 173F11, 101A4, 167P17, 586C2, 116K5, 572B2, 2544N5, R155E14, 2022P23, 2306M15, R149E15. Label: 60K.

Figure caption: the actual and predicted fingerprints of R260J13 digested with HindIII. Lane 1: marker; lane 2: R260J13 digested with HindIII; lane 3: the predicted fingerprint.
The assembled sequence of clone 211B19 has an error rate of zero.

Whole Genome Shotgun
This bacterium has a circular genome of 2,689,445 base pairs, the second largest among the thermophiles completely decoded to date.
Figure caption: circular representation of the genome of T. tengcongensis.

The world belongs to all
World-class sequencing production line: 70,000 clones and 30 million bases per day.
High output, low cost: $/bp? ¥/bp? cents/bp? fen/bp
Genomics: data-driven big science
Data is what counts
Nothing in the world is difficult for those willing to climb

De Novo Sequencing the Genome in BIG
Hu Songnian
Beijing Institute of Genomics, Chinese Academy of Sciences

Next Generation Sequencing (NGS) Technology
Second generation sequencers: 454 (1), Solexa (3), SOLiD (5).
Applications: de novo sequencing, RNA-seq, resequencing, ChIP-seq, Meth-seq, metagenomics.
Diagram labels: de novo sequencing, RNA-seq, resequencing, ChIP-seq, RNA-seq; "known" genome, novel genome(s), both types.

Instruments: 1 x 454, 2 x 5500xl, 3 x Solexa, 2 x HiSeq 2000, 3 x 3730xl, 1 x Sequenom.
Data center: 1000 CPU cores, 800 TB storage.
A complete experimental and sequencing system and workflow.
Strong computing, storage, and database support.
Mature bioinformatics data processing and analysis pipelines.
2022/5/24

Second generation sequencers in BIG
Sequencers:
Platform        Num   Raw/run       Length
SOLiD 4         5     80~100 Gb     50 bp
GA II           3     40~60 Gb      120 bp
454             1     400 Mb        400 bp
SOLiD 5500xl    0     150~200 Gb    50 bp
HiSeq 2000      1     200~300 Gb    100 bp
Equipment: 10 high-throughput sequencers, 2 3730XL sequencers, 1 Sequenom instrument, more than 100 high-performance blade servers, 4 large-memory servers, and about 800 TB of storage.

Sample requirements by sequencing platform (columns: SOLiD, Solexa GA, 454)
DNA
Fragment: 25ug, 25ug, 25ug
Pairend: 25ug, 25ug
Matepair: 5-100ug, 5-100ug, 5-100ug
RNA
Transcriptome: 10