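Gene-family analyses often include a polished-looking figure that combines conserved motifs with gene structure. In this post I'll walk you through making that kind of figure with TBtools.
Preparing the input files
(1) Extract protein sequences with gffread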
gffread ref.gff3 -g ref.fa -x ref.cds -y ref.pep
Note: the sequence IDs produced here are the mRNA IDs, which are exactly the IDs TBtools needs for plotting.
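Before moving on, it can help to look at the IDs that actually ended up in the protein file. The snippet below is a minimal sketch, not part of the original workflow: `fasta_ids` is a hypothetical helper name, and it assumes the ID is the first whitespace-delimited token on each header line.

```shell
# Hypothetical helper (not part of gffread or TBtools):
# print the ID (first token after '>') of every record in a FASTA file.
fasta_ids() {
    awk '/^>/ { sub(/^>/, "", $1); print $1 }' "$1"
}

# Example: peek at the first few IDs in the protein file from step (1).
# fasta_ids ref.pep | head
```

If these IDs look like gene IDs rather than mRNA IDs, it is worth checking the GFF3 before building the tree, since the tree labels will inherit them.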
(2) From the pep sequences above, pull out the sequences of the gene family we identified, then build a phylogenetic tree to get a nwk file:
(((((((((VIT_01s0146g00360.t01:0.28847385,VIT_14s0068g01380.t01:0.28230241)1.0000:0.06111202,VIT_07s0104g01360.t01:0.34914978)0.9300:0.02380875,(VIT_12s0059g02500.t01:0.34391825,(VIT_12s0057g01350.t01:0.19570323,VIT_00s0194g00070.t01:0.19115808)1.0000:0.14424314)0.9800:0.03757793)0.5300:0.00823856,(VIT_14s0083g00640.t01:0.30516479,(VIT_11s0052g01800.t01:0.23419010,VIT_04s0008g07340.t01:0.23032602)1.0000:0.05679977)1.0000:0.08472152)0.5700:0.01435481,VIT_01s0011g04250.t01:0.37548261)0.5300:0.01513115,(VIT_17s0000g06570.t01:0.38572071,(VIT_16s0098g00900.t01:0.35034839,(VIT_15s0048g02540.t01:0.33632291,(VIT_13s0067g03390.t01:0.29692585,VIT_06s0004g03660.t01:0.18583277)0.9100:0.05275364)0.4200:0.00591785)0.9800:0.03728953)0.9400:0.04319700)0.1500:0.00249561,(VIT_01s0011g03520.t01:0.39603977,(VIT_11s0103g00760.t01:0.37486077,(VIT_14s0219g00220.t01:0.33266913,VIT_06s0004g07210.t01:0.22052236)0.3200:0.03996263)0.4800:0.02903342)0.8800:0.03436478)0.4500:0.00939851,(VIT_16s0098g00360.t01:0.41366279,(VIT_05s0049g01830.t01:0.40430820,(VIT_19s0027g01130.t01:0.36180423,(VIT_02s0025g00120.t01:0.38516047,VIT_06s0004g03870.t01:0.35902557)0.3300:0.00335603)0.5100:0.01651906)0.4400:0.00772172)0.8500:0.01944839)0.7600:0.01109563,(VIT_01s0146g00480.t01:0.41235474,((VIT_05s0020g04060.t01:0.25969004,VIT_00s0179g00090.t01:0.26246186)1.0000:0.14940630,((VIT_01s0011g05560.t01:0.31757637,VIT_17s0000g02230.t01:0.32189731)1.0000:0.08184726,((VIT_11s0016g00710.t01:0.30925999,VIT_09s0002g00890.t01:0.31330810)1.0000:0.07026754,(VIT_12s0035g00900.t01:0.35070303,(VIT_10s0003g03810.t01:0.16417712,(VIT_04s0008g00110.t01:0.02065004,(VIT_10s0003g03790.t01:0.02855193,VIT_10s0003g03800.t01:0.02656618)0.6000:0.00791335)1.0000:0.15293141)1.0000:0.15416490)0.6200:0.02555785)0.4300:0.01340408)0.3000:0.01029296)0.2800:0.01120827)1.0000:0.03528728,(VIT_18s0001g07730.t01:0.30643164,(VIT_18s0001g07720.t01:0.29221163,(VIT_03s0038g00480.t01:0.27585435,VIT_09s0054g00440.t01:0.27677723)0.3900:0.00555378)0.7000:0.01513123)1.0000:0.16859293);
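I extracted 40 gene sequences containing the CTT motif from the grape genome and built the tree with MEGA; the resulting tree file is shown above.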
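Step (2) does not say how the family sequences were pulled out of the pep file, so here is one possible way to do it, as a rough sketch only. The helper names (`subset_fasta`, `newick_leaves`) and the file names `family_ids.txt`, `family.pep` and `tree.nwk` are assumptions made for illustration. The second helper lists the leaf labels of the tree so you can confirm they are the same mRNA IDs found in the pep file; it assumes unquoted labels with no spaces.

```shell
# Hypothetical helper: print only the FASTA records whose ID
# (first token after '>') appears in a one-ID-per-line list.
# Usage: subset_fasta family_ids.txt ref.pep > family.pep
subset_fasta() {
    awk 'NR == FNR { if ($1 != "") keep[$1] = 1; next }   # first file: ID list
         /^>/ { id = substr($1, 2); printing = (id in keep) }
         printing' "$1" "$2"
}

# Hypothetical helper: print the leaf labels of a Newick tree, one per line.
# Leaves follow '(' or ','; support values follow ')' and are skipped.
# Line breaks inside the file are removed first, so a wrapped tree still works.
newick_leaves() {
    tr -d '\n\r' < "$1" | grep -oE '[(,][^(),:;]+' | sed 's/^[(,]//'
}

# Example check (file names are placeholders):
# newick_leaves tree.nwk > leaves.txt
# wc -l < leaves.txt                                # number of leaves in the tree
# subset_fasta leaves.txt ref.pep | grep -c '^>'    # leaves that have a pep record
```

For the tree shown above, the leaf count should come out as 40. If the two numbers in the example check differ, some tree labels do not match any ID in the pep file, and TBtools would not be able to link those branches to their gene structures.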