1.已知某序列为
ATCGCGATTTCGT
每个位置有高GC含量(H)和低GC(L)含量两种态(State)。
(1)写出HMM(不用标出概率)
(2)已知概率如下,写一个脚本输出此序列最可能的state path
emission probability:
H: p(A) = p(T) = 0.1; p(C) = p(G) = 0.40; L: p(A) = p(T) = p(C) = p(G) = 0.25
transition probability:
p(H→L) = 0.3; p(H→H) = 0.7; p(L→H) = 0.6; p(L→L) = 0.4
p(start→H) = p(start→L) = p(H→end) = p(L→end) = 0.5
- 已知两个序列:
seq_1 = ATGCTGATGC
seq_2 = GATCCTAGCT
后续施工中
已知某字符串pineapple,写出BW变换及还原的过程。