187. Repeated DNA Sequences

All DNA is composed of a series of nucleotides abbreviated as A, C, G, and T, for example: "ACGAATTCCG". When studying DNA, it is sometimes useful to identify repeated sequences within the DNA.
Write a function to find all the 10-letter-long sequences (substrings) that occur more than once in a DNA molecule.

For example,

Given s = "AAAAACCCCCAAAAACCCCCCAAAAAGGGTTT",
Return:
["AAAAACCCCC", "CCCCCAAAAA"].

Solution1:Hashset<string> 查重

思路:

屏幕快照 2017-09-14 上午11.54.20.png

Time Complexity: O(N) Space Complexity: O(10N)

Solution2:先encode 再Hashset<int> 查重

Time Complexity: O(N) Space Complexity: O(N)

Solution1 Code:

public List<String> findRepeatedDnaSequences(String s) {
    Set seen = new HashSet(), repeated = new HashSet();
    for (int i = 0; i + 9 < s.length(); i++) {
        String ten = s.substring(i, i + 10);
        if (!seen.add(ten))
            repeated.add(ten);
    }
    return new ArrayList(repeated);
}

Solution2 Code:

class Solution {
    private char[] encode_map = new char[26];
    
    public List<String> findRepeatedDnaSequences(String s) {
        // init
        Set<Integer> seen = new HashSet<Integer>();
        Set<String> repeated = new HashSet<String>();
        //encode_map['A' - 'A'] = 0;
        encode_map['C' - 'A'] = 1;
        encode_map['G' - 'A'] = 2;
        encode_map['T' - 'A'] = 3;
        
        // sliding window
        for (int i = 0; i + 9 < s.length(); i++) {
            String ten = s.substring(i, i + 10);
            int code = encode(ten);
            if (!seen.add(code))
                repeated.add(ten);
        }
        
        // result
        return new ArrayList(repeated);
    }
    
    private int encode(String s) {
        int code = 0;
        for(int j = 0; j < 10; j++) {
            code <<= 2;
            code |= encode_map[s.charAt(j) - 'A'];
        }
        return code;
    }
}
最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • 2017.06.28 豆腐今天很勇敢的打了两针防疫针,基本没有哭闹,为了保证侧卧,我们背着一个大炮弹,特别听话,特...
    morning糖阅读 337评论 0 0
  • 文|聂鲁达 今夜我可以写下最哀伤的诗篇。 写,譬如,“夜缀满繁星, 那些星,灿蓝,在远处颤抖。” 晚风在天空中回旋...
    石勇_dfb8阅读 354评论 0 0
  • 快进看完了西部世界的最后一集,机器人终究还是意识觉醒,将人类这些伪神干翻在地,罗伯特用他的生命作为贡品开启了“Jo...
    范小白Van阅读 333评论 0 0