第2关 - 简书

第2关链接：http://www.pythonchallenge.com/pc/def/ocr.html

第二关题目.png

要求是识别字符，可能在书中，也可能在页面源代码中。
但是看图片，根本看不清嘛！所以肯定在源代码中了。果断按下F12，在源码中找到了注释的片段：

注释片段.png

看到页面源码中有一大段字符串，提示找出重复最少的。注释的字符串很长，果断选择requests方法。代码：

import urllib.request
import re

url = 'http://www.pythonchallenge.com/pc/def/ocr.html'
req = urllib.request.urlopen(url)
body = req.read()
req.close()
body = body.decode('utf8')
text = re.search("<!--\n%%(.|\s)+\n-->", body).group(0)

dict1 = {}
for i in text:
    if i.isalpha():
        dict1[i] = text.count(i)
#print(dict1)
#list(dict1.keys())[list(dict1.values()).index(1)]
list1 = [k for k,v in dict1.items() if v == 1]
print(list1)

得到结果为['e', 'q', 'u', 'a', 'l', 'i', 't', 'y']，则得到下一关的地址：http://www.pythonchallenge.com/pc/def/equality.html