COMP9021 Principles of Programming Lab9

Q1

Using a stack to evaluate fully parenthesised expressions.
Possible interaction is shown below:

$ python3
...
>>> from fully_parenthesised import *
>>> evaluate(’100’)
100
>>> evaluate(’[(1 - 20) + 300]’)
281
>>> evaluate(’[1 - {20 + 300}]’)
-319
>>> evaluate(’( { 20*4 }/5 )’)
16.0
>>> evaluate(’(20*[4/5])’)
16.0
>>> evaluate(’({1 + (20 * 30)} - [400 / 500])’)
600.2
>>> evaluate(’{1 + [((20*30)-400) / 500]}’)
1.4
>>> evaluate(’[1 + {(2 * (3+{4*5})) / ([6*7]-[8/9]) }]’)
2.1189189189189186
>>> evaluate(’100 + 3’)
>>> evaluate(’(100 + 3’)
>>> evaluate(’(100 + -3)’)
>>> evaluate(’(100  50)’)
>>> evaluate(’(100 / 0)’)

思路是先检查输入的expression，将合法字符以string的格式按顺序保存在一个list中。对list中的每个元素顺序处理，如果是数字或者运算符则直接压入栈中，如果是任何的右括号，则弹出栈中前三项进行计算。计算前先检查这三项是否是数字+运算符+数字的合法输入，如果不是则return空，如果是则计算相应数值再压入栈中。直到栈为空，结束处理，返回当前栈中最后的一个数字。

import re
from operator import add, sub, mul, truediv
from stack_adt import Stack, EmptyStackError

def evaluate(expression):
    '''
    Checks whether an expression is a full parenthesised expression,
    and in case the answer is yes, returns the value of the expression,
    provided that no division by 0 is attempted; otherwise, return None.

    >>> evaluate('100')
    100
    >>> evaluate('[(1 - 20) + 300]')
    281
    >>> evaluate('[1 - {20 + 300}]')
    -319
    >>> evaluate('( { 20*4 }/5 )')
    16.0
    >>> evaluate('(20*[4/5])')
    16.0
    >>> evaluate('({1 + (20 * 30)} - [400 / 500])')
    600.2
    >>> evaluate('{1 + [((20*30)-400) / 500]}')
    1.4
    >>> evaluate('[1 + {(2 * (3+{4*5})) / ([6*7]-[8/9]) }]')
    2.1189189189189186
    >>> evaluate('100 + 3')
    >>> evaluate('(100 + 3')
    >>> evaluate('(100 + -3)')
    >>> evaluate('(100  50)')
    >>> evaluate('(100 / 0)')
    '''
    if any(not (c.isdigit() or c.isspace() or c in '+-*/()[]{}') for c in expression):
        return
    #如果表达式中含有数字、空格或者+-*/()[]{}之外的内容，则无法计算，返回void
    tokens = re.compile('(\d+|\+|-|\*|/|\(|\)|\[|\]|\{|\})').findall(expression)
    #将表达式中所有的数字+-*/()[]{}识别出来，每一个形成一个string，放入tokens的list中
    try:
        value = evaluate_expression(tokens)
        return value
    except ZeroDivisionError:
        return

def evaluate_expression(tokens):
    stack = Stack()
    for token in tokens:
        try:
            if token in '+-*/':
                stack.push(token)
            else:
                token = int(token)
                stack.push(token)
            #把所有的数字和运算符号压栈
        except ValueError:
            if token in ')]}':
                try:
                    arg_2 = stack.pop()
                    operator = stack.pop()
                    arg_1 = stack.pop()
                    if operator not in '+-*/' or str(arg_1) in '+-*/' or str(arg_2) in '+-*/':
                        return
                    stack.push({'+': add, '-': sub, '*': mul, '/': truediv}[operator](arg_1, arg_2))
                except EmptyStackError:
                    return
                #如果token是任何一个括号的结尾，则代表要进行一次四则运算，弹出栈中前三项，分别是第二个运算数字、运算符和第一个运算数字
    if stack.is_empty():
        return
    value = stack.pop()
    if not stack.is_empty():
        return
    return value

if __name__ == '__main__':
    import doctest
    doctest.testmod()

Q2

Write a program word_ladder.py that computes all transformations of a word word_1 into a word word_2, consisting of sequences of words of minimal length, starting with word_1, ending in word_2, and such that two consecutive words in the sequence differ by at most one letter. All words have to occur in a dictionary with name dictionary.txt, stored in the working directory.
It is convenient and effective to first create a dictionary whose keys are all words in the dictionary with one letter replaced by a “slot”, the value for a given key being the list of words that match the key with the “slot” being replaced by an appropriate letter. From this dictionary, one can then build a dictionary with words as keys, and as value for a given key the list of words that differ in only one letter from the key.
The program implements a function word_ladder(word_1, word_2) that returns the list of all solutions, a solution being as previously described.
Next is a possible interaction.

$ python3
...
>>> from word_ladder import *
>>> for ladder in word_ladder(’cold’, ’warm’): print(ladder)
...
[’COLD’, ’CORD’, ’WORD’, ’WORM’, ’WARM’]
[’COLD’, ’CORD’, ’WORD’, ’WARD’, ’WARM’]
[’COLD’, ’CORD’, ’CARD’, ’WARD’, ’WARM’]
>>> for ladder in word_ladder(’three’, ’seven’): print(ladder)
...
[’THREE’, ’THREW’, ’SHREW’, ’SHRED’, ’SIRED’, ’SITED’, ’SATED’, ’SAVED’, ’SAVER’, ’SEVER’, ’SEVEN’]

这个题目Eric的思路非常巧妙，我在自己解答的时候不能很好的解决双循环遍历字典用时过长的问题。所以此处贴出对Eric的代码分析：

# Written by Eric Martin for COMP9021


from collections import defaultdict, deque
import sys


'''
Computes all transformations from a word word_1 to a word word_2, consisting of
sequences of words of minimal length, starting with word_1, ending in word_2,
and such that two consecutive words in the sequence differ by at most one letter.
All words have to occur in a dictionary with name dictionary.txt, stored in the
working directory.
'''


dictionary_file = 'dictionary.txt'

def get_words_and_word_relationships():
    try:
        with open(dictionary_file) as dictionary:
            lexicon = set()
            contextual_slots = defaultdict(list)
            for word in dictionary:
                word = word.rstrip()
                #去除单词后面的空格
                lexicon.add(word)
                #将所有单词存入lexicon的set中
                for i in range(len(word)):
                    contextual_slots[word[: i], word[i + 1: ]].append(word)
                #每一个word，以删除一个字母的剩余的前后部分构成的一个tuple做key，word做value，构建一个contextual_slots的dict
                #例如：('A', 'RON'): ['AARON', 'AKRON', 'APRON']
                #dict是hash的，效率高，string拆分效率也很高
            closest_words = defaultdict(set)
            for slot in contextual_slots:
            #遍历所有的slot，他们的特点是元组里都缺了一个字母，但是这些slot的字典value都是彼此相差一个字母的
                for i in range(len(contextual_slots[slot])):
                #遍历所有slot字典value的list中的word，i作为第一个备选单词
                    for j in range(i + 1, len(contextual_slots[slot])):
                    #遍历所有slot字典value中i之后list中的word，j作为第二个备选单词，i与j成为只差一个字母的一对单词
                        closest_words[contextual_slots[slot][i]].add(contextual_slots[slot][j])
                        closest_words[contextual_slots[slot][j]].add(contextual_slots[slot][i])
                        #在closest_words字典中i的key后面加入j的value，j的key后面加入i的value
                        #例如：'APE': {'ABE', 'ACE', 'AGE', 'ALE', 'APT', 'ARE', 'ATE', 'AWE', 'AYE'}
            return lexicon, closest_words
    except FileNotFoundError:
        print(f'Could not open {dictionary_file}. Giving up...')
        sys.exit()

def word_ladder(word_1, word_2):
    lexicon, closest_words = get_words_and_word_relationships()
    word_1 = word_1.upper()
    word_2 = word_2.upper()
    #将输入的单词全部大写，因为字典中都是大写的WORD，避免key value error
    if len(word_1) != len(word_2) or word_1 not in lexicon or word_2 not in lexicon:
        return []
    if word_1 == word_2:
        return [[word_1]]
    solutions = []
    queue = deque([[word_1]])
    #从word_1开始建立一个queue，用deque的原因是契合这里的前后队列操作，效率高，且队列里存放的是单词变换的一个list，不是某一个单词
    while queue:
        word_sequence = queue.pop()
        #取出一个单词变换过程的list
        last_word = word_sequence[-1]
        #取出变换list的最后一个单词
        for word in closest_words[last_word]:
        #基于最后一个单词，找出所有和它差一个字母的单词
            if word == word_2:
            #如果word和目标word_2一致，意味着已经找到了一个解
                if not solutions or len(solutions[-1]) > len(word_sequence):
                    solutions.append(word_sequence + [word])
                    #如果solutions为空，或者solutions最后一个解的长度大于当前找到的解的长度减1的数值，则把新找到的解加入solution中
                    #这里巧妙的地方在于len(solutions[-1]) > len(word_sequence)允许变换长度一致的解被加入solution中
            if not solutions and word not in word_sequence:
                queue.appendleft(word_sequence + [word])
                #如果还没有形成解，则在队列左侧加入新一次变换后的list。
                #放在左侧是BFS的思路，把所有未形成解的变换全部处理完，再处理新的变换。
                #也就是说长度从小到大，遍历每个长度上所有可能，由此找到最短的变换，而不是找到所有从word1到word2的方法
    return solutions

最后编辑于：2017.12.11 01:43:48

人面猴
序言：七十年代末，一起剥皮案震惊了整个滨河市，随后出现的几起案子，更是在滨河造成了极大的恐慌，老刑警刘岩，带你破解...
沈念sama阅读 216,470评论 6赞 501
死咒
序言：滨河连续发生了三起死亡事件，死亡现场离奇诡异，居然都是意外死亡，警方通过查阅死者的电脑和手机，发现死者居然都...
沈念sama阅读 92,393评论 3赞 392
救了他两次的神仙让他今天三更去死
文/潘晓璐我一进店门，熙熙楼的掌柜王于贵愁眉苦脸地迎上来，“玉大人，你说我怎么就摊上这事。” “怎么了？”我有些...
开封第一讲书人阅读 162,577评论 0赞 353
道士缉凶录：失踪的卖姜人
文/不坏的土叔我叫张陵，是天一观的道长。经常有香客问我，道长，这世上最难降的妖魔是什么？我笑而不...
开封第一讲书人阅读 58,176评论 1赞 292
港岛之恋（遗憾婚礼）
正文为了忘掉前任，我火速办了婚礼，结果婚礼上，老公的妹妹穿的比我还像新娘。我一直安慰自己，他们只是感情好，可当我...
茶点故事阅读 67,189评论 6赞 388
恶毒庶女顶嫁案：这布局不是一般人想出来的
文/花漫我一把揭开白布。她就那样静静地躺着，像睡着了一般。火红的嫁衣衬着肌肤如雪。梳的纹丝不乱的头发上，一...
开封第一讲书人阅读 51,155评论 1赞 299
城市分裂传说
那天，我揣着相机与录音，去河边找鬼。笑死，一个胖子当着我的面吹牛，可吹牛的内容都是我干的。我是一名探鬼主播，决...
沈念sama阅读 40,041评论 3赞 418
双鸳鸯连环套：你想象不到人心有多黑
文/苍兰香墨我猛地睁开眼，长吁一口气：“原来是场噩梦啊……” “哼！你这毒妇竟也来了？” 一声冷哼从身侧响起，我...
开封第一讲书人阅读 38,903评论 0赞 274
万荣杀人案实录
序言：老挝万荣一对情侣失踪，失踪者是张志新（化名）和其女友刘颖，没想到半个月后，有当地人在树林里发现了一具尸体，经...
沈念sama阅读 45,319评论 1赞 310
护林员之死
正文独居荒郊野岭守林人离奇死亡，尸身上长有42处带血的脓包…… 初始之章·张勋以下内容为张勋视角年9月15日...
茶点故事阅读 37,539评论 2赞 332
白月光启示录
正文我和宋清朗相恋三年，在试婚纱的时候发现自己被绿了。大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
茶点故事阅读 39,703评论 1赞 348
活死人
序言：一个原本活蹦乱跳的男人离奇死亡，死状恐怖，灵堂内的尸体忽然破棺而出，到底是诈尸还是另有隐情，我是刑警宁泽，带...
沈念sama阅读 35,417评论 5赞 343
日本核电站爆炸内幕
正文年R本政府宣布，位于F岛的核电站，受9级特大地震影响，放射性物质发生泄漏。R本人自食恶果不足惜，却给世界环境...
茶点故事阅读 41,013评论 3赞 325
男人毒药：我在死后第九天来索命
文/蒙蒙一、第九天我趴在偏房一处隐蔽的房顶上张望。院中可真热闹，春花似锦、人声如沸。这庄子的主人今日做“春日...
开封第一讲书人阅读 31,664评论 0赞 22
一桩弑父案，背后竟有这般阴谋
文/苍兰香墨我抬头看了看天上的太阳。三九已至，却和暖如春，着一层夹袄步出监牢的瞬间，已是汗流浃背。一阵脚步声响...
开封第一讲书人阅读 32,818评论 1赞 269
情欲美人皮
我被黑心中介骗来泰国打工，没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留，地道东北人。一个月前我还...
沈念sama阅读 47,711评论 2赞 368
代替公主和亲
正文我出身青楼，却偏偏与公主长得像，于是被迫代替她去往敌国和亲。传闻我的和亲对象是个残疾皇子，可洞房花烛夜当晚...
茶点故事阅读 44,601评论 2赞 353

COMP9021 Principles of Programming Lab9

Q1

Q2

推荐阅读更多精彩内容