AI 学习笔记(一)

Agent:

they take the current percept as input from the sensors and return an action to the actuators

rationality

• The performance measure that defines the criterion of success.
• The agent’s prior knowledge of the environment.
• The actions that the agent can perform.
• The agent’s percept sequence to date.

simple flex agents:

  • These agents select actions on the basis of the current percept, ignoring the rest of the percept history
    即是根据当前的输入来选择动作
  • condition–action rule: if ... then ...
  • Infinite loops are often unavoidable for simple reflex agents operating in partially observable environments.Simple flex 需要环境是fully observable的。Scaping from infinite loops is possible if the agent can randomize its actions

model-based agents:

  • the agent should maintain some sort of internal state that depends on the percept history and thereby reflects at least some of the unobserved aspects of the current state.
  • An agent that uses such a model is called a model-based agent

goal-based agents:

  • the agent needs some sort of goal information that describes situations that are desirable
  • Although the goal-based agent appears less efficient, it is more flexible because the knowledge that supports its decisions is represented explicitly and can be modified.

Properties of task environments

Fully observable vs. partially observable

If an agent’s sensors give it access to the complete state of the environment at each point in time, then we say that the task environ- ment is fully observable

Single agent vs. multiagent

agent 的个数,开车就是一个,下棋就是两个

Deterministic vs. stochastic

If the next state of the environment is completely determined by the current state and the action executed by the agent, then we say the environment is deterministic; otherwise, it is stochastic(随机)
In principle, an agent need not worry about uncertainty in a fully observable, deterministic environment.

Episodic vs. sequential

the next episode does not depend on the actions taken in previous episodes 前后并无关联
In sequential environments, on the other hand, the current decision could affect all future decisions.例子:chess and taxi

Static vs. dynamic

If the environment can change while an agent is deliberating, then we say the environment is dynamic for that agent; otherwise, it is static.

Discrete vs. continuous

The discrete/continuous distinction applies to the state of the environment, to the way time is handled, and to the percepts and actions of the agent. chess:discrete taxi:continuous

Known vs. unknown

to the agent’s (or designer’s) state of knowledge about the “laws of physics” of the environment. In a known environment, the outcomes (or outcome probabilities if the environment is stochastic) for all actions are given.

什么是admissible heuristic function

In computer science, specifically in algorithms related to pathfinding, a heuristic function is said to be admissible if it never overestimates the cost of reaching the goal, i.e. the cost it estimates to reach the goal is not higher than the lowest possible cost from the current point in the path.

Greedy Best-first Search

每个节点有启发式函数,表示这个节点到终点的预计距离h(n)(the cost to get from the node to the goal),每次选最短的(greed),直到到达终点

Minimax and alpha-beta pruning

The minimax algorithm is a way of finding an optimal move in a two player game. Alpha-beta pruning is a way of finding the optimal minimax solution while avoiding searching subtrees of moves which won't be selected.

Some true of false:

  • Hill-climbing is an entirely deterministic algorithm. F stochastic hill-climbing random selection among uphill moves.
  • DFS has lower asymptotic space complexity than BFS F The limiting behavior of the use of memory space of an algorithm when the size of the problem goes to infinity. 我觉得应该是相反的
  • During search, one usually applies the goal test onto newly expanded children, before queuing-up these children. F 很多算法都是先queue再查
  • A contingency problem involves a nondeterministic and accessible environment. F inaccessible
  • A* is an admissible algorithm. 不确定 A* is admissible 但是是有前提的:启发函数要是admissible的
  • When using the correct temperature decrease schedule, simulated annealing is guaranteed to find the global optimum in finite time. F 不一定
  • Alpha-beta pruning accelerates game playing at the cost of being an approximation to full minimax. 我觉得是对的
  • Genetic algorithms use a step called “failover”. F
  • A perfectly rational backgammon-playing agent never loses F
  • Hill climbing search is best used for problem domains with densely packed goals T
  • The exact evaluation function values do not affect minimax decision as long as the ordering of these values is maintained. F
最后编辑于
©著作权归作者所有,转载或内容合作请联系作者
平台声明:文章内容(如有图片或视频亦包括在内)由作者上传并发布,文章内容仅代表作者本人观点,简书系信息发布平台,仅提供信息存储服务。

推荐阅读更多精彩内容

  • **2014真题Directions:Read the following text. Choose the be...
    又是夜半惊坐起阅读 9,864评论 0 23
  • 2016-05-27懒先生的盐懒人帮 敌人是一种虚幻 人品可以出卖 商战无耻 是无底线的 在生意场上,从来没有永久...
    懒爷邱阅读 408评论 1 1
  • 形如琵琶黄橙色, 面部绒毛硬实软。 气味芳香味更浓, 甜甜味道肉软软。 消食止渴汁水多, 润肤美颜止咳喘。 视力提...
    六月天气阅读 506评论 7 9
  • 紧赶慢赶却还是因为走错方向错过末班车,如果是以前的我会去想如果早一点查路线就好了,一个星期前的我会说一切都是最...
    从妫妤到澜依阅读 351评论 0 0
  • 从出生 一直在岁月的山上攀爬 云雾缭绕 听过云霄之外的鸟 草本兴盛 嗅过蒲公英的羽毛 星河璀璨 遥遥的,向宇宙延伸...
    方牧重阅读 197评论 0 0