Agents include humans, robots, softbots, thermostats, etc.
Percept refers to the agent perceptual input at any given instant
The agent function maps from percept histories to actions:
f : P∗ → A
The agent program implements f on the physical architecture.
A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date(rational ̸= successful)
The environment type largely determines the agent design
The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent
Agent types
Four basic types of agents in order of increasing generality:
– simple reflex agents
Decisions are made on the basis of the current percept only
– reflex agents with state
The internal state keeps track of relevant unobservable aspects of the en- vironment.
– goal-based agents
The goal describes desirable situations. The agent combines goal and en- vironment model to choose actions.
**Planning and search **are AI subfields devoted to building goal-based agents.
– utility-based agents
The utility function internalises the performance measure. Under uncertainty, the agent chooses actions that maximise the expected utility.
All these can be turned into learning agents
另外,Learning agents——The learning element uses feedback from the critic to modify the action selection. The problem generator suggests actions that lead to new informative experience.
Exploration vs Exploitation——A fundamental dilemma for learning agents
Exploitation: greedily uses what the agent has learnt to select the action that will, in the light of the current knowledge, have the best outcome
Exploration: Taking some other (possibly random) action to learn more, hoping to find something even better than what is currently known
Typically, a smart agent explores more in early stages than later on
Summary
- Agents interact with environments through actuators and sensors
- The agent function describes what the agent does in all circumstances
- Agent programs implement agent functions
- The performance measure evaluates the environment sequence
- A perfectly rational agent maximizes expected performance
- All agents can improve their performance through learning
- PEAS descriptions define task environments(Performance measure, Environment, Actuators, Sensors)