17 lines
826 B
Markdown
17 lines
826 B
Markdown
|
- Assigning credit/blame for outcomes to each internal decision
|
||
|
- Loading Problem
|
||
|
- Loading a training set into the free parameters
|
||
|
- Important to any learning machine attempting to improve performance in situations involving temporally extended behaviour
|
||
|
|
||
|
Two Sub-problems:
|
||
|
- ***Temporal*** credit-assignment problem
|
||
|
- Assigning credit for **outcomes** to **actions**
|
||
|
- Involves time when actions that deserve credit were taken
|
||
|
- Relevant when many actions taken and want to know which one was responsible
|
||
|
- ***Structural*** credit-assignment problem
|
||
|
- Assigning credit for **actions** to **internal decisions**
|
||
|
- Involves internal structures of actions generated by system
|
||
|
- Relevant for identifying which component should have behaviour altered
|
||
|
- By how much
|
||
|
|
||
|
- Important in MLPs when there are many hidden neurons
|