stem/AI/Neural Networks/Learning/Credit-Assignment Problem.md

17 lines
826 B
Markdown
Raw Normal View History

vault backup: 2023-06-07 09:02:27 Affected files: STEM/AI/Classification/Classification.md STEM/AI/Classification/Decision Trees.md STEM/AI/Classification/Gradient Boosting Machine.md STEM/AI/Classification/Logistic Regression.md STEM/AI/Classification/Random Forest.md STEM/AI/Classification/Supervised.md STEM/AI/Classification/Supervised/README.md STEM/AI/Classification/Supervised/SVM.md STEM/AI/Classification/Supervised/Supervised.md STEM/AI/Learning.md STEM/AI/Neural Networks/Learning/Boltzmann.md STEM/AI/Neural Networks/Learning/Competitive Learning.md STEM/AI/Neural Networks/Learning/Credit-Assignment Problem.md STEM/AI/Neural Networks/Learning/Hebbian.md STEM/AI/Neural Networks/Learning/Learning.md STEM/AI/Neural Networks/Learning/README.md STEM/AI/Neural Networks/RNN/Autoencoder.md STEM/AI/Neural Networks/RNN/Deep Image Prior.md STEM/AI/Neural Networks/RNN/MoCo.md STEM/AI/Neural Networks/RNN/Representation Learning.md STEM/AI/Neural Networks/RNN/SimCLR.md STEM/img/comp-learning.png STEM/img/competitive-geometric.png STEM/img/confusion-matrix.png STEM/img/decision-tree.png STEM/img/deep-image-prior-arch.png STEM/img/deep-image-prior-results.png STEM/img/hebb-learning.png STEM/img/moco.png STEM/img/receiver-operator-curve.png STEM/img/reinforcement-learning.png STEM/img/rnn+autoencoder-variational.png STEM/img/rnn+autoencoder.png STEM/img/simclr.png STEM/img/sup-representation-learning.png STEM/img/svm-c.png STEM/img/svm-non-linear-project.png STEM/img/svm-non-linear-separated.png STEM/img/svm-non-linear.png STEM/img/svm-optimal-plane.png STEM/img/svm.png STEM/img/unsup-representation-learning.png
2023-06-07 09:02:27 +01:00
- Assigning credit/blame for outcomes to each internal decision
- Loading Problem
- Loading a training set into the free parameters
- Important to any learning machine attempting to improve performance in situations involving temporally extended behaviour
Two Sub-problems:
- ***Temporal*** credit-assignment problem
- Assigning credit for **outcomes** to **actions**
- Involves time when actions that deserve credit were taken
- Relevant when many actions taken and want to know which one was responsible
- ***Structural*** credit-assignment problem
- Assigning credit for **actions** to **internal decisions**
- Involves internal structures of actions generated by system
- Relevant for identifying which component should have behaviour altered
- By how much
- Important in MLPs when there are many hidden neurons