stem/AI/Neural Networks/RNN
andy 23991f92c9 vault backup: 2023-05-31 21:29:04
Affected files:
.obsidian/app.json
.obsidian/workspace-mobile.json
.obsidian/workspace.json
Events/🪣🪣🪣.md
STEM/AI/Neural Networks/CNN/FCN/README.md
STEM/AI/Neural Networks/CNN/GAN/README.md
STEM/AI/Neural Networks/CNN/README.md
STEM/AI/Neural Networks/MLP/README.md
STEM/AI/Neural Networks/README.md
STEM/AI/Neural Networks/RNN/README.md
STEM/AI/Neural Networks/SLP/README.md
STEM/AI/Neural Networks/Transformers/README.md
Untitled.canvas
2023-05-31 21:29:04 +01:00
..
LSTM.md vault backup: 2023-05-26 06:37:13 2023-05-26 06:37:13 +01:00
README.md vault backup: 2023-05-31 21:29:04 2023-05-31 21:29:04 +01:00
RNN.md vault backup: 2023-05-26 06:37:13 2023-05-26 06:37:13 +01:00
VQA.md vault backup: 2023-05-26 06:37:13 2023-05-26 06:37:13 +01:00

Recurrent Neural Network

  • Hard to train on long sequences
    • Weights hold memory
      • Implicit
    • Lots to remember

Text Analysis

  • Train sequences of text character-by-character
    • Maintains state vector representing data up to current token
    • Combines state vector with next token to create new vector
    • In theory, info from one token can propagate arbitrarily far down the sequence
      • In practice suffers from vanishing gradient
        • Can't extract precise information about previous tokens

!rnn-input.png !rnn-recurrence.png