Bidirectional RNN and Deep RNN
Original Source: https://www.coursera.org/specializations/deep-learning
Bidirectional RNN
Suppose we want to do named entity recognition, and the following two examples appear in the training set: ‘He said, “Teddy bears are on sale!”’ and ‘He said, “Teddy Roosevelt was a great President!”’.
If information flows only from left to right, the model has no way to tell, at the step where it reads “Teddy”, whether the word is part of a person’s name: the disambiguating words (“bears” vs. “Roosevelt”) come later in the sentence.
To solve this kind of problem, the bidirectional RNN (BRNN) was introduced.
A BRNN runs two forward propagations over the sequence: one from left to right and one from right to left. At each time step, the prediction combines the activations from both passes.
For each time step \(t\), the output is computed as follows:
\[\hat{y}^{<t>} = g(W_y[\overrightarrow{a}^{<t>}, \overleftarrow{a}^{<t>}] + b_y)\]
In natural language processing, a BRNN with LSTM units is the most commonly used variant.
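To make the two passes concrete, here is a minimal NumPy sketch of a BRNN forward pass, assuming tanh cells and a softmax output; the parameter names (`Wf`, `bf`, `Wb`, `bb`, `Wy`, `by`) are illustrative, not from the course:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def brnn_forward(x, Wf, bf, Wb, bb, Wy, by):
    """Bidirectional RNN forward pass over a sequence x of T input vectors.

    Wf, Wb: (n_a, n_a + n_x) recurrent weights for the left-to-right
            and right-to-left passes; Wy: (n_y, 2 * n_a) output weights.
    """
    T, n_a = len(x), bf.shape[0]

    # Pass 1: left to right -- each step sees the past of the sequence.
    a_fwd, a = [], np.zeros(n_a)
    for t in range(T):
        a = np.tanh(Wf @ np.concatenate([a, x[t]]) + bf)
        a_fwd.append(a)

    # Pass 2: right to left -- each step sees the future of the sequence.
    a_bwd, a = [None] * T, np.zeros(n_a)
    for t in reversed(range(T)):
        a = np.tanh(Wb @ np.concatenate([a, x[t]]) + bb)
        a_bwd[t] = a

    # Combine both activations at every step:
    # y_hat<t> = g(Wy [a_fwd<t>, a_bwd<t>] + by)
    return [softmax(Wy @ np.concatenate([a_fwd[t], a_bwd[t]]) + by)
            for t in range(T)]
```

Note that the right-to-left pass is still forward propagation, not backpropagation: both passes run before any prediction is made. This is why a BRNN needs the entire input sequence before it can output anything, which is a drawback for applications such as real-time speech recognition.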
Deep RNN
We can also make the RNN model more expressive by stacking recurrent layers vertically, so that each layer’s activations at a given time step become the inputs to the layer above.
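In a deep RNN, the activation of layer \(l\) at time step \(t\) is computed from the same layer’s previous activation and the current activation of the layer below:
\[a^{[l]<t>} = g(W_a^{[l]}[a^{[l]<t-1>}, a^{[l-1]<t>}] + b_a^{[l]})\]
Because backpropagation now runs through both time and depth, deep RNNs rarely stack more than a few recurrent layers. Most frameworks expose this stacking directly; as a sketch, in PyTorch (the layer sizes below are arbitrary):

```python
import torch
import torch.nn as nn

# Three recurrent layers stacked vertically; bidirectional=True also
# runs each layer in both directions, as in the BRNN above.
rnn = nn.RNN(input_size=8, hidden_size=16, num_layers=3,
             bidirectional=True, batch_first=True)

x = torch.randn(2, 5, 8)   # (batch, time steps, input features)
out, h_n = rnn(x)          # out: (2, 5, 32) -- 2 directions * hidden_size
```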