Update readme

2017-11-13 00:34:59 +05:30 · 2017-11-13 00:34:59 +05:30 · a28a0760d5
commit a28a0760d5
parent fb274aa3fb
1 changed files with 42 additions and 4 deletions
--- a/README.md
+++ b/README.md
@ -30,7 +30,7 @@ Following are the constructor parameters:
 | bias | `True` | Bias |
 | batch_first | `True` | Whether data is fed batch first |
 | dropout | `0` | Dropout between layers in the controller |
-| bidirectional | `False` | If the controller is bidirectional (Not yet implemented) |
+| bidirectional | `False` | If the controller is bidirectional (Not yet implemented |
 | nr_cells | `5` | Number of memory cells |
 | read_heads | `2` | Number of read heads |
 | cell_size | `10` | Size of each memory cell |
@ -45,11 +45,11 @@ Following are the forward pass parameters:
 | --- | --- | --- |
 | input | - | The input vector `(B*T*X)` or `(T*B*X)` |
 | hidden | `(None,None,None)` | Hidden states `(controller hidden, memory hidden, read vectors)` |
-| reset_experience | `False` | Whether to reset memory (This is a parameter for the forward pass) |
-| pass_through_memory | `True` | Whether to pass through memory (This is a parameter for the forward pass) |
+| reset_experience | `False` | Whether to reset memory (This is a parameter for the forward pass |
+| pass_through_memory | `True` | Whether to pass through memory (This is a parameter for the forward pass |


-Example usage:
+### Example usage:

 ```python
 from dnc import DNC
@ -72,6 +72,44 @@ output, (controller_hidden, memory, read_vectors) = \
  rnn(torch.randn(10, 4, 64), (controller_hidden, memory, read_vectors, reset_experience=True))
 ```

+### Debugging:
+
+The `debug` option causes the network to return its memory hidden vectors (numpy `ndarray`s) for the first batch each forward step.
+These vectors can be analyzed or visualized, using visdom for example.
+
+```python
+from dnc import DNC
+
+rnn = DNC(
+  input_size=64,
+  hidden_size=128,
+  rnn_type='lstm',
+  num_layers=4,
+  nr_cells=100,
+  cell_size=32,
+  read_heads=4,
+  batch_first=True,
+  gpu_id=0,
+  debug=True
+)
+
+(controller_hidden, memory, read_vectors) = (None, None, None)
+
+output, (controller_hidden, memory, read_vectors), debug_memory = \
+  rnn(torch.randn(10, 4, 64), (controller_hidden, memory, read_vectors, reset_experience=True))
+```
+
+Memory vectors returned by forward pass (`np.ndarray`):
+
+| Key | Y axis (dimensions) | X axis (dimensions) |
+| --- | --- | --- |
+| `debug_memory['memory']` | layer * time | nr_cells * cell_size
+| `debug_memory['link_matrix']` | layer * time | nr_cells * nr_cells
+| `debug_memory['precedence']` | layer * time | nr_cells
+| `debug_memory['read_weights']` | layer * time | read_heads * nr_cells
+| `debug_memory['write_weights']` | layer * time | nr_cells
+| `debug_memory['usage_vector']` | layer * time | nr_cells
+
 ## Example copy task

 The copy task, as descibed in the original paper, is included in the repo.