This repository was archived by the owner on Feb 7, 2023. It is now read-only.

Commit f71695d

Aapo Kyrola authored and facebook-github-bot committed
fix CuDNN RecurrentOp Gradient init
Summary: The CuDNN RecurrentNet GradientOp did not pass the DROPOUT information to the initializer, causing an incorrect scratch space size to be estimated. We have an assertion enforcing that the scratch space is the same for the forward and backward ops, so this failed an assertion. We currently hard-code dropout to 1.0, so this has had no effect on correctness in our tests. For some reason there was no issue with num_layers=1, but with num_layers>=2 the scratch space sizes differed.

Reviewed By: salexspb

Differential Revision: D4904715

fbshipit-source-id: 780266c5ecf1f7a32387edcb6fc498a13ac782ac
1 parent f6bbde0 commit f71695d
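
To make the failure mode concrete, below is a minimal, hypothetical C++ sketch (not the Caffe2 source) of how cuDNN derives an RNN scratch/workspace size from a descriptor that embeds the dropout configuration. The helper name RnnWorkspaceBytes and its parameters are invented for illustration, and the descriptor setup assumes cuDNN v5-style signatures. If the forward op and the gradient op build their descriptors with different dropout state information, the two size queries can disagree.

// Hypothetical sketch (not from the Caffe2 source): the RNN descriptor embeds
// a dropout descriptor, and cuDNN computes workspace/reserve sizes from that
// descriptor, so forward and gradient ops must be initialized with the same
// dropout configuration to arrive at the same scratch-space size.
#include <cudnn.h>

size_t RnnWorkspaceBytes(
    cudnnHandle_t handle,
    float dropout_ratio,
    void* dropout_states,         // persistent dropout RNG state buffer (device memory)
    size_t dropout_states_bytes,
    int seq_length,
    const cudnnTensorDescriptor_t* x_descs,  // one input descriptor per time step
    int hidden_size,
    int num_layers) {
  cudnnDropoutDescriptor_t dropout_desc;
  cudnnCreateDropoutDescriptor(&dropout_desc);
  cudnnSetDropoutDescriptor(
      dropout_desc, handle, dropout_ratio,
      dropout_states, dropout_states_bytes, /*seed=*/0);

  cudnnRNNDescriptor_t rnn_desc;
  cudnnCreateRNNDescriptor(&rnn_desc);
  // cuDNN v5-style setup; later versions use cudnnSetRNNDescriptor_v6/_v8.
  cudnnSetRNNDescriptor(
      rnn_desc, hidden_size, num_layers, dropout_desc,
      CUDNN_LINEAR_INPUT, CUDNN_UNIDIRECTIONAL, CUDNN_LSTM, CUDNN_DATA_FLOAT);

  size_t workspace_bytes = 0;
  cudnnGetRNNWorkspaceSize(handle, rnn_desc, seq_length, x_descs, &workspace_bytes);

  cudnnDestroyRNNDescriptor(rnn_desc);
  cudnnDestroyDropoutDescriptor(dropout_desc);
  return workspace_bytes;
}

The assertion mentioned in the summary effectively requires that this kind of size computation return the same value when run from the forward op and from the gradient op; per the summary, initializing the gradient side without the dropout information broke that invariant for num_layers >= 2.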

File tree

1 file changed: +1, -1 lines changed


caffe2/operators/recurrent_op_cudnn.cc

Lines changed: 1 addition & 1 deletion
@@ -295,7 +295,7 @@ template <typename T>
 bool RecurrentGradientOp<T>::RunOnDevice() {
   const int seqLength = Input(INPUT).dim32(0);
   if (Input(INPUT).dims() != cachedInputDims_) {
-    initialize(Input(INPUT));
+    initialize(Input(INPUT), Output(DROPOUT_STATES));
     cachedInputDims_ = Input(INPUT).dims();
   }
   CUDNN_ENFORCE(cudnnGetRNNTrainingReserveSize(
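
The one-line fix passes the DROPOUT_STATES output blob into initialize() so the gradient op configures its dropout state from the same buffer as the forward op. As a hedged illustration of what an initializer typically does with such a blob (a generic sketch, not the actual Caffe2 initialize() implementation; SetupDropout and its parameters are hypothetical):

// Generic sketch of binding a dropout-state buffer to a cuDNN dropout
// descriptor; the real Caffe2 op presumably resizes the DROPOUT_STATES output
// tensor to the required size instead of calling cudaMalloc directly.
#include <cuda_runtime.h>
#include <cudnn.h>

void SetupDropout(
    cudnnHandle_t handle,
    cudnnDropoutDescriptor_t dropout_desc,
    float dropout_ratio,
    void** states,          // stands in for the DROPOUT_STATES blob's data pointer
    size_t* states_bytes) {
  // Ask cuDNN how large the dropout RNG state buffer must be.
  cudnnDropoutGetStatesSize(handle, states_bytes);
  // Allocate (or, in the real op, resize the output tensor to) that many bytes.
  cudaMalloc(states, *states_bytes);
  // Bind the buffer to the dropout descriptor; the RNN descriptor built on top
  // of it is what cudnnGetRNNWorkspaceSize / cudnnGetRNNTrainingReserveSize
  // later measure, so forward and gradient ops must share this configuration.
  cudnnSetDropoutDescriptor(
      dropout_desc, handle, dropout_ratio, *states, *states_bytes, /*seed=*/0);
}

Because the forward op already owns this buffer, reusing it from the gradient op, rather than initializing without any dropout information, keeps the two scratch-space estimates identical.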
