Scaling up Natural Gradient by Sparsely Factorizing the Inverse Fisher Matrix
Published on Dec 05, 20151803 Views
Second-order optimization methods, such as natural gradient, are difficult to apply to high-dimensional problems, because they require approximately solving large linear systems. We present FActorized