access icon free Training Restricted Boltzmann Machine Using Gradient Fixing Based Algorithm

Most of the algorithms for training restricted Boltzmann machines (RBM) are based on Gibbs sampling. When the sampling algorithm is used to calculate the gradient, the sampling gradient is the approximate value of the true gradient and there is a big error between the sampling gradient and the true gradient, which seriously affects the training effect of the network. Aiming at this problem, this paper analysed the numerical error and orientation error between the approximate gradient and the true gradient. Their influence on the performance of network training is given then. An gradient fixing model was established. It was designed to adjust the numerical value and orientation of the approximate gradient and reduce the error. We also designed gradient fixing based Gibbs sampling training algorithm (GFGS) and gradient fixing based parallel tempering algorithm (GFPT), and the comparison experiment of the novel algorithms and the existing algorithms is given. It has been demonstrated that the new algorithms can effectively tackle the issue of gradient error, and can achieve higher training accuracy at a reasonable expense of computational runtime.

Inspec keywords: gradient methods; sampling methods; learning (artificial intelligence); Boltzmann machines; parallel algorithms

Other keywords: GFGS; approximate gradient; sampling gradient; gradient fixing model; gradient error; Gibbs sampling training algorithm; true gradient; restricted Boltzmann machines training; numerical error; gradient fixing based parallel tempering algorithm; orientation error; network training; GFPT; sampling algorithm

Subjects: Learning in AI (theory); Optimisation techniques; Interpolation and function approximation (numerical analysis); Parallel programming and algorithm theory; Neural nets (theory); Other topics in statistics

http://iet.metastore.ingenta.com/content/journals/10.1049/cje.2018.05.007
Loading

Related content

content/journals/10.1049/cje.2018.05.007
pub_keyword,iet_inspecKeyword,pub_concept
6
6
Loading