1.School of Aerospace Engineering, Xiamen University, Xiamen, Fujian 361005, China
2.Key Laboratory of Big Data Intelligent Analysis and Decision-marking of Xiamen Province, Xiamen University, Xiamen, Fujian 361005, China
[1] LECUN Y,BENGIO Y,HINTON G.Deep learning[J].Nature,2015,521(7553):436-444.
[2] LOWD D,MEEK C.Adversarial learning[C]//Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery in Data Mining,2005:641-647.
[3] SUTSKEVER I,VINYALS O,LE Q V.Sequence to sequence learning with neural networks[C]//Advances in Neural Information Processing Systems,2014:3104-3112.
[4] VINYALS O,FORTUNATO M,JAITLY N.Pointer networks[C]//Advances in Neural Information Processing Systems,2015:2692-2700.
[5] HOCHREITE R S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[6] SOCHER R,LIN C C Y,NG A Y,et al.Parsing natural scenes and natural language with recursive neural networks[C]//Proceedings of ICML,2011.
[7] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[J].arXiv:1409.0473,2014.
[8] BELLO I,PHAM H,LE Q V,et al.Neural combinatorial optimizationwith reinforcement learning[J].arXiv:1611.09940,2016.
[9] MNIH V,BADIA A P,MIRZA M,et al.Asynchronous methods for deep reinforcement learning[C]//Interntional Conference on Machine Learning,2016:1928-1937.
[10] KHALIL E,DAI H,ZHANG Y,et al.Learning combinatorial optimization algorithms over graphs[C]//Advances in Neural Information Processing Systems,2017:6348-6358.
[11] DAI H,DAI B,SONG L.Discriminative embeddings of latent variable models for structured data[C]//International Conference on Machine Learning,2016:2702-2711.
[12] MNIH V,KAVUKCUOGLU K,SILVER D,et al.Human-level control through deep reinforcement learning[J].Nature,2015,518(7540):529-533.
[13] KOOL W,VAN HOOF H,WELLING M.Attention,learn to solve routing problems![J].arXiv:1803.08475,2018.
[14] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]//Advances in Neural Information Processing Systems,2017:5998-6008.
[15] RENNIE S J,MARCHERET E,MROUEH Y,et al.Self-critical sequence training for image captioning[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2017:7008-7024.
[16] PAPADIMITRIOU C H.The Euclidean travelling salesman problem is NP-complete[J].Theoretical Computer Science,1977,4(3):237-244.
[17] HE K,ZHANG X,REN S,et al.Deep residual learning for image recognition[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition,2016:770-778.
[18] IOFFE S,SZEGEDY C.Batch normalization:accelerating deep network training by reducing internal covariate shift[J].arXiv:1502.03167,2015.
[19] WILLIAMS R J.Simple statistical gradient-following algorithms for connectionist reinforcement learning[J].Machine Learning,1992,8(3/4):229-256.