1 |
Jozefowicz R, Zaremba W, and Sutskever I (2015). An empirical exploration of recurrent network architectures. In Proceedings of the 32nd International Conference on International Conference on Machine Learning (ICML'15), pp 2342-2350.
|
2 |
Kyubyong P (2016). Pre-trained word vectors of 30+ languages. Available from: https://github.com/Kyubyong/wordvectors
|
3 |
Mikolov T, Chen K, Corrado G, and Dean J (2013). Efficient Estimation of Word Representations in Vector Space, arXiv preprint arXiv:1301.3781.
|
4 |
Papineni K, Roukos S, Ward T, and Zhu WJ (2002). BLEU: a Method for Automatic Evaluation of Machine Translation, Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics (ACL), pp. 311-318.
|
5 |
Pennington J, Socher R, and Manning CD (2014). GloVe: Global Vectors for Word Representation.
|
6 |
Sutskever I, Vinyals O, and Le QV (2014). Sequence to Sequence Learning with Neural Networks, arXiv preprint arXiv:1409.3215.
|
7 |
Wu Y, Schuster M, Chen Z, Le QV, and Norouzi M (2016). Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation, arXiv:1609.08144, Retrived 2018.
|
8 |
Chollet F (2017). Deep Learning with Python, Manning, New York.
|
9 |
Bojanowski P, Grave E, Joulin A, and Mikolov T (2016). Enriching Word Vectors with Subword Information.
|
10 |
Bahdanau D, Cho K, and Bengio Y (2015). Neural machine translation by jointly learning to align and translate. In ICLR.
|
11 |
Cho K, Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, and Bengio Y (2014). Learning Phrase Representations using RNN Encoder Decoder for Statistical Machine Translation, arXiv preprint arXiv:1406.1078.
|
12 |
Chung J, Gulcehre C, Cho K, and Bengio Y (2014). Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling, arXiv preprint arXiv:1412.3555.
|
13 |
Facebook Inc. Word vectors for 157 languages, Available from: https://fasttext.cc/docs/en/crawl-vectors.html
|
14 |
Grave E, Bojanowski P, Gupta P, Joulin A, and Mikolov T (2018). Learning Word Vectors for 157 Languages, arXiv preprint arXiv:1802.06893.
|
15 |
Jordan MI (1986). Attractor dynamics and parallelism in a connectionist sequential machine, Cogitive Science Conference, pp 531-546.
|
16 |
Greff K, Srivastava RK, Koutnik J, Steunebrink BR, and Schmidhuber J (2015). LSTM: A Search Space Odyssey, arXiv preprint arXiv:1503.04069.
|
17 |
Hochreiter S and Schmidhuber J (1997). Long short-term memory, Neural Computation, 9, 1735-1780.
DOI
|