Multi-way, multilingual neural machine translation

2017-09-01
Firat, Orhan
Cho, Kyunghyun
Sankaran, Baskaran
Yarman Vural, Fatoş Tunay
Bengio, Yoshua
We propose multi-way, multilingual neural machine translation. The proposed approach enables a single neural translation model to translate between multiple languages, with a number of parameters that grows only linearly with the number of languages. This is made possible by having a single attention mechanism that is shared across all language pairs. We train the proposed multi-way, multilingual model on ten language pairs from WMT'15 simultaneously and observe clear performance improvements over models trained on only one language pair. We empirically evaluate the proposed model on low-resource language translation tasks. In particular, we observe that the proposed multilingual model outperforms strong conventional statistical machine translation systems on Turkish-English and Uzbek-English by incorporating the resources of other language pairs. (C) 2016 Elsevier Ltd. All rights reserved
COMPUTER SPEECH AND LANGUAGE

Suggestions

Domain-Structured Chaos in a Hopfield Neural Network
Akhmet, Marat (World Scientific Pub Co Pte Lt, 2019-12-30)
In this paper, we provide a new method for constructing chaotic Hopfield neural networks. Our approach is based on structuring the domain to form a special set through the discrete evolution of the network state variables. In the chaotic regime, the formed set is invariant under the system governing the dynamics of the neural network. The approach can be viewed as an extension of the unimodality technique for one-dimensional map, thereby generating chaos from higher-dimensional systems. We show that the dis...
Single exponent in l1 multiple exponents in l2: consequences for l2
Kurumlu, Zehra; Gracanın Yüksek, Martına; Department of English Language Teaching (2013)
The thesis hypothesized that when an exponent of a linguistic concept in the native language maps onto several different exponents in the target language, learners have difficulty when acquiring those structures in the target language. By contrast, when an exponent of a linguistic concept in the native language and its counterpart in the target language stand in a one-to-one correspondence, the possibility of making errors decreases to a considerable extent. In order to test this hypothesis, I examined thre...
Mobile Robot Heading Adjustment Using Radial Basis Function Neural Networks Controller and Reinforcement Learning
BAYAR, GÖKHAN; Konukseven, Erhan İlhan; Koku, Ahmet Buğra (2008-10-28)
This paper proposes radial basis function neural networks approach to the Solution of a mobile robot heading adjustment using reinforcement learning. In order to control the heading of the mobile robot, the neural networks control system have been constructed and implemented. Neural controller has been charged to enhance the control system by adding some degrees of strength. It has been achieved that neural networks system can learn the relationship between the desired directional heading and the error posi...
Verbal morphemes guide thematic role assignment in children
Özge, Duygu; Arslan Uzundağ, Berna; Küntay Copty, Ayşe Aylin (null; 2014-09-03)
Architectures and Mechanisms for Language Processing (AMLaP)
Representing temporal knowledge in connectionist expert systems
Alpaslan, Ferda Nur (1996-09-27)
This paper introduces a new temporal neural networks model which can be used in connectionist expert systems. Also, a Variation of backpropagation algorithm, called the temporal feedforward backpropagation algorithm is introduced as a method for training the neural network. The algorithm was tested using training examples extracted from a medical expert system. A series of experiments were carried out using the temporal model and the temporal backpropagation algorithm. The experiments indicated that the alg...
Citation Formats
O. Firat, K. Cho, B. Sankaran, F. T. Yarman Vural, and Y. Bengio, “Multi-way, multilingual neural machine translation,” COMPUTER SPEECH AND LANGUAGE, pp. 236–252, 2017, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/42997.