title: Can oracle-based imitation learning improveneural machine translation with dataaggregation?
creator: Hormann, Luca
subject: ddc-004
subject: 004 Data processing Computer science
description: Through  globalization  of  the  industry  and  the  collaboration  of  nations  world  wide  theimportance  of  machine  translated  documents  is  rising.   Many  new  ideas  and  conceptswere proposed in the recent years to improve the overall quality of machine translation(MT).  A  lot  of  focus  in  machine  learning  is  going  towards  the  research  of  alternativelearning techniques, as the basic existing paradigms such as supervised, unsupervised andreinforcement learning are not a perfect fit for every task.  Imitation learning is a techniquewhich combines the exploratory aspect of reinforcement with the efficiency of supervisedlearning.   Through  the  usage  of  an  interactive  expert,  the  learning  model,  also  calledstudent, is able to obtain intermediate feedback for it’s predictions at any given point intime.  Thereby the student is aware of it’s mistakes by considering the difference of it’s andthe expert’s prediction from this point.  Imitation learning can improve all major problemsfor training artificial neural networks given the right expert.  These are:  the time it takesto train a model,  the acquisition of data and most importantly the overall performancegiven some specific metric.The  most  common  quality  evaluation  metric  used  for  MT  is  BLEU.  It  is  based  on  then-gram precision between the generated translation, given some input, and the referencetranslation.  Therefore it is non-differentiable and can not be used as a loss to train a MTmodel directly. For the training usually the maximum likelihood estimation (MLE) is used,such that the likelihood of each token in the output, given the input sequence, is maximized.This creates a discrepancy between training (MLE) and validation objective (BLEU). Thisthesis tries to overcome this issue by directly learning on the differences of the expectedBLEU from the student and the expert in an imitation learning scenario.  The expert isrepresented by a traditional statistical machine translation (SMT) model that should helpthe student in solving the problems mentioned above.  For this a novel data aggregationmethodAggregateData  using  approximatedBLEUexplorations  (ADBLEU)  based  onimitation  learning  was  implemented.   After  conducting  several  experiments,  validatingdifferent approaches of data aggregation, it is shown that it is not possible to significantlyimprove the state-of-the-art student, due to limitations of the SMT expert.
date: 2021
type: Master's thesis
type: info:eu-repo/semantics/masterThesis
type: NonPeerReviewed
format: application/pdf
identifier: https://archiv.ub.uni-heidelberg.de/volltextserver/30516/1/Master_Thesis.pdf
identifier: DOI:10.11588/heidok.00030516
identifier: urn:nbn:de:bsz:16-heidok-305169
identifier:   Hormann, Luca  (2021) Can oracle-based imitation learning improveneural machine translation with dataaggregation?  [Master's thesis]     
relation: https://archiv.ub.uni-heidelberg.de/volltextserver/30516/
rights: info:eu-repo/semantics/openAccess
rights: http://archiv.ub.uni-heidelberg.de/volltextserver/help/license_urhg.html
language: eng