<> "The repository administrator has not yet configured an RDF license."^^ . <> . . "Response-Based and Counterfactual Learning for Sequence-to-Sequence Tasks in NLP"^^ . "Many applications nowadays rely on statistical machine-learnt models, such as a rising\r\nnumber of virtual personal assistants. To train statistical models, typically large amounts\r\nof labelled data are required which are expensive and difficult to obtain. In this thesis, we\r\ninvestigate two approaches that alleviate the need for labelled data by leveraging feedback to model outputs instead. Both scenarios are applied to two sequence-to-sequence\r\ntasks for Natural Language Processing (NLP): machine translation and semantic parsing\r\nfor question-answering. Additionally, we define a new question-answering task based on\r\nthe geographical database OpenStreetMap (OSM) and collect a corpus, NLmaps v2, with\r\n28,609 question-parse pairs. With the corpus, we build semantic parsers for subsequent experiments. Furthermore, we are the first to design a natural language interface to OSM, for\r\nwhich we specifically tailor a parser.\r\nThe first approach to learn from feedback given to model outputs, considers a scenario\r\nwhere weak supervision is available by grounding the model in a downstream task for\r\nwhich labelled data has been collected. Feedback obtained from the downstream task is\r\nused to improve the model in a response-based on-policy learning setup. We apply this\r\napproach to improve a machine translation system, which is grounded in a multilingual\r\nsemantic parsing task, by employing ramp loss objectives. Next, we improve a neural semantic parser where only gold answers, but not gold parses, are available, by lifting ramp\r\nloss objectives to non-linear neural networks. In the second approach to learn from feedback, instead of collecting expensive labelled data, a model is deployed and user-model\r\ninteractions are recorded in a log. 
This log is used to improve a model in a counterfactual off-policy learning setup. We first exemplify this approach on a domain adaptation task for machine translation. Here, we show that counterfactual learning can be applied to tasks with large output spaces and that, in contrast to prevalent theory, deterministic logs can successfully be used for sequence-to-sequence tasks in NLP. Next, we demonstrate on a semantic parsing task that counterfactual learning can also be applied when the underlying model is a neural network and feedback is collected from human users. Applying both approaches to the same semantic parsing task allows us to draw a direct comparison between them. Response-based on-policy learning outperforms counterfactual off-policy learning, but requires expensive labelled data for the downstream task, whereas interaction logs for counterfactual learning can be easier to obtain in various scenarios.

Year: 2019
Author: Carolin Lawrence
Record: #26477
Documents:
- 20190510_Thesis_Carolin.pdf (PDF)
- indexcodes.txt (Other)
Subjects (DDC):
- 000 Generalities, Science
- 004 Data processing, Computer science
- 310 General statistics
- 400 Linguistics
- 420 English
- 490 Other languages
- 500 Natural sciences and mathematics
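As an illustrative note on the counterfactual off-policy setup described in the abstract: such methods are commonly built on inverse propensity scoring (IPS), which reweights logged feedback by how likely the new model is to produce the logged output. The sketch below uses hypothetical names and is not the thesis implementation; in particular, with a deterministic logging policy the propensity is effectively 1, the case the abstract argues can still be used successfully.

```python
# Sketch of an inverse-propensity-scored (IPS) estimate of a new model's
# expected reward, computed purely from a log of past interactions.
# Hypothetical setup: each log entry stores the input x, the output y the
# deployed system produced, the probability (propensity) the system
# assigned to y, and the reward (user feedback) received.

def ips_estimate(log, new_model_prob):
    """log: list of (x, y, propensity, reward) tuples.
    new_model_prob(x, y): probability of y under the model being evaluated.
    Returns the IPS estimate of the new model's expected reward.
    """
    total = 0.0
    for x, y, propensity, reward in log:
        total += reward * new_model_prob(x, y) / propensity
    return total / len(log)
```

Maximising such an estimate with respect to the new model's parameters yields an off-policy learning objective: no fresh labelled data is needed, only the log.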