eprintid: 17050 rev_number: 18 eprint_status: archive userid: 1213 dir: disk0/00/01/70/50 datestamp: 2014-06-25 10:46:49 lastmod: 2014-07-24 08:03:36 status_changed: 2014-06-25 10:46:49 type: doctoralThesis metadata_visibility: show creators_name: Roth, Michael title: Inducing Implicit Arguments via Cross-document Alignment: A Framework and its Applications subjects: 400 divisions: 90500 adv_faculty: af-09 cterms_swd: Computerlinguistik cterms_swd: Frame-Semantik abstract: Natural language texts frequently contain related information in different positions in discourse. As human readers, we can recognize such information across sentence boundaries and correctly infer relations between them. Given this inference capability, we understand texts that describe complex dependencies even if central aspects are not repeated in every sentence. In linguistics, certain omissions of redundant information are known under the term ellipsis and have been studied as cohesive devices in discourse (Halliday and Hasan, 1976). For computational approaches to semantic processing, such cohesive devices are problematic because methods are traditionally applied on the sentence level and barely take surrounding context into account. In this dissertation, we investigate omission phenomena on the level of predicate-argument structures. In particular, we examine instances of structures involving arguments that are not locally realized but inferable from context. The goal of this work is to automatically acquire and process such instances, which we also refer to as implicit arguments, to improve natural language processing applications. Our main contribution is a framework that identifies implicit arguments by aligning and comparing predicate-argument structures across pairs of comparable texts. As part of this framework, we develop a novel graph-based clustering approach, which detects corresponding predicate-argument structures using pairwise similarity metrics. To find discourse antecedents of implicit arguments, we further design a heuristic method that utilizes automatic annotations from various linguistic pre-processing tools. We empirically validate the utility of automatically induced instances of implicit arguments and discourse antecedents in three extrinsic evaluation scenarios. In the first scenario, we show that our induced pairs of arguments and antecedents can successfully be applied to improve a pre-existing model for linking implicit arguments in discourse. In two further evaluation settings, we show that induced instances of implicit arguments, together with their aligned explicit counterparts, can be used as training material for a novel model of local coherence. Given discourse-level and semantic features, this model can predict whether a specific argument should be explicitly realized to establish local coherence or whether it is inferable and hence redundant in context. date: 2014 id_scheme: DOI id_number: 10.11588/heidok.00017050 ppn_swb: 1658708717 own_urn: urn:nbn:de:bsz:16-heidok-170501 date_accepted: 2013-12-03 advisor: HASH(0x556120873db0) language: eng bibsort: ROTHMICHAEINDUCINGIM2014 full_text_status: public citation: Roth, Michael (2014) Inducing Implicit Arguments via Cross-document Alignment: A Framework and its Applications. [Dissertation] document_url: https://archiv.ub.uni-heidelberg.de/volltextserver/17050/1/thesis.pdf