Directly to content
  1. Publishing |
  2. Search |
  3. Browse |
  4. Recent items rss |
  5. Open Access |
  6. Jur. Issues |
  7. DeutschClear Cookie - decide language by browser settings

Social Commonsense Reasoning with Structured Knowledge in Text

Paul, Debjit

[thumbnail of MY_PhD_THESIS.pdf]
Preview
PDF, English - main document
Download (5MB) | Terms of use

Citation of documents: Please do not cite the URL that is displayed in your browser location input, instead use the DOI, URN or the persistent URL below, as we can guarantee their long-time accessibility.

Abstract

Understanding a social situation requires the ability to reason about the underlying emotions and behaviour of others. For example, when we read a personal story, we use our prior commonsense knowledge and social intelligence to infer the emotions, motives and anticipate the actions of the characters in a story. For machines to understand text related to personal stories and social conversations, they must be able to make commonsense inferences. While most people can reason deeply about the social implications of the text, it is challenging for natural language processing systems as these implications are often subtle and implicit.

This dissertation argues that NLP systems must learn to reason more explicitly about the underlying social knowledge in text to perform social commonsense reasoning. We divide the above argument into two sub-problems: (i) understanding the underlying social knowledge and (ii) explicitly reasoning about such knowledge for social commonsense reasoning. To address these problems, we propose building NLP systems that integrate neural network-based learning with structured knowledge representations.

In the first part of this dissertation, we study the role of structured commonsense knowledge in understanding the social dynamics of characters and their actions in stories. Our motivation behind enriching the model with structured commonsense knowledge is to bridge the gap between the surface meanings of texts and the underlying social implications of each event in the stories. We develop a novel model that incorporates commonsense knowledge into neural models and showcases the importance of commonsense knowledge in understanding the social dynamics of story characters. Further, we investigate the role of temporal dynamics of story events in understanding social situations. We develop a model that can explicitly learn about what social event follows another event from personal narrative stories. We demonstrate that implicitly leveraging such temporal knowledge about story events can support social commonsense reasoning tasks.

In the second part of this dissertation, we investigate methods to explicitly reason about the knowledge related to social dynamics of characters (behaviour, mental states) and the cause/effect of social events. We propose a novel model named as multi-head knowledge attention that incorporates such social knowledge into state-of-the-art neural NLP models to address two complex commonsense inference tasks. We demonstrate that our method of incorporating knowledge can improve -- (i) the robustness and the interpretability of the model and (ii) the overall performance of the model compared to other knowledge integration methods. We also aim to investigate social commonsense reasoning as a natural language generation task. We design a story completion task that requires natural language generation models to perform both forward and backward reasoning. We study the role of contextualized commonsense knowledge in natural language generation tasks. We propose a model that jointly learns to generate contextualized inference rules as well as narrative stories. We demonstrate that our model can outperform state-of-the-art non-contextualized commonsense knowledge-based generation models.

We hope that the research presented in this dissertation will open up interesting scopes for future research involving social commonsense reasoning and other related topics.

Document type: Dissertation
Supervisor: Frank, Prof. Dr. Anette
Place of Publication: Heidelberg
Date of thesis defense: 20 April 2022
Date Deposited: 12 Feb 2024 06:58
Date: 2024
Faculties / Institutes: Neuphilologische Fakultät > Institut für Computerlinguistik
Controlled Keywords: Commonsense Reasoning, Natural Langauge Processing, Natural Language Inference, Interpretability
Additional Information: Commonsense Reasoning, Natural Language Reasoning, Social Commonsense Reasoning, Knowledge-based Reasoning
About | FAQ | Contact | Imprint |
OA-LogoDINI certificate 2013Logo der Open-Archives-Initiative