Semantic Representation and Inference for NLP
Dongsheng Wang

TL;DR
This paper advances NLP semantic inference by developing new datasets, models capturing non-compositional semantics, and innovative deep learning architectures for improved inference accuracy.
Contribution
It introduces a large factual claim dataset, a novel non-compositional semantic model, and multiple deep learning architectures enhancing inference capabilities.
Findings
Created the MultiFC dataset for claim verification
Developed a multi-scale CNN inference model
Proposed models capturing non-compositional semantics
Abstract
Semantic representation and inference is essential for Natural Language Processing (NLP). The state of the art for semantic representation and inference is deep learning, and particularly Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and transformer Self-Attention models. This thesis investigates the use of deep learning for novel semantic representation and inference, and makes contributions in the following three areas: creating training data, improving semantic representations and extending inference learning. In terms of creating training data, we contribute the largest publicly available dataset of real-life factual claims for the purpose of automatic claim verification (MultiFC), and we present a novel inference model composed of multi-scale CNNs with different kernel sizes that learn from external sources to infer fact checking labels. In terms of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Semantic Web and Ontologies
