← Seminars

Facilitate Document Understanding by Leveraging Language Structures

Liyan Xu

Abstract

This work explores the limitations of pretrained language models (PLMs) in effectively processing multi-sentence or multi-paragraph inputs for document understanding tasks. To overcome this challenge, the dissertation investigates the utilization of different intrinsic language structures, including syntactic, discourse, and knowledge-specific structures, to enhance context understanding. The dissertation presents four distinct works that demonstrate the effectiveness of incorporating these structures for machine reading comprehension, coreference resolution, and information extraction tasks. The empirical results of each experiment suggest that modeling these structures can complement the sequence modeling of PLMs and significantly improve performance on document-oriented tasks. Ultimately, this dissertation contributes to the research community's understanding of the potential benefits of leveraging language structures to advance natural language understanding.

Term
Spring 2023
Date
February 24, 2023
Time
4:00 - 5:00 PM
Location
MSC W301