Font Size: a A A

Dialogue Analysis And Automatic Summarization Of Business Dialogues

Posted on:2006-10-17Degree:MasterType:Thesis
Country:ChinaCandidate:X LuFull Text:PDF
GTID:2155360152975919Subject:Foreign Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
The purpose of this thesis is to explore the relevant issues in automatic dialogue summarization (ADS) within the context of the speech summarization project While the majority of techniques in computational linguistics for summarization are "computationally" oriented, our approach to the task is largely "linguistically" oriented.To start with, we build a corpus of dialogues from the business domain and classify them into 16 sub-domains, each of which is concerned with a specific activity in business transaction, such as inquiring, offering, price negotiation, etc. The dialogues are then segmented and tagged with part-of-speech labels.Taking summarization as sentence extraction we try to identify the distinguished linguistic features of dialogues. Through the linguistic analysis of dialogues we define two information structures: the information inquiry dialogue with the parallel information allocation and the negotiation dialogue with the sequential information allocation. Based on this observation we developed different summarization methods for the two types of dialogues in our corpus. We use the Edmundsonian paradigm to assign a score for each utterance in the dialogue, rank the utterances and select the top n (adjustable) utterances as the summary. We also propose a new evaluation method which evaluates both the informativeness and redundancy of the summary. The evaluation results indicate the better performance of our methods compared with the MMR baseline.Taking summarization as language understanding, we adopt the concept of speech act to determine the function of an utterance. Summarization using this method will take three stages: 1) tagging each utterance with proper dialogue act; 2) dialogue act recognition (DAR); 3) information identification using natural language processing techniques. Our work in this thesis is in the first stage, the design of a dialogue act tag-set. We define the tags on the three dimensions: dialogue, task and emotion. We then annotate some price negotiation dialogues. The percentage distribution of tags is given and the related tagging problems are discussed.
Keywords/Search Tags:summarization, automatic dialogue summarization, dialogue analysis, information structure, dialogue act, sentence extraction
PDF Full Text Request
Related items