DiVAS: Digital-video-audio-sketch capture, retrieval, and understanding of unstructured multimodal design knowledge

Posted on:2007-02-22

Degree:Ph.D

Type:Dissertation

University:Stanford University

Candidate:Yin, Zhen

Full Text:PDF

GTID:1458390005483622

Subject:Engineering

Abstract/Summary:

Managing and reusing knowledge can lead to greater competitive advantages and improved designs. However, design knowledge reuse often fails in the AEC (Architect, Engineering, and Construction) industry. I performed an ethnographic study focusing on one phase of the building life cycle, i.e. the shop drawing production and review process. Observations show that most of the design and detailing knowledge among the project stakeholders is typically communicated through discourse, sketch, and gesture. The knowledge created during such informal multimodal communicative events is typically lost. Even if multimedia technology is used to capture such events, the archived digital content is unstructured. Consequently, users are facing a formidable challenge to retrieve relevant content from the digital archive. This research addresses the need to capture, retrieve, and understand such digital design knowledge in the domain of shop drawing production and review.; DiVAS(TM) (Digital, Video, Audio, and Sketch), a multimedia information capture and retrieval system was developed and implemented to support these needs. A cross-media relevance model was formalized to integrate gesture-discourse-sketch digital data. This provides a macro (gesture and keywords in discourse transcript) and micro (sketch) index to a multimodal digital knowledge archive. I-Dialogue(TM) prototype was developed and implemented as a module of DiVAS(TM) system. It supports effective information retrieval over imperfect discourse transcripts. I-Dialogue(TM) provides two information retrieval algorithms that: (1) identify significant notions from imperfect speech transcripts clustered through vector analysis; and (2) disambiguate notions in the specific professional domain. I-Dialogue(TM) can be used as part of DiVAS(TM) to process speech transcripts, or as an independent system. DiVAS(TM) leverages the cross-media information retrieval process by using the macro-micro-index, which is constructed based on the formalized cross-media relevance model.; An evaluation methodology was developed to assess the recall and precision performance of DiVAS(TM) and I-Dialogue(TM). An academic test bed and a benchmark document collection from Linguistic Data Consortium were used. The evaluation results demonstrate the performance improvements achieved by DiVAS(TM) and I-Dialogue(TM) in support of effective capture, retrieval, and understanding of digital content.

Keywords/Search Tags:

Divas, Digital, Retrieval, Capture, I-dialogue, Multimodal, Sketch

Related items

1	Research And Implementation Of Multimodal Teaching Dialogue Systems
2	Research And Implementation Of Multimodal Dialogue Management System
3	Free-hand Sketch Based Visual Retrieval Study
4	Research On Retrieval-based Dialogue System Based On Knowledge Fusion Matching Network
5	Research On Retrieval-based Multi-turn Dialogue With Deep Context Modeling
6	Research And Implementation Of Sketch-based Image Retrieval
7	3D Model Retrieval System Based On Freehand Sketch
8	Human Motion Retrieval Based On Hand-drawn Sketch
9	Research On Key Technologies And System Implementation Of Multimodal Human-computer Interaction And Dialogue
10	The Recognition And Retrieval Of Hand-drawn Sketches