| With the vigorous development of the digital information age,information sharing is becoming more and more common.In order to ensure the integrity of the information,the document is usually converted into a portable document format for preservation.It is very common and strong.At this stage,most of the presentation documents are restored manually,which is inefficient and the file size after restoration is huge.Therefore,a technical means that can reconstruct and optimize the presentation documents is urgently needed.Based on this research background,this paper analyzes the portable document through finegrained structured analysis,reconstructs the structure of the presentation document element,and proposes an adaptive JPEG quantization table optimization strategy,and uses this strategy to compress the presentation document image to achieve the compression and optimization of the document.Finally,a presentation document reconstruction system based on image understanding is designed and implemented.The research content of this paper mainly includes the following four parts:(1)Demonstrate document element extraction and structured parsing.Through the study of reconstructing the data organization of the portable document and the presentation document,the elements of the portable document and the presentation document are parsed from both physical and logical perspectives,and the structure is defined,so that the two types of documents can be transformed interactively.(2)Compression optimization technology of PPT document image.We first demonstrate that compressing document images can reduce document storage space.Then we find that JPEG compression strategy is correlated with brightness,color richness,image edge and image texture.Then we propose an adaptive compression strategy for document-image based on JPEG quantization table.Experiments show that the optimization strategy proposed in this paper can effectively optimize the document without affecting its uses.(3)Demonstration document evaluation.Combined with the intuitive representation of presentation document elements and the underlying data organization,a presentation document evaluation algorithm based on image semantics and XML similarity is proposed.Evaluation experiments show that the image understanding-based demo document reconstruction scheme proposed in this paper is effective and feasible as a whole,and can completely restore the demo document and compress and optimize the document without affecting the look and feel and usage.(4)Research and implementation of a Powerpoint document reconstruction system based on image understanding.Combined with the presentation document structure analysis and optimization reconstruction algorithm proposed in this paper,the presentation document reconstruction system running on the WEB side is researched and implemented,and the system is thoroughly tested.The reconstruction system has high feasibility and effectiveness... |