| Recently, owing to the characteristic of highly readable, self-description, theportability and ease of scalability, XML becomes a common form of data representationand exchange, which is widely used in many fields, such as Web server, distributecomputing systems, electronic business. However, the structures of XML files differ fromone to another, and the processing methods are not uniform, which exert a big burden tothe process work of XML files. Actually, there are some tools could handle theseprocesses now, and can deal with the content of XML files as well. The problem is thesetools are usually too complex to use which extremely lower the processing efficiency.Moreover, each enterprise or team has their own way to process the XML files, so that isnot benefit the communication between them. According to what I have mentioned above,the W3C launched a standard to normalization the operation of XML document processingin May2010, which is called Xproc. With Xproc to deal with the operation of XML files,programmers only need to know the basic operation processes of XML files, and need notpay too much concern about the processing details.Based on the deep understand of the basic concept and syntax of Xproc, research themajor operating Xproc offered. Meanwhile, analysis one of the Xproc processor—Calumet,especially concentrate on how to realize the functions its support, the interface its offered andits architecture. In addition, design the XML document processing system based on theunderstanding of Xproc and Calumet, and according to the structured analysis methodanalyzed the major operating of XML document processing, workflow, and processing details,furthermore, follow the principle of software design that high cohesion and low coupling.Used the structure of three-tier model to design the architecture of system, and implement itwith Java. The main manipulation of XML document processing is descript by Xproc, andthen processed by Calumet which is one of the mature Xproc engine in nowadays. TheApache FOP processor is a plug-in of Calumet to accomplish the output format as PDF.Moreover, some of the auxiliary detect function of XML related syntax is completed by athird-part open source project which is called Saxon. In order to simplify the operation ofusers, the easy-used user interface was designed. The test result reveals that this system cancomplete the major XML file operation. |