System Design Of Data Collection System For College Entrance Examination Aspiration Plan

Posted on: 2022-10-29
Degree: Master
Type: Thesis
Country: China
Candidate: R Wang
Full Text: PDF
GTID: 2517306575481334
Subject: Project management
Abstract/Summary:
College application recommendation for the college entrance examination is an emerging industry that has developed in recent years on the basis of information and big data technology. Application recommendation depends on the enrollment plan information for the current year. However, enrollment plans are almost always published by the provincial examination institutes as paper books; only a few provinces publish a PDF version on their official websites, and at present no province releases structured data that a system can use directly. Because the period between publication of the enrollment plan book and the early-batch application filing is short, there is an urgent need for a method that can quickly convert the paper or PDF version of the enrollment plan book into structured relational data. Existing OCR technology can accurately recognize the text in an image, but it cannot reorganize the recognized text according to the business logic of its context, and it cannot directly produce structured relational data usable by an application recommendation system. This thesis achieves the logical reorganization of text recognition results with a finite state machine. The method first analyzes the image layout information to obtain the explicit row state of each text line, then uses business logic to identify all hidden row states, and builds a finite state machine over the row states. By combining explicit row states with the text features of each line, it logically reorganizes the recognition results and finally yields structured relational data with contextual relations. Practical results show that the method has been applied successfully to collecting enrollment plan data for Shanxi, Hebei, Guangdong, Liaoning and other provinces. It identifies affiliation relationships correctly, is fast and accurate, corrects errors automatically, and can greatly improve the efficiency and accuracy of enrollment plan data collection.
Figure: 41; Table: 0; Reference: 40...
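The finite-state-machine reorganization described in the abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the thesis's implementation: the three row states (`college`, `major`, `note`), the `indent` field standing in for layout information taken from the image, the regular expressions, and the sample rows are all hypothetical.

```python
import re
from dataclasses import dataclass

# Hypothetical row states; the thesis does not list its actual states.
COLLEGE, MAJOR, NOTE = "college", "major", "note"

# Allowed transitions of this sketch FSM: state -> states that may follow.
TRANSITIONS = {
    "start": {COLLEGE},
    COLLEGE: {COLLEGE, MAJOR, NOTE},
    MAJOR: {COLLEGE, MAJOR, NOTE},
    NOTE: {COLLEGE, MAJOR, NOTE},
}


@dataclass
class Line:
    text: str    # OCR text of one row
    indent: int  # assumed stand-in for layout information from the image


def classify(line):
    """Pick a row state from the layout cue plus text features."""
    if line.indent == 0 and re.match(r"^\d{4}\s", line.text):
        return COLLEGE
    if line.indent > 0 and re.match(r"^\d{2}\s.*\s\d+$", line.text):
        return MAJOR
    return NOTE


def rebuild(lines):
    """Walk the rows through the FSM and emit nested records."""
    state = "start"
    colleges = []
    for line in lines:
        nxt = classify(line)
        if nxt not in TRANSITIONS[state]:
            raise ValueError(f"unexpected {nxt} row after {state}: {line.text!r}")
        if nxt == COLLEGE:
            code, name = line.text.split(maxsplit=1)
            colleges.append({"code": code, "name": name, "majors": []})
        elif nxt == MAJOR:
            code, rest = line.text.split(maxsplit=1)
            name, quota = rest.rsplit(maxsplit=1)
            colleges[-1]["majors"].append(
                {"code": code, "name": name, "quota": int(quota), "notes": []}
            )
        else:
            # A note belongs to the latest major, or to the college if none yet.
            majors = colleges[-1]["majors"]
            target = majors[-1] if majors else colleges[-1]
            target.setdefault("notes", []).append(line.text)
        state = nxt
    return colleges


# Made-up sample rows, not taken from any real enrollment plan book.
sample = [
    Line("1001 Example University", 0),
    Line("01 Computer Science 30", 1),
    Line("(taught on the main campus)", 2),
    Line("02 Mathematics 25", 1),
    Line("1002 Sample Institute", 0),
    Line("01 Physics 10", 1),
]
plan = rebuild(sample)
```

Here `plan` is a list of college records, each holding its majors, so the affiliation between a major row and the college row above it is carried by the nesting rather than by row position. A row that arrives in a state the table does not allow (for example a major before any college) raises an error, which is one simple way such a machine can flag recognition mistakes.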
Keywords/Search Tags:data collection, layout recognition, finite state machine, logical text reconstruction