| Big data technology and computer science and technology in recent years,rapid development in the geology,biology,medicine and industry in many fields such as to big data technologies are widely used,this technique has become the most familiar people technical vocabulary.The emergence of big data has led to a new understanding of the theory of scientific research methods,which has resulted in a new thinking model.In the face of huge amounts of data,people just need to get valuable information from these data and convert it into knowledge.Large geological data mainly includes public geological data and geological data,core public geological data is mainly used for science popularization,etc.,the core geological data contains a lot of confidential data,cannot be made public,mainly stored in a local area network(LAN),combined with big data technology method,fully tap the potential value of public data,is an important aspect of geology big data applications.The study of geological big data has become an important part of China’s national big data strategy.In the emerging era of big data,it is of great significance to effectively excavate high value data and information,reasonably use geological big data,and scientifically analyze relevant data and information.Hidden in a large amount of data in the field of geology science this very important information,by using big data technology to these important information hidden in the data,thus can promote the continuous development of the discipline and in-depth study.Big data technology in the development of recent weeks have been in many disciplines and has been widely used in the field of using big data to data analysis and mining is of great significance,especially in the field of health care and education,adopts the technology of the large data analysis can bring more convenience to people’s life.This paper mainly studies the related demand of the foreign language text big data applications,sum up big data related basic theoretical knowledge,from large data found theoretical model combined with the large data is the key technology and method,this paper puts forward text data discovery theory model,parts of reasonable solution measures are put forward in the system,technology and application.Found in the data structure tree module based on demand,from two aspects of keywords and urls for thematic information extraction,to ensure the comprehensive and systemic of data,using online tools,set in line with the requirements of rules of web crawler,access to huge amounts of data,and to clean rough for effective project information data;In the data analysis module,the python language is used to realize the translation function of multiple text documents in multiple languages,so as to reduce the human workload and accelerate the translation speed.Finally,discusses the application of geological information service as an example,the application of this technology,combined with literature study,analysis of the traditional way to solve geological information service products abroad related data acquisition and analysis,and integration analysis results. |