With the rapid development of digital agriculture,a great deal of agricultural germplasm resource data is acquired.There is abundance of hereditary regularity in these huge data,but little technique can find out the knowledge in these data so far.That means "too much data,too little knowledge".At present time,breeding experts are interested in how to extract useful information and knowledge from the data,which can instruct agricultural management and plant breeding and help us to know genetic character of inbred line and parent resource,to analyze relative,to predict trend and successfully choose inbreeds and parents.Data mining provides an effective method for extracting interesting knowledge in huge data.The research is in the support of natural science foundation of Anhui province Research on germplasm resources characteristics hereditary regularity and dominative combination for Red-using Watermelon.In this paper,the red seed-using watermelon inbred line dataset of 9 continual generations was acquired by abundant experiments and analysis in the field and lab.SPSS software and decision tree,cluster,rough set, canocorrelation etc technologies in Clementine are applied in quantitative inheritance breeding of red seed-using watermelon.The research indicated genetic law of inbred line and the relationships among economic character,which provided theory basis for the inbred line purge,parent choosing and predominance hybrid combination cultivating in process of breed cultivation.The main content and production on our research as followings:The paper introduced the conception,method,status and development of data mining dissertated,the characters and functions of Clementine and SPSS.It also analyzed complexity and diversity of agricultural germplasm resources data.The feasibility and importance of data mining in crop genetic analysis has been researched in this paper.According to the importance of economic characters in breeding cultivation,the relationships between main economic traits for red seed-using watermelon were mined. Path analysis technique was used to interpret the direct and indirect relationship between single-fruit seed-weight and kilo-seed weight,seed weight,seed volume and other traits.According to the classification and relationship between factors of economic characters for red seed-using watermelon,principal component and canocorrelation analysis technique were used to divide economic characters of red seed-using watermelon into product factor,grain weight factor,growth factor,produce seed factor and quality factor.Researches on the relationship between characters within factor indicated significant positive correlation between characters.Decision tree algorithm was used to discovery the knowledge and relationship between main economic characters in dataset of red seed-using watermelon.And the model of decision tree for single-fruit seed-weight was constructed,which can help to analyze economic character of red seed-using watermelon inbred line and provide decision-making for the breed worker when choosing inbred line and parents.Genetic analyzing for red seed-using watermelon inbred line with UPGMA method, this paper find out average genetic distance of red seed-using watermelon inbred line has a down trend and cophenetic correlation coefficient are importance significant, that means the result is reliable. |