| Compositional data is exists in many practices problems,such as the chemical composition of rocks and the consumption structure of residents.One of the concerned issues in the research of compositional data is the interaction relationship between the component variables.However,due to the unit sum—special numerical characteristics of compositional data,this problem becomes more challenging.This paper uses a graphical models to describe the relationships between the variables of the compositional data.This paper is divided into two parts.The first part is the inference of the graphical model structure based on the stability selection.On the basis of the gCoda method,we refer to our method of using stability selection to select the parameters in the penalty likelihood function of the compositional data as ss.gCoda.This method utilizes the information of multiple λ s and improves the accuracy of the estimation.We First write the penalty likelihood function of the compositional data-combining the negative log likelihood with l1 penalty,and then use the MM algorithm to solve it by converting the objective function to a standard glasso problem,and then use stability selection to select the best penalty parameter for this graphical model.Through numerical simulations,we find that the values of tpr and mcc of the ss.gCoda method are higher than those of the gCoda method.By ss.gCoda,more true edges can be found,which leads to a better recovery performance,and also the occurrence of empty model and tpr being zero can be avoided.The second part is the structure learning of the Bayesian network for compositional data.We write the Bayesian network in the form of a matrix according to the joint probability.A DAG can be obtained by the search-based method,as well as its corresponding completed partially directed acyclic graph(CPDAG)and skeleton graph.Through numerical simulations.we find that for a single model,the shd(structural Hamming distance)between the estimated DAG and the real DAG is relatively stable,as well as that between the estimated CPDAG and the real one.This method performs good in recovering true edges.Finally,we apply the gCoda,ss.gCoda and search-based method to the ash data and olive data respectively to explore the relationships between the components in the dust and the compositions in the olive oil. |