The Strong And Weak Identifiability In Causal Discovery

Posted on:2024-06-26

Degree:Master

Type:Thesis

Country:China

Candidate:M J Li

Full Text:PDF

GTID:2530307052495644

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

Exploring and discovering the causal relations among things are core issues in scientific research.The standard gold of such exploration and discovery is to conduct randomized controlled trials(RCTs).However,RCTs often have limitations in practical including expensive cost,limited participation group,and ethical issues.Therefore,discovering causal relations from passively observed data is an essential problem.Indeed,causal discovery methods based on passive observed data have made great progress in the past decades and have been widely used in various fields and such methods have become an unignorable way to discover causal relations.In this kind of method,a core concept is identifiability i.e.,whether the discovered causal model is unique.If so,this causal model is called identifiable,otherwise it’s non-identifiable.A natural and essential question is: For all identifiable causal models,whether they are of the same difficulty to identify? This paper focuses on this question and the main contents and contributions of this paper include:1.For non-linear additive noise model which is identifiable,we point out that the difficulty of identification is different,we then propose to further divide the identifiable causal models into strongly identifiable and weakly identifiable ones according to the difficulty of identification,we also give theoretical definition and criteria with proof.Besides,this paper reveals the relationship between the strength of identifiability and two metrics commonly used in causal discovery: the goodness of fit and the independence of residual.We further points out that due to the ignorance of this point,existing causal discovery methods can only deal with one of the two types(strongly or weakly identifiable)of the problem.2.To cope with the issue that existing causal discovery methods can only handle one of the two types(strongly or weakly identifiable)of the problem,this paper propose a unified framework named GENIUS(A Generic Framework for Non-linear Causal Discovery)for both strongly and weakly identifiable problems.Experiments on synthetic data demonstrate the effectiveness of GENIUS far beyond existing methods.3.In this paper,GENIUS is applied to two real-world problem scenarios,namely,the discovery of the causal relations between the expression levels of various sub-stances in the protein signaling network in bioinformatics which is a traditional causal discovery problem and the discovery of the causal relations between the knowledge concepts mastered by students in the field of intelligent education which aims to help students find the weakness in their knowledge system.Experiments on these real-world problems demonstrate that GENIUS is far more effective on causal discovery than other methods.

Keywords/Search Tags:

Causality, Causal discovery, Identifiability, Additive noise model

PDF Full Text Request

Related items

1	The Discovery Of Disease Causality Knowledge Based On Semantic Predicate Optimization
2	A Study Of High-dimensional Causal Discovery Methods
3	Research Of Causality In Cellular Molecules Interaction Network Based On Functional Causal Models
4	Research On Causality Discovery Algorithm Based On Bayesian Network
5	Research On Discovery Of Causalities Among The Gene Mutations
6	Causal Inference On Additive Model With Confounders
7	An Enhanced Causal Discovery Algorithm Based On Sparse Causal Graph
8	Applying Causal Discovery And Causal Hypothesis Testing To Single-Cell Transcriptomic Data
9	Linear Structure Causal Model Estimation Based On Observed Data
10	Across Boundary Causality And Modeling Study