Skip to content

Commit

Permalink
n
Browse files Browse the repository at this point in the history
  • Loading branch information
Julia-lirong committed Oct 6, 2023
1 parent 1b031c8 commit bcf74fa
Show file tree
Hide file tree
Showing 4 changed files with 544 additions and 10 deletions.
Binary file added Figure/.DS_Store
Binary file not shown.
Binary file added Figure/figure-github.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
522 changes: 522 additions & 0 deletions README.html

Large diffs are not rendered by default.

32 changes: 22 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,10 @@
# Genotype and Phenotype Network (GPN)

A novel method for multiple phenotype association studies by constructing a bipartite signed network, linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN), which is a new insight to investigate correlation among phenotypes. The GPN can be constructed by both quantitative and qualitative traits, especially phenotypes have extremely unbalanced case-control ratios.
A novel method for multiple phenotype association studies by constructing a bipartite signed network, linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN), which is a new insight to investigate correlation among phenotypes. The GPN can be constructed by both quantitative and qualitative traits, especially phenotypes have extremely unbalanced case-control ratios. The bipartite network shelds the light to Multiple Phenotype Assotiation Test, Colocalization analysis, Heritability Enrichment analysis, and Phenome-Wide Association studies.

This package "GPN" includes the following parts:

- **GPN construction**: Construction of the Genotype and Phenotype Network with the individual-level phenotype and genotype data;
- **Construction of bipartite GPN**: Construction of the Genotype and Phenotype Network with the individual-level phenotype and genotype data or GWAS Summary statistics; Then detect the well-defined representations of GPN by comparing the network properties with the random networks, including connectivity, centrality, and system entropy;
- **Community detection method**: Novel community detection method to partition K phenotypes into disjoint network modules based on the similarity matrix from GPN;
- **Multiple Phenotype Association Tests with and without considering the network modules**: test the association between K phenotypes with a SNP. The phenotypes can be either qualitative or binary, espectially the binary phenotypes with the extremely case-control ratio (the test statistics has been adjusted by the saddlepoint approximation)
- ceCLC: Modified computational efficient clustering linear combination method
Expand All @@ -14,6 +14,8 @@ This package "GPN" includes the following parts:
- O'Brien: Modified O'Brien (SSU) method
- Omnibus: Modified omnibus (Chi-square) method
- TATES: Modified TATES method
- ACAT: Cauchy combination method
- **Heritability Enrichment Analysis**: Construct network topology annotations of genetic variants that quantify the possibility of pleiotropy, and apply stratified linkage disequilibrium (LD) score regression to highly correlated phenotypes to identify enriched annotations.


## Installation
Expand All @@ -28,22 +30,32 @@ library(GPN)
## Reference
Xuewei Cao, Shuanglin Zhang, Qiuying Sha*. A novel method for multiple phenotype association studies based on genotype and phenotype network.

Xuewei Cao #, Lirong Zhu #, Xiaoyu Liang, Shuanglin Zhang, Qiuying Sha*. Constructing genotype and phenotype network helps reveal disease heritability and phenome-wide association studies


**Any questions**? xueweic_AT_mtu_DOT_edu


## Graphical Overview

<p align="center">
<img src="Figure/Figure1 copy.png" width="1000">
<img src="Figure/figure-github.png" width="1000">
</p>

- **a.** Construction of a signed bipartite Genotype and Phenotype Network (GPN). Each phenotype (yellow square) and each SNP form a directed edge which represents the strength of the association, where the red dashed line indicates that the minor allele of the SNP is a protective allele to the phenotype, and the blue dashed line indicates that the minor allele of the SNP is a risk allele to the phenotype.
- **b.** Construction of a signed network, named the Phenotype and Phenotype Network (PPN), which is the one-mode projection of GPN on phenotypes.
- **c.** The powerful community detection method is used to partition phenotypes into disjoint network modules with different colors.
- **d.** Multiple phenotype association tests are applied to test the association between phenotypes in each of the network modules and a SNP, then the Bonferroni correction is used to obtain the overall p-value.
- **e.** GWAS signals are identified by a multiple phenotype association test with or without considering network modules.
- **f.** Functional enrichment analysis based on the detected GWAS signals and the publicly available functional database.
- **g.** Colocalization of GWAS signals and eQTL analysis.
##### Construction of bipartite GPN
- **a – c.** Construction of the denser and well-defined representations of GPN by comparing the network properties with the random networks, including connectivity, centrality, and system entropy. Specifically, each phenotype and each SNP form a directed edge which represents the strength of the association, where the red color indicates that the minor allele of the SNP is a protective allele to the phenotype, and the blue color indicates that the minor allele of the SNP is a risk allele to the phenotype.
- **d.** The weighted degree distributions with different thresholds and the examples of two network topology annotations, approximate betweenness centrality and degree centrality, used in the heritability enrichment analysis;
- **e.** The one-mode projection of GPN onto phenotypes that are linked through shared genetic architecture. Heritability enrichment analysis and phenome-wide association studies are introduced as two important applications of the constructed GPN.

##### Multiple Phenotype Assocaition Tests
- **Construction of PPN.** Construction of a signed network, named the Phenotype and Phenotype Network (PPN), which is the one-mode projection of GPN on phenotypes.
- **Community Detection.** The powerful community detection method is used to partition phenotypes into disjoint network modules with different colors.
- **Multiple Phenotype Assocaition Tests.** Multiple phenotype association tests are applied to test the association between phenotypes in each of the network modules and a SNP, then the Bonferroni correction is used to obtain the overall p-value.
- **Colocalization analysis.** Colocalization of GWAS signals and eQTL analysis.

##### Heritability Enrichment Analysis
- **Enrichment Analysis** Construct network topology annotations of genetic variants that quantify the possibility of pleiotropy, and apply stratified linkage disequilibrium (LD) score regression to highly correlated phenotypes to identify enriched annotations. The constructed network topology annotations are informative for disease heritability after conditioning on a broad set of functional annotations from the baseline-LD model.




Expand Down

0 comments on commit bcf74fa

Please sign in to comment.