Home

Awesome

DEAPLOG(Differentially Expression Analysis and Pseudotemporal Locating and Ordering of Gene by Single-cell RNA-seq data)

DEAPLOG is a tool to perform differentially expression analysis for cell clusters and other conditions, calculate the pseudotime of genes and profile genes coordinates accoding to the embedding coordinates of cells.

<p align="center"><img src="figures/Fig1_v1.jpg" alt="DEAPLOG" width="100%"></p>

DEAPLOG consists of three core functions:

Installation

PLOGS depends on numpy, scipy, pandas, scanpy,anndata. The package is available on pip and conda, and can be easily installed as follows:

pip install deaplog

Usage and Documentation

1. identifing marker genes for cell clusters: <br>

The inputs of DEAPLOG is the AnnData object of normlized counts of scRNA-seq data with pre-annotated cell clusters.

deaplog.get_DEG_uniq(rdata, adata,group_key='leiden',power=11,ratio=0.2,p_threshold=0.01,q_threshold=0.05) #find genes that are differentially expressed in only one cell type

or

deaplog.get_DEG_multi(rdata, adata,group_key='leiden',power=11,ratio=0.2,p_threshold=0.01,q_threshold=0.05) #find genes that are differentially expressed in one or more  cell types

the rdata: the Anndata of normlized counts of scRNA-seq data;<br> the adata: the Anndata of scRNA-seq data with pre-annotated cell clusters;<br> the group_key: the label for cell clusters;<br> the power: a parameter for nonlinear regression of gene expression pattern;<br> the ratio: the proportion of gene expression in cell cluter;<br> the ratio: the proportion of gene expression in cell cluter,the value is between 0 and 1;<br> the p_threshold : the threshold of p-value. the value is between 0 and 1;<br> the q_threshold : the threshold of q_value. the value is between 0 and 1;<br>

2. calculate the pseudotime of genes and profile genes map accoding to cell map:<br>

deaplog.get_genes_location_pseudotime(rdata, adata,group_key='leiden',power=11,gene_matrix= markers_s,obsm='X_umap',)

the rdata: the Anndata of normlized counts of scRNA-seq data;<br> the adata: the Anndata of scRNA-seq data with pre-annotated cell clusters;<br> the group_key: the label for cell clusters;<br> the power: a parameter for nonlinear regression of gene expression pattern;<br><br> the gene_matrix: a data.frame producted by get_DEG_uniq or get_DEG_multi. the obsm: the keys of adata.obsm.