clusterProfiler - A Universal Enrichment Tool for Interpreting Omics Data
A universal tool for interpreting functional characteristics of omics data. It supports Over-Representation Analysis (ORA) and Gene Set Enrichment Analysis (GSEA) for both coding and non-coding genomics data of thousands of species. It provides a unified and tidy interface to access, manipulate, and visualize enrichment results. A key capability is the simultaneous analysis and comparison of datasets from multiple treatments or time points. Furthermore, it integrates Large Language Model (LLM) capabilities to provide automated and insightful interpretation of enrichment results.
Last updated
annotationclusteringgenesetenrichmentgokeggmultiplecomparisonpathwaysreactomevisualizationenrichment-analysisgseaquarto
17.35 score 1.2k stars 47 dependents 16k scripts 42k downloadsggtree - an R package for visualization of tree and annotation data
'ggtree' extends the 'ggplot2' plotting system which implemented the grammar of graphics. 'ggtree' is designed for visualization and annotation of phylogenetic trees and other tree-like structures with their annotation data.
Last updated
alignmentannotationclusteringdataimportmultiplesequencealignmentphylogeneticsreproducibleresearchsoftwarevisualizationannotationsggplot2phylogenetic-trees
17.32 score 927 stars 110 dependents 7.4k scripts 46k downloadsGOSemSim - GO-terms Semantic Similarity Measures
The semantic comparisons of Gene Ontology (GO) annotations provide quantitative ways to compute similarities between genes and gene groups, and have became important basis for many bioinformatics analysis approaches. GOSemSim is an R package for semantic similarity computation among GO terms, sets of GO terms, gene products and gene clusters. GOSemSim implemented five methods proposed by Resnik, Schlicker, Jiang, Lin and Wang respectively.
Last updated
annotationgoclusteringpathwaysnetworksoftwarebioinformaticsgene-ontologysemantic-similaritycpp
14.47 score 72 stars 68 dependents 840 scripts 34k downloadsChIPseeker - ChIPseeker for ChIP peak Annotation, Comparison, and Visualization
This package implements functions to retrieve the nearest genes around the peak, annotate genomic region of the peak, statstical methods for estimate the significance of overlap among ChIP peak data sets, and incorporate GEO database for user to compare the own dataset with those deposited in database. The comparison can be used to infer cooperative regulation and thus can be used to generate hypotheses. Several visualization functions are implemented to summarize the coverage of the peak experiment, average profile and heatmap of peaks binding to TSS regions, genomic annotation, distance to TSS, and overlap of peaks or genes.
Last updated
annotationchipseqsoftwarevisualizationmultiplecomparisonatac-seqchip-seqcomparisonepigeneticsepigenomics
13.27 score 256 stars 4 dependents 2.7k scripts 5.1k downloadstidytree - A Tidy Tool for Phylogenetic Tree Data Manipulation
Phylogenetic tree generally contains multiple components including node, edge, branch and associated data. 'tidytree' provides an approach to convert tree object to tidy data frame as well as provides tidy interfaces to manipulate tree data.
Last updated
phylogenetic-treetidyversetree-data
12.91 score 60 stars 135 dependents 1.0k scripts 54k downloadsscatterpie - Scatter Pie Plot
Creates scatterpie plots, especially useful for plotting pies on a map.
Last updated
12.88 score 66 stars 69 dependents 1.3k scripts 42k downloadsaplot - Decorate a 'ggplot' with Associated Information
For many times, we are not just aligning plots as what 'cowplot' and 'patchwork' did. Users would like to align associated information that requires axes to be exactly matched in subplots, e.g. hierarchical clustering with a heatmap. Inspired by the 'Method 2' in 'ggtree' (G Yu (2018) <doi:10.1093/molbev/msy194>), 'aplot' provides utilities to aligns associated subplots to a main plot at different sides (left, right, top and bottom) with axes exactly matched.
Last updated
12.50 score 129 stars 121 dependents 988 scripts 55k downloadsggimage - Use Image in 'ggplot2'
Supports image files and graphic objects to be visualized in 'ggplot2' graphic system.
Last updated
ggplot2imagevisualization
12.22 score 178 stars 22 dependents 4.0k scripts 8.8k downloads
treeio - Base Classes and Functions for Phylogenetic Tree Input and Output
'treeio' is an R package to make it easier to import and store phylogenetic tree with associated data; and to link external data from different sources to phylogeny. It also supports exporting phylogenetic tree with heterogeneous associated data to a single tree file and can be served as a platform for merging tree with associated data and converting file formats.
Last updated
softwareannotationclusteringdataimportdatarepresentationalignmentmultiplesequencealignmentphylogeneticsexporterparserphylogenetic-trees
12.00 score 104 stars 126 dependents 1.9k scriptsggbreak - Set Axis Break for 'ggplot2'
An implementation of scale functions for setting axis breaks of a 'gg' plot (S Xu (2021) <doi:10.3389/fgene.2021.774846>).
Last updated
axis-breakcut-plotggbreakggplot2wrap-plot
11.06 score 146 stars 2 dependents 2.2k scripts 10k downloadsscholar - Analyse Citation Data from Google Scholar
Provides functions to extract citation data from Google Scholar. Convenience functions are also provided for comparing multiple scholars and predicting future h-index values.
Last updated
10.89 score 51 stars 2 dependents 539 scripts 5.2k downloadsggfun - Miscellaneous Functions for 'ggplot2'
Useful functions and utilities for 'ggplot' object (e.g., geometric layers, themes, and utilities to edit the object).
Last updated
10.54 score 18 stars 162 dependents 115 scripts 57k downloadsggtreeExtra - An R Package To Add Geometric Layers On Circular Or Other Layout Tree Of "ggtree"
'ggtreeExtra' extends the method for mapping and visualizing associated data on phylogenetic tree using 'ggtree'. These associated data can be presented on the external panels to circular layout, fan layout, or other rectangular layout tree built by 'ggtree' with the grammar of 'ggplot2'.
Last updated
softwarevisualizationphylogeneticsannotation
10.54 score 97 stars 2 dependents 684 scripts 2.2k downloadsyulab.utils - Supporting Functions for Packages Maintained by 'YuLab-SMU'
Miscellaneous functions commonly used by 'YuLab-SMU'.
Last updated
10.38 score 38 stars 256 dependents 42 scripts 81k downloads
ggmsa - Plot Multiple Sequence Alignment using 'ggplot2'
A visual exploration tool for multiple sequence alignment and associated data. Supports MSA of DNA, RNA, and protein sequences using 'ggplot2'. Multiple sequence alignment can easily be combined with other 'ggplot2' plots, such as phylogenetic tree Visualized by 'ggtree', boxplot, genome map and so on. More features: visualization of sequence logos, sequence bundles, RNA secondary structures and detection of sequence recombinations.
Last updated
softwarevisualizationalignmentannotationmultiplesequencealignment
9.94 score 217 stars 1 dependents 344 scripts 1.1k downloadstidydr - Unify Dimensionality Reduction Results
Dimensionality reduction (DR) is widely used in many domain for analyzing and visualizing high-dimensional data. 'tidydr' provides uniform output and is compatible with multiple methods, including 'prcomp', 'mds', 'Rtsne'. etc.
Last updated
9.77 score 15 stars 58 dependents 163 scripts 23k downloadsMicrobiotaProcess - A comprehensive R package for managing and analyzing microbiome and other ecological data within the tidy framework
MicrobiotaProcess is an R package for analysis, visualization and biomarker discovery of microbial datasets. It introduces MPSE class, this make it more interoperable with the existing computing ecosystem. Moreover, it introduces a tidy microbiome data structure paradigm and analysis grammar. It provides a wide variety of microbiome data analysis procedures under the unified and common framework (tidy-like framework).
Last updated
visualizationmicrobiomesoftwaremultiplecomparisonfeatureextractionmicrobiome-analysismicrobiome-data
9.50 score 192 stars 196 scripts 812 downloadsaisdk - Unified Interface for AI Model Providers
A production-grade AI toolkit for R featuring a layered architecture (Specification, Utilities, Providers, Core), request interception support, robust error handling with exponential retry delays, support for multiple AI model providers ('OpenAI', 'Anthropic', etc.), local small language model inference, distributed 'MCP' ecosystem, multi-agent orchestration, progressive knowledge loading through skills, and a global skill store for sharing AI capabilities.
Last updated
8.91 score 66 stars 48 dependents 62 scripts 14k downloadsggsc - Visualizing Single Cell and Spatial Transcriptomics
Useful functions to visualize single cell and spatial data. It supports visualizing 'Seurat', 'SingleCellExperiment' and 'SpatialExperiment' objects through grammar of graphics syntax implemented in 'ggplot2'.
Last updated
dimensionreductiongeneexpressionsinglecellsoftwarespatialtranscriptomicsvisualizationopenblascppopenmp
7.30 score 51 stars 37 scripts 308 downloadsMicrobiomeProfiler - An R/shiny package for microbiome functional enrichment analysis
This is an R/shiny package to perform functional enrichment analysis for microbiome data. This package was based on clusterProfiler. Moreover, MicrobiomeProfiler support KEGG enrichment analysis, COG enrichment analysis, Microbe-Disease association enrichment analysis, Metabo-Pathway analysis.
Last updated
microbiomesoftwarevisualizationkegg
7.21 score 42 stars 31 scripts 350 downloadsGeoTcgaData - Processing Various Types of Data on GEO and TCGA
Gene Expression Omnibus(GEO) and The Cancer Genome Atlas (TCGA) provide us with a wealth of data, such as RNA-seq, DNA Methylation, SNP and Copy number variation data. It's easy to download data from TCGA using the gdc tool, but processing these data into a format suitable for bioinformatics analysis requires more work. This R package was developed to handle these data.
Last updated
geneexpressiondifferentialexpressionrnaseqcopynumbervariationmicroarraysoftwarednamethylationdifferentialmethylationsnpatacseqmethylationarray
5.83 score 27 stars 25 scripts 343 downloadsSVP - Predicting cell states and their variability in single-cell or spatial omics data
SVP uses the distance between cells and cells, features and features, cells and features in the space of MCA to build nearest neighbor graph, then uses random walk with restart algorithm to calculate the activity score of gene sets (such as cell marker genes, kegg pathway, go ontology, gene modules, transcription factor or miRNA target sets, reactome pathway, ...), which is then further weighted using the hypergeometric test results from the original expression matrix. To detect the spatially or single cell variable gene sets or (other features) and the spatial colocalization between the features accurately, SVP provides some global and local spatial autocorrelation method to identify the spatial variable features. SVP is developed based on SingleCellExperiment class, which can be interoperable with the existing computing ecosystem.
Last updated
singlecellsoftwarespatialtranscriptomicsgenetargetgeneexpressiongenesetenrichmenttranscriptiongokeggopenblascppopenmp
5.56 score 12 stars 6 scripts 331 downloadsivolcano - Interactive Volcano Plot
Generate interactive volcano plots for exploring gene expression data. Built with 'ggplot2', the plots are rendered interactive using 'ggiraph', enabling users to hover over points to display detailed information or click to trigger custom actions.
Last updated
quarto
5.45 score 7 stars 6 scripts 204 downloadsggtreeSpace - Visualizing Phylomorphospaces using 'ggtree'
This package is a comprehensive visualization tool specifically designed for exploring phylomorphospace. It not only simplifies the process of generating phylomorphospace, but also enhances it with the capability to add graphic layers to the plot with grammar of graphics to create fully annotated phylomorphospaces. It also provide some utilities to help interpret evolutionary patterns.
Last updated
annotationvisualizationphylogeneticssoftware
5.32 score 5 stars 14 scripts 234 downloadsshinyTempSignal - Explore Temporal and Other Phylogenetic Signals
Sequences sampled at different time points can be used to infer molecular phylogenies on natural time scales, but if the sequences records inaccurate sampling times, that are not the actual sampling times, then it will affect the molecular phylogenetic analysis. This shiny application helps exploring temporal characteristics of the evolutionary trees through linear regression analysis and with the ability to identify and remove incorrect labels. The method was extended to support exploring other phylogenetic signals under strict and relaxed models.
Last updated
molecular-clockphylogeneticsshiny-app
5.26 score 9 stars 7 scripts 201 downloadsepiSeeker - epiSeeker: an R package for Annotation, Comparison and Visualization of multi-omics epigenetic data
This package implements functions to analyze multi-omics epigenetic data. Data of fragment type and base type are supported by epiSeeker. It provides functions to retrieve the nearest genes around the peak, annotate genomic region of the peak, statistical methods to estimate the significance of overlap among peak data sets, and motif analysis. It incorporates the GEO database for users to compare their own dataset with those deposited in the database. The comparison can be used to infer cooperative regulation and thus can be used to generate hypotheses. Several visualization functions are implemented to summarize the coverage of the peak experiment, average profile and heatmap of peaks binding to TSS regions, genomic annotation, distance to TSS, overlap of peaks or genes, and the single-base resolution epigenetic data by considering the strand, motif, and additional information.
Last updated
annotationchipseqsoftwarevisualizationmultiplecomparisoncoveragemotifannotationgeneregulation
5.15 score 54 downloadsTDbook - Companion Package for the Book "Data Integration, Manipulation and Visualization of Phylogenetic Trees" by Guangchuang Yu (2022, ISBN:9781032233574, doi:10.1201/9781003279242)
The companion package that provides all the datasets used in the book "Data Integration, Manipulation and Visualization of Phylogenetic Trees" by Guangchuang Yu (2022, ISBN:9781032233574, doi:10.1201/9781003279242).
Last updated
5.04 score 13 stars 85 scripts 787 downloadsMMINP - Microbe-Metabolite Interactions-Based Metabolic Profiles Predictor
Implements a computational framework to predict microbial community-based metabolic profiles with 'O2PLS' model. It provides procedures of model training and prediction. Paired microbiome and metabolome data are needed for modeling, and the trained model can be applied to predict metabolites of analogous environments using new microbial feature abundances.
Last updated
metabolite-predictionmetabolitesmicrobes
4.90 score 16 stars 9 scripts 176 downloadsseqmagick - Sequence Manipulation Utilities
Tools for reading and writing biological sequences in multiple formats, including 'FASTA', 'PHYLIP', 'CLUSTAL', 'STOCKHOLM', 'MEGA' and 'GenBank'. Supports interleaved and sequential layouts where applicable, converts between formats, and manipulates sequence sets (e.g., filtering by patterns and computing consensus sequences from alignments). Also includes functions to download nucleotide records from NCBI by accession.
Last updated
4.77 score 6 stars 2 dependents 13 scripts 1.2k downloadswget - Setting Download Method to 'wget'
Provides function, wget_set(), to change the method (default to 'wget -c') using in download.file(). Using 'wget -c' allowing continued downloading, which is especially useful for slow internet connection and for downloading large files. User can run wget_unset() to restore previous setting.
Last updated
4.51 score 16 stars 40 scripts 292 downloadsaplotExtra - Creating Composite Plots using 'aplot'
Many complex plots are actually composite plots, such as 'oncoplot', 'funkyheatmap', 'upsetplot', etc. We can produce subplots using 'ggplot2' and combine them to create composite plots using 'aplot'. In this way, it is easy to customize these complex plots, by adding, deleting or modifying subplots in the final plot. This package provides a set of utilities to help users to create subplots and complex plots.
Last updated
4.48 score 10 stars 1 scripts 208 downloadswikiprofiler - 'WikiPathway' Based Data Integration and Visualization
Queries online 'WikiPathway' graphics and allows mapping user data (e.g., expression values) on the graph. The package designs a grammar of graphic syntax that using pipe operator to add graphic layer.
Last updated
3.95 score 6 stars 7 scripts 548 downloadsmirrorselect - Test CRAN/Bioconductor Mirror Speed
Testing CRAN and Bioconductor mirror speed by recording download time of 'src/base/COPYING' (for CRAN) and 'packages/release/bioc/html/ggtree.html' (for Bioconductor).
Last updated
3.60 score 8 stars 1 scripts 174 downloads