Taxon-aware analysis of clustered protein sequences
-
Updated
Jul 1, 2024 - Python
Taxon-aware analysis of clustered protein sequences
Pangenome-Analysis.py is written for extraction of core, accessory and unique genomes (Gene families and proteins) along with filtration of single copy orthologues, creation of binary and count matrices as well as removal of duplicates.
NextFlow pipelines and Python scripts for comprehensive analyses of proteomes data, providing: Clustering via MMseqs2; Labelling, filtering and aggregation of clusters; Alignments and Variation analyses; Meta clustering comparisons
Add a description, image, and links to the proteome-clustering topic page so that developers can more easily learn about it.
To associate your repository with the proteome-clustering topic, visit your repo's landing page and select "manage topics."