Bio Softwares using EESSI

A large number of bioinformatics programs are readily available for Komondor users through the EESSI module.

The following software packages are accessible via environment modules:

Software

Description

Link

BCFtools/1.17-GCC-12.2.0, BCFtools/1.18-GCC-12.3.0 (D)

Manipulate and analyze genetic variant data in VCF/BCF formats.

GitHub

BLAST+/2.14.0-gompi-2022b, BLAST+/2.14.1-gompi-2023a (D)

Compare biological sequences against databases to find homologs.

NCBI BLAST

BWA

Align short high-throughput sequencing reads to a reference genome.

GitHub

BamTools/2.5.2

Tools and C++ API for working with BAM files.

GitHub

Bio-DB-HTS/3.01

Perl module for accessing HTS data.

MetaCPAN

Bio-SearchIO-hmmer/1.7.3

BioPerl parser for HMMER output.

MetaCPAN

BioPerl/1.7.8 (D)

Perl modules for biological computation.

bioperl.org

Biopython/1.81, 1.83 (D)

Python tools for biological computation.

biopython.org

Bowtie2/2.5.1

Ultrafast tool for aligning sequencing reads.

Bowtie2

CD-HIT/4.8.1

Cluster protein or nucleotide sequences.

CD-HIT

Cassiopeia/2.0.0

Reconstruct single-cell lineages.

GitHub

DIAMOND/2.1.8

Fast protein aligner, BLAST-compatible.

GitHub

DendroPy

Python library for phylogenetic analysis.

dendropy.org

ETE/3.1.3

Toolkit for phylogenetic trees.

ETEtookit

FragGeneScan/1.31

Predict genes in fragmented metagenomic reads.

GitHub

Tombo/1.5.1

Detect DNA modifications in nanopore data.

Tombo

Transrate/1.0.3

Evaluate de novo transcriptome assemblies.

GitHub

VCFtools/0.1.16

Tools for VCF file analysis.

VCFtools

VSEARCH/2.30.0

Versatile tool for metagenomics.

GitHub

WhatsHap (D)

Phase variants using long reads.

Docs

bcgTree/1.2.1

Build bacterial core genome trees.

GitHub

crb-blast/0.6.9

Check redundancy in BLAST hits.

GitHub

cutadapt/5.0

Trim adapters and low-quality bases.

Docs

GATK/4.5.0.0

Variant calling toolkit from Broad Institute.

GATK

fastp/0.23.4

Fast preprocessing for FASTQ files.

GitHub

GROMACS/2024.x

Molecular dynamics simulation suite.

GROMACS

Gblocks/0.91b

Eliminate unreliable alignment regions.

Gblocks

GenomeTools/1.6.2

Genome analysis toolkit.

genometools.org

HMMER/3.4

Profile HMM-based sequence searches.

hmmer.org

HTSlib

Library for reading/writing sequencing data.

GitHub

hic-straw/1.3.1

Parse Hi-C interaction data.

GitHub

IQ-TREE/2.3.5

Phylogenetic tree inference using ML.

IQ-TREE

KronaTools/2.8.1

Interactive metagenomic visualization.

GitHub

MAFFT

Multiple sequence alignment program.

MAFFT

MDAnalysis/2.4.2

Analyze molecular dynamics trajectories.

MDAnalysis

MEGAHIT/1.2.9

Ultra-fast metagenomic assembler.

GitHub

MMseqs2/14-7e284

Fast sequence search and clustering.

MMseqs2

MUSCLE/5.1.0

Multiple sequence alignment.

MUSCLE

Mash/2.3

Genome distance estimation using MinHash.

Docs

MetaEuk/6

Gene prediction in metagenomics.

GitHub

MultiQC/1.14

Summarize QC reports across samples.

multiqc.info

Nextflow

Workflow language for reproducible pipelines.

Nextflow

OrthoFinder/2.5.5

Determine orthogroups and species trees.

GitHub

ncbi-vdb

Tools to access NCBI SRA data.

GitHub

Pysam

Python bindings for SAMtools and HTSlib.

Docs

pyfaidx

Random access to FASTA files.

GitHub

R-bundle-Bioconductor

Genomic analysis packages for R.

Bioconductor

RAxML

Phylogenetic inference using ML.

RAxML

SAMtools

Tools for manipulating alignments.

HTSlib

SEPP

Place short sequences into reference trees.

GitHub

Last update by Milán SZŐRI: 2025-06-15