GitHub - Arkadiy-Garber/FeGenie: HMM-based identification and categorization of iron genes and iron gene operons in genomes and metagenomes

FeGenie

Please see the Wiki page for an introduction and tutorial on how to use this tool.

Citing FeGenie

Garber AI, Nealson KH, Okamoto A, McAllister SM, Chan CS, Barco RA and Merino N (2020) FeGenie: A Comprehensive Tool for the Identification of Iron Genes and Iron Gene Neighborhoods in Genome and Metagenome Assemblies. Front. Microbiol. 11:37. doi: 10.3389/fmicb.2020.00037

Special thanks to Michael Lee for helping to put together the Conda environment for FeGenie. Thanks to Natasha Pavlovikj for creating the Conda recipe for FeGenie. Thanks to Michał Sitko for creating a Dockerfile for FeGenie. Thanks to Michelle Hallenbeck for helping to modernize the installation process.

Installation with Mamba / Conda

FeGenie requires several external dependencies, including Python, R packages, HMMER, DIAMOND, BLAST, Prodigal, and MetaBAT2.

Create the environment with:

mamba create -n fegenie \
  -c conda-forge -c bioconda \
  --strict-channel-priority \
  python=3.10 \
  r-base \
  r-ggplot2 \
  r-stringi \
  r-ggpubr \
  r-reshape \
  r-reshape2 \
  r-tidyverse \
  r-argparse \
  r-ggdendro \
  r-pvclust \
  hmmer \
  diamond \
  prodigal \
  blast \
  metabat2 \
  -y

Activate the environment:

conda activate fegenie

After activating the environment, set up the required environment variables:

echo ${CONDA_PREFIX}
mkdir -p ${CONDA_PREFIX}/etc/conda/activate.d
echo '#!/bin/sh
export PATH="'$(pwd)':$PATH"
export rscripts="'$(pwd)'/rscripts"
export iron_hmms="'$(pwd)'/hmms/iron"' > ${CONDA_PREFIX}/etc/conda/activate.d/env_vars.sh

You can then confirm that the HMM path is available with:

echo ${iron_hmms}

And check that FeGenie is available with:

FeGenie.py -h

When you are done using FeGenie, deactivate the environment with:

conda deactivate

Installation without Conda

git clone https://github.com/Arkadiy-Garber/FeGenie.git
cd FeGenie
bash setup.sh
./FeGenie.py -h

Quick start

Run FeGenie on a directory of genome bins:

FeGenie.py -bin_dir /directory/of/bins/ -bin_ext fasta -t 16

The argument for -bin_ext should match the filename extension of the FASTA files you want analyzed (for example: fa, fasta, fna).

Quick start (if installed using the no-Conda setup)

./FeGenie.py -bin_dir /directory/of/bins/ -bin_ext fasta -t 16 -out output_fegenie

The hmms/iron directory can be found within the main FeGenie repository.

The -t argument sets the number of threads used for HMMER and BLAST. For example, -t 8 uses 8 threads. If your system has fewer available threads, set this number lower. The default is 1.

Tutorial (Binder)

FeGenie introductory slideshow:

Content | Video presentation

FeGenie video tutorial:

Content | Video presentation

To start the tutorial, hit the "launch binder" button below, and follow the commands in "Walkthrough".

(Initially forked from here. Thank you to the Binder team.)

Walkthrough

Enter the main FeGenie directory:

cd FeGenie

Print the FeGenie help menu:

FeGenie.py -h

Run FeGenie on the test dataset:

FeGenie.py -bin_dir genomes/ -bin_ext fna -out fegenie_out

Go into the output directory and inspect the output files:

cd fegenie_out
less FeGenie-geneSummary-clusters.csv

Run FeGenie on gene calls:

FeGenie.py -bin_dir ORFs/ -bin_ext faa -out fegenie_out --orfs

Run FeGenie on gene calls and use a reference database (RefSeq sub-sample) for cross-validation:

FeGenie.py -bin_dir ORFs/ -bin_ext faa -out fegenie_out --orfs -ref refseq_db/refseq_nr.sample.faa

Running with Docker

If running FeGenie with Docker, the only dependency you need installed is Docker itself (installation guide).

With Docker installed, you can run FeGenie like this:

docker run -it -v $(pwd):/data --env iron_hmms=/data/hmms/iron --env rscripts=/data/rscripts note/fegenie-deps ./FeGenie.py -bin_dir /data/test_dataset -bin_ext txt -out fegenie_out -t $(nproc)

./FeGenie.py ... follows the normal non-Dockerized flow of arguments.

Be aware that you need to mount the directories containing the files FeGenie is supposed to read. If you are not familiar with Docker, run the docker run command from the directory into which you cloned the FeGenie repository. If all the files you pass to FeGenie are inside this directory and you use relative file paths (for example hmms/iron), everything should work as expected.

Upcoming updates (suggestions welcome via Issues)

Ability to accept previously annotated genomes and gene calls
Include Cytochrome 579 (and possible rusticyanin)
Improve delineation between MtrA and MtoA for better resolution of iron reduction vs. iron oxidation
Option to report absolute values for gene counts rather than normalized gene counts
Include option to release all results regardless of whether reporting rules were met
Identification of iron-sulfur proteins

Name		Name	Last commit message	Last commit date
Latest commit History 329 Commits
ci		ci
depracated		depracated
hmms		hmms
iron		iron
rscripts		rscripts
.DS_Store		.DS_Store
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
FeGenie Icon_whitebg.png		FeGenie Icon_whitebg.png
FeGenie-workflow.png		FeGenie-workflow.png
FeGenie.py		FeGenie.py
Installing_R_packages_with_Rscript.readme		Installing_R_packages_with_Rscript.readme
LICENSE		LICENSE
README.md		README.md
anotherSampleBamsfile.txt		anotherSampleBamsfile.txt
sample_bams_file.txt		sample_bams_file.txt
setup-conda-env.sh		setup-conda-env.sh
setup.sh		setup.sh
test_dataset.tar.gz		test_dataset.tar.gz
test_run.sh		test_run.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FeGenie

Citing FeGenie

Installation with Mamba / Conda

Installation without Conda

Quick start

Quick start (if installed using the no-Conda setup)

Tutorial (Binder)

Walkthrough

Running with Docker

Upcoming updates (suggestions welcome via Issues)

About

Uh oh!

Releases 3

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FeGenie

Citing FeGenie

Installation with Mamba / Conda

Installation without Conda

Quick start

Quick start (if installed using the no-Conda setup)

Tutorial (Binder)

Walkthrough

Running with Docker

Upcoming updates (suggestions welcome via Issues)

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages