Skip to content

sanger-pathogens/pathogen-informatics-training

Repository files navigation

Pathogen Informatics Training

A set of bioinformatics training courses developed by Pathogen Informatics at Wellcome Sanger Institute.

License: CC BY 4.0

Content

Introduction

These bioinformatics training courses use Jupyter notebooks to provide pathogen informatics training and the following notebooks are avaiable:

  • UNIX for Bioinformatics
  • Introduction to BLAST
  • NGS Data Formats and QC
  • An Introduction to IGV
  • RNA-Seq Expression Analysis
  • Differential Expression and GO Term Analysis using DEAGO
  • Pangenome Construction using Roary
  • Antimicrobial Resistance Identification using ARIBA
  • Serotype Detection using SeroBA
  • PathFind (pf) scripts
  • ChiP-Seq
  • LSF

Installation

The courses use Jupyter notebooks, which means that Jupyter must be installed to use them. Please see the Jupyter installation instructions for details.

If you are running Jupyter on MacOS you may have to install the bash kernel. To do so, run the following commands:

pip install bash_kernel

python -m bash_kernel.install

The courses assume that you have the relevant tools installed (e.g. ARIBA and SeroBA). Further information about the relevant dependencies can be found inside each individual notebook.

Docker

The following tutorials can be run in a Docker container:

  • UNIX for Bioinformatics
  • Introduction to BLAST
  • Antimicrobial Resistance Identification using ARIBA
  • Serotype Detection using SeroBA

First install Docker, then pull down the Docker image:

docker pull sangerpathogens/pathogen-informatics-training

To start the Notebook, run:

docker run -p 8888:8888 -d sangerpathogens/pathogen-informatics-training jupyter notebook

This will print a URL with a token. Copy and paste this in your browser. This will open the notebook in the "Notebooks" directroy. Click on the index.ipynb and navigate your way to the desired notebook from there.

If for some reason you need to close the tutorial and want to continue at a later date (provided that you have saved your progress in the notebooks using the save button), you can restart the container by running:

docker start -a CONTAINER_ID

Where CONTAINER_ID is the id of the container. You can find out what the id of the container is by running:

docker container ls -a

Again, copy the URL into your browser and you are ready to pick up where you left off.

Usage

Clone this repository:

git clone https://github.com/sanger-pathogens/pathogen-informatics-training.git

Start Jupyter at the main index page to view the available courses:

jupyter notebook pathogen-informatics-training/Notebooks/index.ipynb

Select a course and follow the instructions given in the notebook.

License

This is free software and educational material, licensed under CC BY.

Feedback/Issues

Please report any issues to the issues page or email [email protected]