Training Course in Galaxy for Metagenomic Analysis on Targeted and Whole Genome Sequencing Data


Hands-On

Where you'll apply the knowledge and techniques you've acquired throughout the previous modules.

Welcome and introduction

Agenda, resources and other useful information

Galaxy at a Glance

A brief introduction to Galaxy. What is Galaxy? Why should you use Galaxy? How do you use Galaxy?

Introduction to Galaxy Analyses

How to get started in Galaxy. Here we'll cover the core tasks in Galaxy: uploading files, using tools, viewing histories, and running workflows.

Exploring datasets with statistics and scatterplots

Often, one or more data pre-processing step(s) may be required to proceed with the analysis. And, after the analysis, maybe you would like to visualize the result or datasets

Data Manipulation

How can I do basic data manipulation in Galaxy? Which tools are available to convert, reformat, filter, sort etc my text-based data?

Introduction to Metagenomics

Why study the microbiome? Amplicon and shotgun sequencing. Analysis pipelines and Visualisation options

Analyses of metagenomics data

How to analyze metagenomics data? What information can be extracted of metagenomics data? What is the difference between amplicon and shotgun data? What are the difference in the analyses of amplicon and shotgun data?

16S Microbial Analysis with Mothur

What is the effect of normal variation in the gut microbiome on host health?

Introduction to Visualisations in Galaxy

Visualizations are vital for the interpretation of large datasets. Galaxy has numerous visualization options available.

Genomic Data Visualisation with JBrowse

How can I visualise features or blast data? How can I visualise sequencing data in a workflow? Build several visualisations in JBrowse. Have basic familiarity with moving around JBrowse, and loading several data tracks

From NCBI's Sequence Read Archive (SRA) to Galaxy

How can you download public sequencing data deposited in the NCBI Sequence Read Archive (SRA) into a Galaxy history for analysis?

Quality Control

How to perform quality control of NGS raw data? What are the quality parameters to check for a dataset? How to improve the quality of a dataset?

Taxonomic Profiling and Visualization of Metagenomic Data

Which species (or genera, families, …) are present in my sample? What are the different approaches and tools to get the community profile of my sample? How can we visualize and compare community profiles?

Assembly of metagenomic sequencing data

Why metagenomic data should be assembled? What is the difference between co-assembly and individual assembly? What is the difference between reads, contigs and scaffolds? How to assess the quality of metagenomic data assembly?

Genome annotation with Prokka

How can we annotate a bacterial genome? How can we visualize annotated genomic features?

Microbial Variant Calling

How do we detect differences between a set of reads from a microorganism and a reference genome?

Pangenome Analysis

If we analyse DNA from several bacterial strains, we may want to know which genes they have in common and which are unique to some strains.

Pathogen detection

How to identify pathogens using sequencing data? How to track the found pathogens through all your samples datasets?


Authors and Contributors


Some of the materials used in this training are from the Galaxy community and are the result of a collaborative work. Thanks to all the contributors and the Galaxy Training Network!