- This event has passed.
Genetic data analysis and exploration using R (GDAR02)
16th August 2016 - 20th August 2016
This course will provide an extensive overview and comprehensive introduction to exploratory methods and various statistical approaches for the analysis of genetic data using the software R and aim to equip participants with powerful resources for tackling increasingly common challenges in genetic data analysis.
The course is aimed at PhD students, research postgraduates, and practicing academics as well as persons in industry working with genetic data in fields such as molecular ecology, evolutionary biology, and phylogenetics.
Venue – Milport – Download directions PDF
Availability – 30 places
Duration – 5 days
Contact hours – Approx. 35 hours
ECT’s – Equal to 3 ECT’s
Language – English
A mixture of lectures and hands-on practicals. Data sets for computer practicals will be provided by the instructors, but participants are welcome to bring their own data.
Assumed quantitative knowledge
A basic understanding of concepts in population genetics and the statistical analysis of genetic data.
Assumed computer background
Previous experience with data analysis using R is required such as the ability to import/export data, manipulate data frames, fit basic statistical models & generate simple exploratory and diagnostic plots.
Equipment and software requirements
A laptop/personal computer with a working version or R and RStudio installed. R and RStudio are supported by both PC and MAC and can be downloaded for free by following these links.
It is essential that you come with all necessary software and packages already installed (you will be sent a list of packages prior to the course) internet access may not always be available.
UNSURE ABOUT SUITABLILITY THEN PLEASE ASK firstname.lastname@example.org
Sunday 15thMeet at Millport field station at approximately 18:30 (Download directions PDF)
Monday 16th – Classes from 09:00 to 17:00
Intro to phylogenetic reconstruction.
Module 1a: Reconstructing phylogenies from genetic sequence data. Three main approaches covered: distance-based phylogenies; maximum parsimony; and likelihood-based approaches.
Module 1b: Short R refresher.
Practical 1: Phylogenetic reconstruction using R. Three main approaches plus rooting a tree; assessing/testing for a molecular clock; and bootstrapping.
Main packages: ape, phangorn.
Tuesday 17th – Classes from 09:00 to 17:00
Intro to multivariate analysis of genetic data
Module 2: Key concepts in multivariate analysis. Focus on using factorial methods for genetic data analysis.
Practical 2: Basics of multivariate analysis of genetic data in R. Topics include: data handling, population genetic tests of population structure (PCA, PCoA).
Main packages: adegenet, ade4, ape.
Wednesday 18th – Classes from 09:00 to 17:00
Exploring group diversity
Module 3: Approaches to identifying and describing genetic clusters. Topics include: hierarchical clustering, K-means, population-level multivariate analysis (between-group-PCA, DA, DAPC).
Practical 3: Applying the approaches covered in morning lecture and emphasising their strengths and weaknesses.
Main packages: adegenet, ade4.
Thursday 19th – Classes from 09:00 to 17:00
Spatial genetic structures
Module 4: Discussing the origin and significance of spatial genetic patterns, and how to test for them.
Practical 4: Visualising and analysing spatial genetic data. Topics: spatial density estimates, Moran/Mantel tests, mapping principal components in PCA, spatial PCA.
Main packages: adegenet, adehabitat, ade4.
Friday 20th – Classes from 09:00 to 16:00
Using R for reproducible science.
Module 5: Using R for reproducible science
Practical 5: Practical session based on morning lecture
Main packages: knitr, Sweave, rmarkdown