Analysis & data types

Learn about using different data types and how to perform different analyses.

Proteomics

Integrative analysis of UKB proteomics data

The UK Biobank Pharma Proteomics Project (UKB-PPP) was launched by a consortium of thirteen pharmaceutical companies in November 2020 with the aim to measure circulating concentrations of plasma proteins in approximately 55,000 UK Biobank participants. This project has now resulted in a first tranche of proteomics data for ~1,500 proteins that will soon be released to the broader UK Biobank research community.

Representatives from UK Biobank, Janssen, Biogen, Olink, Weill Cornell Medicine - Qatar & DNAnexus walk ways to access and analyze the data on the UK Biobank Research Analysis Platform. They discuss the new dataset, the collection and sequencing process and helpful analysis tips to get you started.

chevron-rightVideo chapters & slideshashtag

Slides are available herearrow-up-right

Analyzing UKB proteomics data

The UK Biobank Pharma Proteomics Project (UKB-PPP) was launched by a consortium of thirteen pharmaceutical companies in November 2020 with the aim to measure circulating concentrations of almost 1,500 plasma proteins in approximately 55,000 UK Biobank participants. The first tranche of proteomics data from this project is now available for the broader UK Biobank research community.

Learn how to analyze the new proteomics data on the UK Biobank Research Analysis Platform (UKB-RAP). Bioinformatics expert Alexandra Lee walks attendees through accessing the data and demonstrates a couple of use cases for working with the data.

Topics covered include:

  • Introduction to the new proteomics data that is now available on the UKB-RAP

  • Walking users through how they can access this data on the platform

  • Demonstrating a couple of example use cases for how users can analyze this new proteomics data including pQTL and differential expression

chevron-rightVideo chaptershashtag

Metabolomics

Analysis of accelerometer and metabolomic data in the UKB

The rich UK Biobank dataset has extensive accelerometry and metabolic data on 100,000 participants. This lifestyle and health data matched with the large amount of genetic data presents an exciting opportunity to perform more complex analysis combining these disparate data types.

Learn how to analyze accelerometer and metabolomic data within the UK Biobank Dataset from expert speakers Rosemary Walmsley, Researcher in Reproducible Machine Learning at the University of Oxford, and Dr. Karsten Suhre, Professor of Physiology and Biophysics, Director Bioinformatics Core at Weill Cornell Medicine - Qatar. First, Rosemary discusses accelerometer data in the UK Biobank dataset, strategies for analysis, and points to repositories to get researchers started. Then Karsten walks through analyzing metabolic data and the necessary scripts and notebooks used in his workflows.

chevron-rightVideo chaptershashtag

Images

Image analysis on UKB-RAP

Learn about the data formats available to start running your image analysis on the Research Analysis Platform. Ondrej Klempir, Sr. Community Engagement Scientist at DNAnexus, walks you through how to access the data, reviews basic image (pre)processing steps & shows examples of image visualization on the cloud.

chevron-rightVideo chaptershashtag

Advanced image management

Interested in analyzing the extensive imaging data available on the UK Biobank Research Analysis Platform (UKB RAP)? Do you have your own data to analyze and manage? Experts from DNAnexus (UKB RAP), the FSL group at Oxford and MathWorks (makers of MATLAB and Simulink) will walk you through how to access or transfer the data, as well as label and process bulk data.

Agenda

  • Introduction and Short Discussion about Integrating Image data with Genomic Data (Ben Busby)

  • Advanced Imaging and the UKB RAP (Fidel Alfaro-Almagro)

  • Advanced Imaging on the DNAnexus Commercial Platform (Ondrej Klempir)

  • Image analysis on DNAnexus and RAP with Matlab (Rob Holt & Renee Qian)

  • Budgetary considerations around large scale data integration (Asha Collins)

chevron-rightVideo chaptershashtag

Dementia and multimorbidity in late-life disease

Population-scale datasets offer researchers the ability to derive deeper insights into the genetic mechanisms of late-life diseases like Dementia, but understanding how to tackle analyzing the sheer amount of data remains a challenge.

Our panel of research experts review a broad range of multimodal data science approaches that researchers can use to explore the UK Biobank dataset for new discovery. They discuss linking different data types including proteomics, imaging, wearables & genetics and also describe tools powering their analysis and how to access the tools.

GWAS

GWAS on UKB-RAP using regenie

An introduction to running GWAS using Regenie on the Research Analysis Platform. DNAnexus experts demonstrate how to run the analysis using a diabetes phenotype on the 300k data.

Topics include:

  • Genomic file preprocessing and filtering using the Swiss Army Knife App

  • Building phenotype/covariate files for cohorts using Spark JupyterLab

  • Running regenie using the Swiss Army Knife App

You can also learn more about regenie by viewing this paperarrow-up-right referenced in the presentation

chevron-rightVideo chaptershashtag

Visualizing and annotating GWAS results

So you've run a GWAS analysis on RAP and obtained p-values. Now what? This session details visualizing GWAS results using the Research Analysis Platform. The webinar covers interactive exploration of GWAS results using the LocusZoom App and then dives deeper into variant annotation and visualization using both R and Python Jupyter Notebooks.

chevron-rightVideo chaptershashtag

End to end target discovery with GWAS and PheWAS

Learn how to create and run your own target discovery pipeline with GWAS and PheWAS. This webinar shows an end-to-end target discovery pipeline run entirely on UKB RAP that you will be able to replicate for your own research projects on the platform.

Learn how to:

  • Extract data using dx extract_dataset -Make sample and variant QC

  • Run GWAS analysis using REGENIE app

  • Perform LD clumping using PLINK to cluster significant GWAS variants

  • Run PheWAS analysis

As this is an advanced webinar, we expect that audience has experience with:

  • Performing analysis on UKB RAP -Running JupyterLab on UKB RAP

  • Compiling and running WDL workflow

  • Running SAK using GUI or CLI

chevron-rightVideo chaptershashtag

Code is available on githubarrow-up-right.

Variant annotation

Flexible variant annotation

Learn how to perform large-scale variant annotation with tools that are easy to run in Jupyter Notebooks. This session covers strategies for implementing these tools and sample code for you to get up and running on your projects.

chevron-rightVideo chaptershashtag

Large-scale analysis

Efficiently analyzing large-scale WGS data

Learn the tips and tricks to faster and more cost effective large-scale WGS analysis. This webinar covers how to set up reproducible analysis pipelines, what apps & tools you should incorporate and how to build your own custom workflows.

chevron-rightVideo chaptershashtag

Last updated

Was this helpful?