Analysis & data types
Learn about using different data types and how to perform different analyses.
Last updated
Was this helpful?
Learn about using different data types and how to perform different analyses.
Last updated
Was this helpful?
The UK Biobank Pharma Proteomics Project (UKB-PPP) was launched by a consortium of thirteen pharmaceutical companies in November 2020 with the aim to measure circulating concentrations of plasma proteins in approximately 55,000 UK Biobank participants. This project has now resulted in a first tranche of proteomics data for ~1,500 proteins that will soon be released to the broader UK Biobank research community.
Representatives from UK Biobank, Janssen, Biogen, Olink, Weill Cornell Medicine - Qatar & DNAnexus walk ways to access and analyze the data on the UK Biobank Research Analysis Platform. They discuss the new dataset, the collection and sequencing process and helpful analysis tips to get you started.
The UK Biobank Pharma Proteomics Project (UKB-PPP) was launched by a consortium of thirteen pharmaceutical companies in November 2020 with the aim to measure circulating concentrations of almost 1,500 plasma proteins in approximately 55,000 UK Biobank participants. The first tranche of proteomics data from this project is now available for the broader UK Biobank research community.
Learn how to analyze the new proteomics data on the UK Biobank Research Analysis Platform (UKB-RAP). Bioinformatics expert Alexandra Lee walks attendees through accessing the data and demonstrates a couple of use cases for working with the data.
Topics covered include:
Introduction to the new proteomics data that is now available on the UKB-RAP
Walking users through how they can access this data on the platform
Demonstrating a couple of example use cases for how users can analyze this new proteomics data including pQTL and differential expression
The rich UK Biobank dataset has extensive accelerometry and metabolic data on 100,000 participants. This lifestyle and health data matched with the large amount of genetic data presents an exciting opportunity to perform more complex analysis combining these disparate data types.
Learn how to analyze accelerometer and metabolomic data within the UK Biobank Dataset from expert speakers Rosemary Walmsley, Researcher in Reproducible Machine Learning at the University of Oxford, and Dr. Karsten Suhre, Professor of Physiology and Biophysics, Director Bioinformatics Core at Weill Cornell Medicine - Qatar. First, Rosemary discusses accelerometer data in the UK Biobank dataset, strategies for analysis, and points to repositories to get researchers started. Then Karsten walks through analyzing metabolic data and the necessary scripts and notebooks used in his workflows.
Learn about the data formats available to start running your image analysis on the Research Analysis Platform. Ondrej Klempir, Sr. Community Engagement Scientist at DNAnexus, walks you through how to access the data, reviews basic image (pre)processing steps & shows examples of image visualization on the cloud.
Interested in analyzing the extensive imaging data available on the UK Biobank Research Analysis Platform (UKB RAP)? Do you have your own data to analyze and manage? Experts from DNAnexus (UKB RAP), the FSL group at Oxford and MathWorks (makers of MATLAB and Simulink) will walk you through how to access or transfer the data, as well as label and process bulk data.
Agenda
Introduction and Short Discussion about Integrating Image data with Genomic Data (Ben Busby)
Advanced Imaging and the UKB RAP (Fidel Alfaro-Almagro)
Advanced Imaging on the DNAnexus Commercial Platform (Ondrej Klempir)
Image analysis on DNAnexus and RAP with Matlab (Rob Holt & Renee Qian)
Budgetary considerations around large scale data integration (Asha Collins)
Population-scale datasets offer researchers the ability to derive deeper insights into the genetic mechanisms of late-life diseases like Dementia, but understanding how to tackle analyzing the sheer amount of data remains a challenge.
Our panel of research experts review a broad range of multimodal data science approaches that researchers can use to explore the UK Biobank dataset for new discovery. They discuss linking different data types including proteomics, imaging, wearables & genetics and also describe tools powering their analysis and how to access the tools.
An introduction to running GWAS using Regenie on the Research Analysis Platform. DNAnexus experts demonstrate how to run the analysis using a diabetes phenotype on the 300k data.
Topics include:
Genomic file preprocessing and filtering using the Swiss Army Knife App
Building phenotype/covariate files for cohorts using Spark JupyterLab
Running regenie using the Swiss Army Knife App
You can also learn more about regenie by viewing this paper referenced in the presentation
So you've run a GWAS analysis on RAP and obtained p-values. Now what? This session details visualizing GWAS results using the Research Analysis Platform. The webinar covers interactive exploration of GWAS results using the LocusZoom App and then dives deeper into variant annotation and visualization using both R and Python Jupyter Notebooks.
Learn how to create and run your own target discovery pipeline with GWAS and PheWAS. This webinar shows an end-to-end target discovery pipeline run entirely on UKB RAP that you will be able to replicate for your own research projects on the platform.
Learn how to:
Extract data using dx extract_dataset -Make sample and variant QC
Run GWAS analysis using REGENIE app
Perform LD clumping using PLINK to cluster significant GWAS variants
Run PheWAS analysis
As this is an advanced webinar, we expect that audience has experience with:
Performing analysis on UKB RAP -Running JupyterLab on UKB RAP
Compiling and running WDL workflow
Running SAK using GUI or CLI
Code is available on github.
Learn how to perform large-scale variant annotation with tools that are easy to run in Jupyter Notebooks. This session covers strategies for implementing these tools and sample code for you to get up and running on your projects.
Learn the tips and tricks to faster and more cost effective large-scale WGS analysis. This webinar covers how to set up reproducible analysis pipelines, what apps & tools you should incorporate and how to build your own custom workflows.