Lecture 5: Gene Expression Data & Annotations (06/24/2024)
What we will learn
Gene Expression Data
RNA-seq (and other methods) allow for globally studying gene expression to understand protein and drawing biological conclusions (treatment, prevention, and prognosis of diseases)
The basic workflow for NGS data analysis can be divided into three major parts:
I. RNA-seq Library Prep & Sequencing II. Data Preprocessing III. Data Analysis
Normalization is essential to make accurate comparisons of gene expression between samples and there are different ways to normalize the data depending on what concerns you have with the data
To get more information, annotations from database are used and can be merged with
dplyr::*_join()
functions gene expression
What we will Practice
Using a publicly-available RNA seq data (see overview here), we will perform each step in the data analysis workflow