Laptop Setup Instructions
Instructions for setting up your laptop can be found here: Laptop Setup Instructions
1) R Preparation tutorials: You are expected to have completed the following tutorials in R beforehand. The tutorials should be very accessible even if you have never used R before.
2) UNIX Preparation tutorials:
R Review Session
.zip CEL files containing:
Welcome *Faculty: Michelle Brazas*
Module 1: Introduction to Microarrays and R *Faculty: Paul Boutros*
Module 2: Quality Control of Microarrays *Faculty: Paul Boutros*
Integrated Assignment *Faculty: Nicholas Harding*
Note: You will have to create your own phenotype data .txt file, using the sample annotations in the links.
phenotypedata.txt
Many people had issues creating the phenotype data file. The phenotype data file must:
- Be TAB delimited.
- Contain a header. The header has one fewer column label than the other rows have columns, and it begins with a tab. This is because the first column (the file names) is read in as row names. For the difference between the row names of a data frame and a column, check the data frame documentation.
- Contain no stray spaces. Because the file is tab delimited, any leading or trailing spaces are incorporated into the cells, so ‘Control ’ is not the same as ‘Control’. Hint: some text editors have an option that displays whitespace characters.
Remember that you can check that your file has been read in correctly using the:
function, which returns your phenotype annotation as a dataframe.
For further help see
This points you to another function that loads the dataframe, and tells you exactly what it is expecting.
You can troubleshoot any problems with your phenotype data using this function directly.
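The formatting rules above can also be checked before the file is loaded into R. The following is a minimal sketch in Python, not part of the workshop material: the function name `check_phenotype_text` and the sample file names and annotations are hypothetical, and it only tests the rules listed above (tab delimiting, a header that begins with a tab and has one fewer label than the data rows have columns, and no leading or trailing spaces in any cell).

```python
def check_phenotype_text(text):
    """Return a list of formatting problems found in phenotype-file text.

    An empty list means the text follows the rules described above.
    (Hypothetical helper for illustration only.)
    """
    # Keep the original line numbers so messages point at the right line.
    rows = [(n, line) for n, line in enumerate(text.splitlines(), start=1)
            if line != ""]
    if not rows:
        return ["file is empty"]

    problems = []
    header_number, header = rows[0]

    # The header must begin with a tab (the row-names column has no label).
    if header.startswith("\t"):
        labels = header[1:].split("\t")
    else:
        problems.append(f"line {header_number}: header does not begin with a tab")
        labels = header.split("\t")

    # Data rows carry one more cell than the header has labels.
    expected = len(labels) + 1

    for number, line in rows:
        cells = line.split("\t")
        if number != header_number and len(cells) != expected:
            problems.append(
                f"line {number}: expected {expected} tab-separated cells, "
                f"found {len(cells)}"
            )
        # Leading/trailing spaces become part of the cell value.
        for cell in cells:
            if cell != cell.strip(" "):
                problems.append(f"line {number}: stray space in {cell!r}")

    return problems


if __name__ == "__main__":
    # Hypothetical example content; real files use your own CEL file names.
    example = (
        "\tTreatment\tTime\n"
        "sample1.CEL\tControl \t0h\n"   # trailing space after 'Control'
        "sample2.CEL\tTreated\t4h\n"
    )
    for problem in check_phenotype_text(example):
        print(problem)
```

To check a real file, read its contents (for example with `open("phenotypedata.txt").read()`) and pass the text to the function. This does not replace troubleshooting in R as described above; it only catches the common formatting mistakes early.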
Integrated Assignment Data
- Rat: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE10770
- Mouse: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE10769
Note: The 11 samples above are the same as those in the link below, only this time they are part of a larger set. Use the link below to prepare the PhenoData file:
For the CDF file: download the alternative-CDF package from: http://brainarray.mbni.med.umich.edu/Brainarray/Database/CustomCDF/17.1.0/entrezg.asp
- .zip file of rat CEL files containing:
- .zip file of mouse CEL files containing:
Module 3: Statistical Analysis *Faculty: Paul Boutros*
Module 4: Beyond the Microarray Experiment *Faculty: Paul Boutros*
Other (more advanced) resources
A more detailed introduction to R. This is not a basic tutorial; it is for people who really want to know more about R.
1) “Introductory Statistics with R” by Peter Dalgaard. It is not required for this workshop but if you are interested in buying a good book and/or want to know more, you might want to consider getting a copy.
Sections 1-5 give a very good (perhaps very detailed) idea of what I plan to discuss during the workshop.
2) “Bioinformatics and Computational Biology Solutions Using R and Bioconductor” (Statistics for Biology and Health series), edited by Robert Gentleman, Vincent Carey, Wolfgang Huber, Rafael Irizarry and Sandrine Dudoit
3) “Building Bioinformatics Solutions with Perl, R and MySQL” by Conrad Bessant, Ian Shadforth and Darren Oakley