Standard data processing and quality control

clusterHigh-performance computing cluster at the WTCHG.

All data generated within the Oxford Genomics Centre undergoes a rigorous quality control procedure to ensure that researchers receive the best data and to identify any issues arising. Data is processed through our primary pipeline to generate raw sequence data (FASTQ files) and usually aligned reads (BAM files) for each sequenced sample. Associated quality control (QC) metrics are also provided and data is made available via an ftp link.

Our data storage policy includes keeping the raw data for up to 2 years in archive storage, although we strongly recommend prompt download and storage of all data provided. The bioinformatics support is intended to enable individual research groups to take advantage of the latest genomics techniques without needing their own extensive computing/bioinformatics resources. Additional bioinformatics steps that may be performed on request (where resources permit) are detailed below for different applications.