Translational database selection and multiplexed sequence capture for up front filtering of reliable breast cancer biomarker candidates.

Ståhl PL, Bjursell MK, Mahdessian H, Hober S, Jirström K, Lundeberg J

PLoS ONE 6 (6) e20794 [2011-06-15; online 2011-06-15]

Biomarker identification is of utmost importance for the development of novel diagnostics and therapeutics. Here we make use of a translational database selection strategy, utilizing data from the Human Protein Atlas (HPA) on differentially expressed protein patterns in healthy and breast cancer tissues as a means to filter out potential biomarkers for underlying genetic causatives of the disease. DNA was isolated from ten breast cancer biopsies, and the protein coding and flanking non-coding genomic regions corresponding to the selected proteins were extracted in a multiplexed format from the samples using a single DNA sequence capture array. Deep sequencing revealed an even enrichment of the multiplexed samples and a great variation of genetic alterations in the tumors of the sampled individuals. Benefiting from the upstream filtering method, the final set of biomarker candidates could be completely verified through bidirectional Sanger sequencing, revealing a 40 percent false positive rate despite high read coverage. Of the variants encountered in translated regions, nine novel non-synonymous variations were identified and verified, two of which were present in more than one of the ten tumor samples.

NGI Stockholm (Genomics Applications)

NGI Stockholm (Genomics Production)

National Genomics Infrastructure

PubMed 21698250

DOI 10.1371/journal.pone.0020794

Crossref 10.1371/journal.pone.0020794

pii: PONE-D-11-04237
pmc: PMC3115972