Main content area

No-match ORESTES explored as tumor markers

Mello, Barbara P., Abrantes, Eduardo F., Torres, César H., Machado-Lima, Ariane, Fonseca, Rogério da Silva, Carraro, Dirce M., Brentani, Ricardo R., Reis, Luiz F.L., Brentani, Helena
Nucleic acids research 2009 v.37 no.8 pp. 2607-2617
bioinformatics, complementary DNA, databases, expressed sequence tags, gene expression regulation, genome, humans, microarray technology, neoplasms, non-coding RNA, open reading frames, quantitative polymerase chain reaction, tissues, transcription (genetics), transcriptome
Sequencing technologies and new bioinformatics tools have led to the complete sequencing of various genomes. However, information regarding the human transcriptome and its annotation is yet to be completed. The Human Cancer Genome Project, using ORESTES (open reading frame EST sequences) methodology, contributed to this objective by generating data from about 1.2 million expressed sequence tags. Approximately 30% of these sequences did not align to ESTs in the public databases and were considered no-match ORESTES. On the basis that a set of these ESTs could represent new transcripts, we constructed a cDNA microarray. This platform was used to hybridize against 12 different normal or tumor tissues. We identified 3421 transcribed regions not associated with annotated transcripts, representing 83.3% of the platform. The total number of differentially expressed sequences was 1007. Also, 28% of analyzed sequences could represent noncoding RNAs. Our data reinforces the knowledge of the human genome being pervasively transcribed, and point out molecular marker candidates for different cancers. To reinforce our data, we confirmed, by real-time PCR, the differential expression of three out of eight potentially tumor markers in prostate tissues. Lists of 1007 differentially expressed sequences, and the 291 potentially noncoding tumor markers were provided.