Identification and complete sequencing of novel human transcripts through the use of mouse orthologs and testis cDNA sequences

Bibliographic Details
Published in: Genetics and Molecular Research 2004-12, Vol. 3 (4), p. 493-511
Main Authors: Ferreira, Elisa N; Pires, Lilian C; Parmigiani, Raphael B; Bettoni, Fabiana; Puga, Renato D; Pinheiro, Daniel G; Andrade, Luís Eduardo C; Cruz, Luciana O; Degaki, Theri L; Faria, Jr, Milton; Festa, Fernanda; Giannella-Neto, Daniel; Giorgi, Ricardo R; Goldman, Gustavo H; Granja, Fabiana; Gruber, Arthur; Hackel, Christine; Henrique-Silva, Flávio; Malnic, Bettina; Manzini, Carina V B; Marie, Suely K N; Martinez-Rossi, Nilce M; Oba-Shinjo, Sueli M; Pardini, Maria Ines M C; Rahal, Paula; Rainho, Cláudia A; Rogatto, Silvia R; Romano, Camila M; Rodrigues, Vanderlei; Sales, Magaly M; Savoldi, Marcela; da Silva, Ismael D C G; da Silva, Neusa P; de Souza, Sandro J; Tajara, Eloiza H; Silva, Jr, Wilson A; Simpson, Andrew J G; Sogayar, Mari C; Camargo, Anamaria A; Carraro, Dirce M
Format: Article
Language: English
Description
Summary: The correct identification of all human genes, and their derived transcripts, has not yet been achieved, and it remains one of the major aims of the worldwide genomics community. Computational programs suggest the existence of 30,000 to 40,000 human genes. However, definitive gene identification can only be achieved by experimental approaches. We used two distinct methodologies, one based on the alignment of mouse orthologous sequences to the human genome, and another based on the construction of a high-quality human testis cDNA library, in an attempt to identify new human transcripts within the human genome sequence. We generated 47 complete human transcript sequences, comprising 27 unannotated and 20 annotated sequences. Eight of these transcripts are variants of previously known genes. These transcripts were characterized according to size, number of exons, and chromosomal localization, and a search for protein domains was undertaken based on their putative open reading frames. In silico expression analysis suggests that some of these transcripts are expressed at low levels and in a restricted set of tissues.
ISSN:1676-5680