Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2004 Oct 12;32(18):5452-63.
doi: 10.1093/nar/gkh885. Print 2004.

'Conserved hypothetical' proteins: prioritization of targets for experimental study

Affiliations
Review

'Conserved hypothetical' proteins: prioritization of targets for experimental study

Michael Y Galperin et al. Nucleic Acids Res. .

Abstract

Comparative genomics shows that a substantial fraction of the genes in sequenced genomes encodes 'conserved hypothetical' proteins, i.e. those that are found in organisms from several phylogenetic lineages but have not been functionally characterized. Here, we briefly discuss recent progress in functional characterization of prokaryotic 'conserved hypothetical' proteins and the possible criteria for prioritizing targets for experimental study. Based on these criteria, the chief one being wide phyletic spread, we offer two 'top 10' lists of highly attractive targets. The first list consists of proteins for which biochemical activity could be predicted with reasonable confidence but the biological function was predicted only in general terms, if at all ('known unknowns'). The second list includes proteins for which there is no prediction of biochemical activity, even if, for some, general biological clues exist ('unknown unknowns'). The experimental characterization of these and other 'conserved hypothetical' proteins is expected to reveal new, crucial aspects of microbial biology and could also lead to better functional prediction for medically relevant human homologs.

PubMed Disclaimer

References

    1. Koonin E.V. and Galperin,M.Y. (2002) Sequence-Evolution-Function. Computational Approaches in Comparative Genomics. Kluwer Academic Publishers, Boston, MA. - PubMed
    1. Bernal A., Ear,U. and Kyrpides,N. (2001) Genomes OnLine Database (GOLD): a monitor of genome projects world-wide. Nucleic Acids Res., 29, 126–127. - PMC - PubMed
    1. Dunham I. (2000) Genomics—the new rock and roll? Trends Genet., 16, 456–461. - PubMed
    1. Bork P. (2000) Powers and pitfalls in sequence analysis: the 70% hurdle. Genome Res., 10, 398–400. - PubMed
    1. Kaneko T., Nakamura,Y., Wolk,C.P., Kuritz,T., Sasamoto,S., Watanabe,A., Iriguchi,M., Ishikawa,A., Kawashima,K., Kimura,T. et al. (2001) Complete genomic sequence of the filamentous nitrogen-fixing cyanobacterium Anabaena sp. strain PCC 7120. DNA Res., 8, 205–213. - PubMed