Nucleic Acids Res. 2008 Jan;36(Database issue):D114-9.
doi: 10.1093/nar/gkm799. Epub 2007 Oct 16.

ProTISA: a comprehensive resource for translation initiation site annotation in prokaryotic genomes

Gang-Qing Hu et al. Nucleic Acids Res. 2008 Jan.

Abstract

Correct annotation of translation initiation sites (TISs) is essential for both experimental and bioinformatics studies of the prokaryotic translation initiation mechanism, as well as for understanding gene regulation and gene structure. Here we describe ProTISA, a comprehensive database that collects TISs confirmed by a variety of available evidence for prokaryotic genomes, including Swiss-Prot experimental records, the literature, conserved domain hits and sequence alignment between orthologous genes. Moreover, by combining the predictions from our recently developed TIS post-processor, ProTISA provides a refined annotation for the public database RefSeq. Furthermore, the database annotates the potential regulatory signals associated with translation initiation in the region upstream of the TIS. As of July 2007, ProTISA includes 440 microbial genomes with more than 390 000 confirmed TISs. The database is available at http://mech.ctb.pku.edu.cn/protisa.


Figures

Figure 1.
Sequence logo and spacer length distribution of representative signals for the genomes (A) E. coli K-12; (B) S. coelicolor; (C) A. fulgidus; and (D) Synechocystis sp. PCC 6803. The positional weight matrix of the signal is visualized as a sequence logo in which the height of a letter at a given position is proportional to its frequency of occurrence. A letter is shown upside down if its frequency is lower than the background frequency. The consensus is shown below the logo. The spacer length is defined as the distance (the number of nucleotides) between the TIS and each annotated signal; the signals are located using the positional weight matrix visualized in the sequence logo.
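The quantities named in the caption can be illustrated with a minimal sketch. This is not the ProTISA implementation: the function names, the pseudocount, the uniform background, the toy sequences, and the convention that the spacer is counted from the 3' end of the signal to the first nucleotide of the start codon are all assumptions made for illustration.

```python
import math
from collections import Counter

ALPHABET = "ACGT"


def position_weight_matrix(sites, pseudocount=1.0):
    """Per-position nucleotide frequencies from equal-length aligned sites."""
    width = len(sites[0])
    if any(len(s) != width for s in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + pseudocount * len(ALPHABET)
    matrix = []
    for i in range(width):
        counts = Counter(s[i] for s in sites)
        matrix.append({b: (counts[b] + pseudocount) / total for b in ALPHABET})
    return matrix


def log_odds(matrix, background):
    """log2(frequency / background) per letter and position.

    A negative value means the letter is rarer than background, which is
    the case the caption describes as drawn upside down in the logo.
    """
    return [{b: math.log2(col[b] / background[b]) for b in ALPHABET}
            for col in matrix]


def best_hit(upstream, scores):
    """Return (score, start) of the highest-scoring window in `upstream`."""
    width = len(scores)
    best = None
    for start in range(len(upstream) - width + 1):
        window = upstream[start:start + width]
        score = sum(scores[i][base] for i, base in enumerate(window))
        if best is None or score > best[0]:
            best = (score, start)
    return best


def spacer_length(upstream, start, width):
    """Nucleotides between the signal's 3' end and the TIS.

    Assumes `upstream` ends immediately before the start codon.
    """
    return len(upstream) - (start + width)


# Toy data (hypothetical): four aligned Shine-Dalgarno-like sites.
sites = ["AGGAGG", "AGGAGG", "AGGAGA", "GGGAGG"]
background = {b: 0.25 for b in ALPHABET}  # assumed uniform background
scores = log_odds(position_weight_matrix(sites), background)

upstream = "TTTAGGAGGTTTTTT"  # region immediately 5' of the start codon
score, start = best_hit(upstream, scores)
spacer = spacer_length(upstream, start, len(scores))
```

Collecting `spacer` over all genes of a genome and tabulating the values would give a spacer length distribution of the kind the figure plots for each genome.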

