Loading…

The Complete Genome Sequence of Escherichia coli K-12

The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes withi...

Full description

Saved in:
Bibliographic Details
Published in:Science (American Association for the Advancement of Science) 1997-09, Vol.277 (5331), p.1453-1469
Main Authors: Blattner, Frederick R., Plunkett, Guy, Bloch, Craig A., Perna, Nicole T., Burland, Valerie, Riley, Monica, Collado-Vides, Julio, Glasner, Jeremy D., Rode, Christopher K., Mayhew, George F., Gregor, Jason, Davis, Nelson Wayne, Kirkpatrick, Heather A., Goeden, Michael A., Rose, Debra J., Mau, Bob, Shao, Ying
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The 4,639,221-base pair sequence of Escherichia coli K-12 is presented. Of 4288 protein-coding genes annotated, 38 percent have no attributed function. Comparison with five other sequenced microbes reveals ubiquitous as well as narrowly distributed gene families; many families of similar genes within E. coli are also evident. The largest family of paralogous proteins contains 80 ABC transporters. The genome as a whole is strikingly organized with respect to the local direction of replication; guanines, oligonucleotides possibly related to replication and recombination, and most genes are so oriented. The genome also contains insertion sequence (IS) elements, phage remnants, and many other patches of unusual composition indicating genome plasticity through horizontal transfer.
ISSN:0036-8075
1095-9203
DOI:10.1126/science.277.5331.1453