Identifying the major proteome components of Haemophilus influenzae type-strain NCTC 8143.

Link AJ, Hays LG, Carmack EB, Yates JR
Electrophoresis. 1997 18 (8): 1314-34

PMID: 9298647 · DOI:10.1002/elps.1150180808

With the completion of the Haemophilus influenzae Rd genomic sequence, we know the identity of most of the theoretical proteins in the proteome of this bacterium. However, the most abundant components of the actual proteome are unknown. Using mass spectrometry and two-dimensional gel electrophoresis (2-DE), we sequenced and analyzed the most abundant proteins observed in the ATCC reference strain of H. influenzae, NCTC 8143 (303 of approximately 400 Coomassie-stained 2-DE spots). To automate the identification of 2-DE spots, we coupled a liquid autosampler to a microcolumn liquid chromatography electrospray ionization tandem mass spectrometer capable of identifying 22 spots per day. From the 303 sequenced spots, we identified 263 unique proteins. Most of the abundant proteins lie in an isoelectric point range of pH 4-7 and a molecular mass range of 10-100 kDa. Of the observed proteins, the most abundant is the outer membrane protein P2. Based on variety and abundance, proteins involved in energy metabolism and macromolecular synthesis are the dominant classes of proteins. Unexpectedly, tryptophanase was identified as a highly abundant protein in the strain NCTC 8143 whose sequence is not present in the genome of the Rd strain. By searching the tandem mass spectra against the translated genomic sequence, we identified several proteins which were not annotated in the genomic sequence. Surprisingly, 22% of the identified 2-DE spots represent isoforms in which gene products with the same primary sequence have different observed pI and M(r), indicating that these proteins are post-translationally processed. Although most proteins' predicted and observed isoelectric points and molecular masses show reasonable concordance, the observed values for several proteins deviate significantly from the predicted values. These anomalies may represent either highly processed proteins or misinterpretations of the genomic sequence. Using the technology developed in this project, the protein expression of other strains of H. influenzae grown under different environmental conditions can be compared to identify differences in their proteomes.

MeSH Terms (8)

Bacterial Proteins Electrophoresis, Gel, Two-Dimensional Genome, Bacterial Haemophilus influenzae Isoelectric Point Mass Spectrometry Molecular Weight Peptide Mapping

Connections (1)

This publication is referenced by other Labnodes entities: