Helicobacter pylori is a genetically diverse bacterial species that colonizes the stomach in about half of the human population. Most persons colonized by H. pylori remain asymptomatic, but the presence of this organism is a risk factor for gastric cancer. Multiple populations and subpopulations of H. pylori with distinct geographic distributions are recognized. Genetic differences among these populations might be a factor underlying geographic variation in gastric cancer incidence. Relatively little is known about the genomic features of African H. pylori strains compared to other populations of strains. In this study, we first analyzed the genomes of H. pylori strains from seven globally distributed populations or subpopulations and identified encoded proteins that exhibited the highest levels of sequence divergence. These included secreted proteins, an LPS glycosyltransferase, fucosyltransferases, proteins involved in molybdopterin biosynthesis, and the Clp protease adaptor (ClpS). Among proteins encoded by the cag pathogenicity island, CagA and CagQ exhibited the highest levels of sequence diversity. We then identified proteins in strains of Western African origin (classified as hspWAfrica by MLST analysis) with sequences that were highly divergent compared to those in other populations of strains. These included ATP-dependent Clp protease, ClpS, and proteins of unknown function. Three of the divergent protein sequences identified in West African strains were characterized by distinct insertions or deletions of up to 8 amino acids in length. These polymorphisms in rapidly evolving proteins represent robust genetic signatures for H. pylori strains of West African origin.
Bullock, Kennady K.; Shaffer, Carrie L.; Brooks, Andrew W.; Secka, Ousman; Forsyth, Mark H.; McClain, Mark S.; and Cover, Timothy L., Genetic signatures for Helicobacter pylori strains of West African origin (2017). PLOS ONE, 12(11).