One method for detecting horizontal gene transfer events is to find genes that use different codons than the rest of the genome. This is also reflected in the underlying GC content of genes because GC content is correlated with codon usage, which in turn is associated their encoded amino acids. In other words, GC rich genes use GC rich codons. This bias in GC content is often reflected in the wobble position of codons so that GC rich genes use codons that end in G or C. For this example, a cryptic prophage is easily visualized in the genome of Escherichia coli K12 substrain MG1655 by looking for genes that have a different percentage of GC content in the wobble position as compared to the neighboring genomic regions.
To do this type of analysis with CoGe:
The color scheme for the wobble GC usage is:
The majority of coding sequence in K12 have a yellow-green color (showing a GC content slightly greater than 50%), the some of the phage's coding sequence is heavily biased towards a high AT content in the wobble position. This results in those coding sequences being colored red. This is further seen by turning on the "Other Feature" visualization layer where a misc_feature that annotates the prophage is shown by a large orange block. This view can be recreated at: http://synteny.cnr.berkeley.edu/CoGe/GenomeView.pl?z=5&x=570000&dsgid=4242&chr=1&dsid=36725&gstid=1 .
Another feature of horizontal genome transfer events, such as this phage integration into a bacterial genome, is that the genome structure is markedly different. While most of this genome is structured into obvious operons, the phage's genome structure is not order as such.