Correlated evolution of nearby residues in Drosophilid proteins. Author Benjamin Callahan, Richard Neher, Doris Bachtrog, Peter Andolfatto, Boris Shraiman Publication Year 2011 Type Journal Article Abstract Here we investigate the correlations between coding sequence substitutions as a function of their separation along the protein sequence. We consider both substitutions between the reference genomes of several Drosophilids as well as polymorphisms in a population sample of Zimbabwean Drosophila melanogaster. We find that amino acid substitutions are "clustered" along the protein sequence, that is, the frequency of additional substitutions is strongly enhanced within ≈10 residues of a first such substitution. No such clustering is observed for synonymous substitutions, supporting a "correlation length" associated with selection on proteins as the causative mechanism. Clustering is stronger between substitutions that arose in the same lineage than it is between substitutions that arose in different lineages. We consider several possible origins of clustering, concluding that epistasis (interactions between amino acids within a protein that affect function) and positional heterogeneity in the strength of purifying selection are primarily responsible. The role of epistasis is directly supported by the tendency of nearby substitutions that arose on the same lineage to preserve the total charge of the residues within the correlation length and by the preferential cosegregation of neighboring derived alleles in our population sample. We interpret the observed length scale of clustering as a statistical reflection of the functional locality (or modularity) of proteins: amino acids that are near each other on the protein backbone are more likely to contribute to, and collaborate toward, a common subfunction. Keywords Animals, Drosophila Proteins, Mutation, Evolution, Molecular, Amino Acid Substitution, Drosophila melanogaster, Multigene Family, Selection, Genetic, Phylogeny Journal PLoS Genet Volume 7 Issue 2 Pages e1001315 Date Published 02/2011 Alternate Journal PLoS Genet. Google ScholarBibTeXEndNote X3 XML