Figure 6

Observed and predicted site-specific amino acid distributions, divided by the frequencies w_AA(a) expected under mutation alone, for (a) mutation model 1 ('opt. freq. at β ≡ 0'), (b) mutation model 2 ('constant'), (c) mutation model 4 ('opt. freq.'), and (d) mutation model 5 ('CpG'). For the theoretical models in which mutation satisfies detailed balance, this ratio is proportional to exp[-β_i h(a)], so that, on a logarithmic scale, the slope of the plot against h(a) gives -β_i for this site class. For illustration, the site class with c_i/⟨c⟩ ∈ [0.435, 0.545] was selected. Full symbols show the observed distributions obtained from sequences in the PDB, whereas open symbols and lines show the mean-field model.
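
The relation between the slope and β_i can be illustrated with a minimal sketch in Python, assuming NumPy is available. The function name `estimate_beta`, the variable names, and all numerical values are hypothetical and synthetic; this is not the fitting procedure used to produce the figure. It only shows that a straight-line fit of the logarithm of the ratio against h(a) recovers β_i as minus the slope.

```python
import numpy as np

def estimate_beta(h, distribution, w_mut):
    """Return minus the least-squares slope of log(distribution / w_mut) against h.

    h            : values h(a), one per amino acid (hypothetical input)
    distribution : site-specific amino acid frequencies for one site class
    w_mut        : frequencies expected under mutation alone
    """
    h = np.asarray(h, dtype=float)
    ratio = np.asarray(distribution, dtype=float) / np.asarray(w_mut, dtype=float)
    # Fit log(ratio) = slope * h + intercept; the model predicts slope = -beta.
    slope, _intercept = np.polyfit(h, np.log(ratio), 1)
    return -slope

# Synthetic example with made-up values (not data from the paper).
h = np.linspace(-1.0, 1.0, 20)            # 20 amino acids, arbitrary h(a) values
w_mut = np.linspace(1.0, 2.0, 20)
w_mut /= w_mut.sum()                      # normalized mutation-only frequencies
beta_true = 0.8                           # assumed selection strength for the test
p = w_mut * np.exp(-beta_true * h)
p /= p.sum()                              # normalized site-specific distribution

beta_hat = estimate_beta(h, p, w_mut)
print(beta_hat)
```

Because normalization only shifts the intercept of the fitted line, the slope is unaffected by the normalization constant of the distribution.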