Principal coordinates analysis has been proposed as an efficient way of predicting the binding affinity of a transcription factor to different DNA motifs, as it can model complex interactions that are difficult to represent with standard position-weight matrices. Here we evaluate its ability to distinguish the DNA binding properties of two closely related proteins, the homodimeric forms of NF-kappaB p50 and p52. When tested experimentally against 50 different variants of the generalised NF-kappaB motif GGRRNNYYCC, the binding specificities of p50 and p52 were similar but not identical (correlation rho = 0.86). These experimental data can be modelled accurately with six principal coordinates that are similar for p50 and p52, plus one principal coordinate that is significantly stronger for p52 than for p50, relating to the inner positions of the binding site. These findings are compatible with crystallographic data showing that p52 has greater ability than p50 to form water molecule-mediated hydrogen bonds with inner nucleotide positions of the binding site.


Journal article


Nucleic Acids Res

Publication Date





1497 - 501.