Jump to Main Content
Peptide Retention Time Prediction in Hydrophilic Interaction Liquid Chromatography: Data Collection Methods and Features of Additive and Sequence-Specific Models
- Krokhin, Oleg V., Ezzati, Peyman, Spicer, Vic
- Analytical chemistry 2017 v.89 no.10 pp. 5526-5533
- algorithms, amino acids, data collection, hydrogen bonding, hydrophilic interaction chromatography, models, peptides, prediction
- The development of a peptide retention prediction model for hydrophilic interaction liquid chromatography (XBridge Amide column) is described for a collection of ∼40 000 tryptic peptides. Off-line 2D LC-MS/MS analysis (HILIC-RPLC) of S. cerevisiae whole cell lysate has been used to acquire retention information for a HILIC separation. The large size of the optimization data set (more than 2 orders of magnitude compared to previous reports) permits the accurate assignment of hydrophilic retention coefficients of individual amino acids, establishing both the effects of amino acid position relative to peptide termini and the influence of peptide secondary structure in HILIC. The accuracy of a simple additive model with peptide length correction (R² value of ∼0.96) was found to be much higher compared to an algorithm of similar complexity applied to RPLC (∼0.91). This indicates significantly smaller influence of peptide secondary structure and interactions with counterions in HILIC. Nevertheless, sequence-specific features were found. Helical peptides that tend to retain stronger than predicted in RPLC exhibit negative prediction errors using an additive HILIC model. N-cap helix stabilizing motifs, which increase retention of amphipathic helical peptides in RP, reduce peptide retention in HILIC independently of peptide amphipathicity. Peptides carrying multiple Pro and Gly (residues with lowest helical propensity) retain stronger than predicted. We conclude that involvement of the peptide backbone’s carbonyl and amide groups in hydrogen-bond stabilization of helical structures is a major factor, which determines sequence-dependent behavior of peptides in HILIC. The incorporation of observed sequence dependent features into our Sequence-Specific Retention Calculator HILIC model resulted in 0.98 R² value correlation, significantly exceeding the predictive performance of all RP and HILIC models developed for complex mixtures of tryptic peptides.