|
|
||||||||


,¶




,*
* Partners AIDS Research Center, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02129;
Theoretical Biology and Biophysics, Los Alamos National Laboratory, Los Alamos, NM 87545;
School of Mathematics, University of Manchester, Manchester, United Kingdom;
Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark;
¶ Theoretical Biology/Bioinformatics, Utrecht University, Utrecht, The Netherlands;
|| Endocrine Unit, Massachusetts General Hospital, Charlestown, MA 02114;
# Department of Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL 60611;
** International Institute for Nanotechnology, Northwestern University, Evanston, IL 60208;

Medical Research Council (MRC) Human Immunology Unit, Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, United Kingdom;
* Howard Hughes Medical Institute, Chevy Chase, MD 20815; and
* Santa Fe Institute, Santa Fe, NM 87501
| Abstract |
|---|
|
|
|---|
ELISpot assay, these "toggled" peptides detected HIV-specific CD4+ and CD8+ T cell responses of significantly higher breadth and magnitude than matched consensus peptides. The observed increases were explained by a closer match of the toggled peptides to the autologous viral sequence. Toggled peptides therefore afford a cost-effective and significantly more complete view of the host immune response to HIV and are directly applicable to other variable pathogens. | Introduction |
|---|
|
|
|---|
Historically, either natural strains or consensus sequences have been used as the basis for peptide synthesis when probing the cellular immune response to HIV (12, 13, 14, 15). Since consensus protein sequences, by definition, contain the residue most frequently found in circulating viruses, they provide wider coverage of viral sequence diversity than individual isolate-specific sequences, yet fail to properly represent subdominant variants. We overcame this limitation by incorporating amino acid mixes in the peptide synthesis for positions displaying sequence variability, creating a peptide set that captures sequence diversity in variable regions of the virus. Given the recurrent nature of sequence changes between HIV clades, such a "toggled" peptide set can cover a significant degree of diversity in all group M sequences and may significantly improve the accurate assessment of the host immune response to HIV. Indeed, data presented here establish toggled peptides as a powerful approach to detect significantly stronger and broader CD4 and CD8 T cell responses than consensus peptides. These data demonstrate the usefulness of the toggled peptides approach to more accurately assess the host immune response to HIV, and by inference, to other variable pathogens.
| Materials and Methods |
|---|
|
|
|---|
Subjects at different stages of HIV infection (see Results) were recruited at three hospitals in the Boston area. Additionally, 11 HIV-negative subjects were recruited at Massachusetts General Hospital. All human subject protocols have been approved by the Partners Human Research Committee, and all subjects provided written informed consent before enrollment.
Quantification of surface preference of toggling sites
The solvent-exposed surface area of each atom in the respective protein (reverse transcriptase and capsid) was calculated by the MSMS algorithm, described at: http://www.scripps.edu/mb/olson/people/sanner/html/msms_home.html and implemented in the Chimera program (http://www.cgl.ucsf.edu/chimera/). Backbone atoms were excluded from the comparison of atom-wise solvent-exposed surface area. The cumulative distribution functions of the levels of solvent exposure of each side chain atom in the sets of toggled and conserved positions were compared using a two-sample Kolmogorov-Smirnov test as implemented in Splus 10.0 (16).
Toggled peptides
Toggled peptides were based on the design of a previously published set of consensus B 18-mer peptides overlapping by 10 amino acids (12). The toggled peptides were synthesized on an AAPPTEC (Louisville, KY), model Apex 396 multiple peptide synthesizer using 9-fluorenylmethoxycarbonyl/tertiary butyl (Fmoc/tBu) solid-phase chemistry. An optimized 20 µmol scale synthesis cycle with a 6-fold molar excess of Fmoc amino acid and 1H-benzotriazolium, 1-[bis(dimethyamino)methylene] 5-chloro- hexafluorophosphate(1-), 3-oxide/N,N-diisopropylethyamine activation was used. Each amino acid was double coupled each time for 75 min. For the toggled positions in the sequence, a mixture containing an equimolar amount of the representative Fmoc amino acids was used. Peptides were cleaved from the solid support and deprotected using reagent K (trifluoroacetic acid/phenol/thioanisole/water/ethanedithiol; 82.5/5.0/5.0/5.0/2.5 v/v) for 3 h at ambient temperature. Peptides were precipitated using cold methyl tertiary butyl ether. The precipitate was washed 3 times with methyl tertiary butyl ether followed by freeze-drying. All toggled peptides were characterized by MALDI-mass spectrometry (MS).3 Edman degradation was used for some toggled peptides to confirm the presence of similar ratios of variants in the mix (data not shown). For peptide design, the variability cutoff was 5% (i.e., all variant amino acids present in at least 5% of clade B database sequences were included in the mix) for Gag p24 and Pol, and 10% for Nef. This gave rise to toggles containing between two and 1296 peptide variants with a median of four variants per toggle product and only five toggled peptides with >100 variants in the mixture. The vast majority (87%, 148/170) of toggles had <20 variants in the toggle mixture and were adjusted for peptide concentrations so that each single peptide was present in the toggle at the same concentration as it was in the consensus overlapping peptide (OLP) preparation. The individual variant peptide concentration in the remaining 13% of toggles containing >20 variants (median 42) was lower depending on the number of variants, since the overall peptide concentration in DMSO would have exceeded solubility if adjusted, and higher solvent concentration could potentially cause adverse effects in the ELISpot assays due to DMSO toxicity.
ELISpot assays
ELISpot assays were performed as described previously (12), using 2 µg/ml coating Ab. PBMCs were incubated with consensus or toggled peptides overnight. Cells in medium without peptide were used as negative control (
6 wells), PHA was added as positive control. Responses were considered positive when exceeding all of the following criteria: four times mean negative wells, mean negative wells plus three SDs, 5 spots/well and 55 SFC/million input cells. Assays with more than mean three spots/well in the negative controls were disregarded. Testing 11 HIV-negative individuals with all toggled peptides resulted in 3 weak positive responses, representing a false positive rate of 0.1%. All chronic subjects had detectable responses to both consensus and toggled peptides.
For assessment of HIV-specific CD4 T cell responses, PBMCs were CD8 depleted by RosetteSep (StemCell Technologies) or with Dynabeads (Invitrogen life Technologies) according to the manufacturers instructions.
For simplicity and to allow for a direct assessment of responses to toggled vs consensus-based peptides on freshly isolated samples, all toggled as well as consensus peptides were tested separately without pooling. All assays for total T cell responses were done in duplicate, whereas assays using CD8-depleted samples were done without replicates.
Statistical methods
To assess the efficacy of toggle peptides, the relative frequency of yes/no outcomes using toggled peptide sets or consensus sequence peptides was predicted. For the breadth of responses, we considered whether there is or is not a response to a particular peptide. The natural tool for this sort of Boolean yes/no problem is binomial logistic regression: it amounts to making a generalized linear model (GLM) for the "logit"-transformed probability of a positive response (17, 18)
![]() |
There were 1060 duplicate reactions performed, and comparing duplicates gave us a foundation for defining a conservative cut-off for a significant difference in ELISpot scores. The absolute value of the difference between all duplicate reactions was determined, and 95% of the duplicates had differences of <160 spot forming cells (SFC)/million (median 40, IR 12–80).
Sequencing
Genomic DNA was isolated from PBMC samples using the QIAamp DNA blood mini kit. Nested PCR protocols with limiting dilution were used to amplify near full-length HIV genomes as previously described (21). Purified PCR products were directly sequenced using Clade B consensus primers. Sequence data were manually edited using Sequencher 4.6 (Gene Code Corporation). All available sequences were screened for subtype and potential recombination or contamination. All sequences were submitted to GenBank and are available under accession numbers DQ886031–DQ886038 for chronic samples, EF090287–EF090289 for controllers, and EF680862–EF680881 for acute samples.
| Results |
|---|
|
|
|---|
Despite extensive HIV sequence diversity on a full genome level, the variation in individual positions of viral proteins is often restricted to a limited number of amino acid substitutions that recur or "toggle" throughout the different HIV clades (22). Toggling residues often include amino acid pairs such as arginine (R) and lysine (K) or leucine (L) and valine (V), which are biochemically similar and are therefore likely to be tolerated by the virus without serious impacts on its replication efficiency. In addition, these changes are generally readily achieved as they can often be obtained by single nucleotide substitutions (23).
We designed a strategy to incorporate this considerably limited variability into peptide test sets that allow more accurate detection of virus-specific T cell responses. This is illustrated in Fig. 1 (and shown for HIV Pol in Table I), which provides detailed comparisons of amino acid variation in the Gag protein, showing levels of HIV subtype B population coverage at each position achieved by either a natural strain, a consensus sequence, or a toggled sequence incorporating the most common amino acids occurring at a given position. Gag p17 is representative of a more variable protein, while Gag p24 is representative of a highly conserved protein (24). Notably, variable positions are relatively infrequent and generally well dispersed among highly conserved positions, so that within the boundaries of a CD8 T cell epitope, which is typically nine amino acids long, a large fraction of HIV population diversity can be covered by a small number of variants.
|
|
Also of importance, the common amino acids found in the toggling positions tend to recur in different regional viral epidemics and thus provide improved in vitro test sets to assess immune responses across clades or in the entire HIV group M sequences (Fig. 2). Furthermore, toggled amino acids often have similar side chain chemistry, probably because such changes are less disruptive to protein structure and function and hence are tolerated by the virus (Table II). Despite their chemical similarities, such conserved changes have been found to still permit immune escape, which is presumably why they recur (25). There are, however, exceptions to this: some combinations of amino acids in a given position actually have very distinctive side chain chemistry (Table II). Nonetheless, these substitutions must still be tolerated in the protein structure and function, reflecting the need of the pathogen to balance effective immune evasion with structural and functional integrity of the viral proteins. For example, a relatively variable position in p24, presumably evolving under immune pressure, is constrained to vary between the amino acids Asn, Ser, and Thr. These amino acids all have hydroxyl groups that can form hydrogen bonds resulting in the termination of an
helix (Fig. 3a); retention of an amino acid with the capacity to form a hydrogen bond in this variable position is found throughout the entire primate lentiviral tree. So while the virus is rapidly changing in this position, it is limited in the way it can mutate. Furthermore, T cell epitopes thread in and out of the folded protein with the mutable positions preferentially located at the outside of the protein, likely dictated by functional and structural limitations. This is supported by the observation that the variable positions that toggle are more often found on the surface of the proteins than within (Fig. 3, b and c, Gag p24, p = 0.0007 and reverse transcriptase, p = 0.0109, two-sample Kolmogorov-Smirnov test). Thus, positions that can vary and the extent of variation are dictated by the protein, providing natural constraints that ultimately make a toggled peptide approach to coverage feasible.
|
|
|
For the initial proof-of-concept studies, sets of toggled peptides covering variation within clade B in the immunologically most highly recognized regions of the virus were used (12): Gag p24, Pol, and the central region of Nef. Non-consensus amino acid (s) that were found in at least 5% of all clade B sequences in the 2004 HIV database were included in the synthesis step for peptides spanning the Gag and Pol sequences. In cases where the HIV database contained multiple sequences per subject, only one sequence per individual was included in the alignment. For the more variable central portion of Nef this cut-off was set at 10%. Using these criteria, all but four positions had >90% population coverage by including one, two, or three amino acids. Most variable positions required only two alternative amino acids (Table II), and of the 178 peptides spanning the three proteins, 25 (14%) were so conserved that the consensus sequence provided adequate coverage. In these cases, no separate toggled counterparts to the consensus sequence were synthesized. All toggled peptides and their respective consensus counterpart, as well as the number of variants included in each toggle preparation, are summarized in Table III.
|
|
|
Toggled peptide sets yield more positive responses than consensus peptides
Total T cell responses were tested in 17 chronically HIV clade B infected individuals using an IFN-
ELISpot assay and either toggled or consensus peptides in duplicate wells (12). Applying a GLM (see Materials and Methods), we found a significantly higher probability for a positive response using the toggled peptides than the consensus (odds ratio 1.33, p = 0.0019). These data are illustrated in Fig. 4a, where the number of positive responses for each patient is shown. Although some responses were detected by either the consensus or the toggled peptides only, significantly more reactions were detected using the toggled peptides than the consensus peptides in 14 of 17 patients (p = 0.0038, Wilcoxon test). In 2 of 17 cases the number of responses to consensus and toggled peptides were identical, and in only one patient did the consensus OLP peptides elicit more reactions than the toggled peptide set. Thus, although some consensus peptide responses may be lost, there is a significant net gain of responses detected using toggled peptides.
|
Responses to the toggled peptides are of greater magnitude than responses to consensus OLP
We next assessed whether toggled peptides not only detected more, but also stronger in vitro responses. The magnitude of responses was plotted as magnitude in reaction to either the consensus OLP or the corresponding toggled peptide, and treated as paired events (Fig. 5). Responses where both OLP and toggled peptides reacted were separated from those where only the toggled peptides or the consensus OLP was positive. This approach showed overall significantly stronger responses using toggled peptides relative to the consensus OLP for both the total PBMC ELISpot (Fig. 5a) and the CD4 T cell ELISpot (Fig. 5b) comparisons (p = 2.8 x 10 –2 and p = 4.1 x 10–5, respectively, Wilcoxon test). In addition, the number of the increased responses was significantly greater for the toggled peptides than the consensus when only one of the two peptide test sets scored positive (total T cells: p = 7.3 x 10–8, CD4 T cells: p = 3.3 x 10–6, using a 1-sample proportions test). When both test sets elicited positive responses, only the total T cell ELISpot showed significantly more samples with gain in responses among the toggled peptides (total T cells: p = 0.0013, CD4 T cells: p = 0.33).
|
Toggled peptides could give a stronger response relative to the consensus if T cells are more efficiently triggered by a peptide variant sequence that is found among the toggled peptides but not the consensus. To better resolve how increased coverage of autologous sequences contributes to increased breadth and magnitude of responses, autologous HIV sequences from the study subjects were compared with the consensus sequence and to the toggled peptides (Fig. 6). The magnitude of detected responses was significantly higher when the match with the autologous sequence was improved by the toggles, relative to those cases where the consensus was as close to the autologous sequences as the toggles (p = 0.0054, Wilcoxon test, Fig. 6a). There were fewer sequences available for this comparison in CD4 data, as only Gag and Nef peptides were tested for CD4 T cell responses (Fig. 6b). There was a trend toward greater magnitude of responses elicited by toggles than consensus peptides, which did not, however, reach statistical significance (p = 0.1288).
|
| Discussion |
|---|
|
|
|---|
The benefits of such more comprehensive assessments of HIV-specific T cell responses may be crucial in evaluating the magnitude and breadth of T cell responses at the population level. The rank ordering of number of responses per individuals and the magnitude of responses to different peptides both differ between toggle and consensus (Fig. 4), which would impact the analyses of potential associations between breadth or magnitude of the virus-specific response and in vivo viral control (12, 13, 31). Just as assessing the T cell immune response by IFN-
expression alone gives only a limited view of T cell function (32), using only one peptide to probe T cell specificity fails to adequately detect T cell responses. Although autologous peptides are frequently considered the most appropriate test set, peptide test sets reflecting the individuals autologous viral sequences are prohibitively expensive (33), and given the changing nature of the immune response and viral sequence over time in a given individual, may only allow a limited view of the true extent of the immune response. Thus, toggled peptides may even outperform autologous peptides because they predominantly probe immune responses generated by the incoming virus, but also cover possible escape variants that may have induced de novo responses later in infection (34).
An alternative method has recently been described that also presents HIV sequence diversity in improved peptide test sets, the potential T cell epitope (PTE) approach. The PTE approach is based on a biometric algorithm that includes frequently found k-mer sequences from circulating strains in a sequence database to build a standardized panel of HIV peptides for CTL based vaccine evaluation (35). Similarly to the toggled peptides, PTE peptides detect significantly more and stronger responses than consensus based peptides (36). Toggled peptides and PTE peptides each have different virtues and issues. Each peptide has to be separately synthesized for the PTE approach, so it is more expensive, and the coverage attained with PTE peptides is somewhat reduced relative to toggled peptides (B. Korber and K. Yusim, manuscript in preparation). Toggled peptides provide a clear linear map along a protein, facilitating ease of interpretation and direct comparison to traditional single sequence overlapping peptide methods, but they are mixtures of peptides that may be less reproducible in synthesis, and they will provide less precise information regarding specific peptide reactivity than the PTE approach. Based on similar reasoning regarding the advantages of covering T cell epitope diversity, two recent vaccine Ag designs have been proposed that incorporate variants in an attempt to elicit more broadly cross-reactive T cell responses through vaccination. These are mosaic proteins, which are sets of in silico recombinant full-length HIV proteins that in combination provide optimal coverage of potential T cell epitopes and exclude rare or unique epitopes (9), and COT+ (11), which complements a center of tree (COT) full protein sequence with a set of protein fragments designed to optimize potential epitope coverage.
Despite the overall net advantage in using toggled peptides to detect T cell reactivity, some positive responses to the standard test peptide sets were lost or diminished when using the toggled peptides. Although the consensus sequence was always present in the toggled peptide, some responses, especially weak ones, may potentially be lost due to the dilution of the reactive peptide sequence in the toggle synthesis mix, or due to possible antagonistic or stochastic effects caused by the mixture of multiple sequence variants in the toggle preparation. Specific cases where a consensus OLP elicited response is not detected using toggled peptides could be identified and resolved by testing single variant sequences and all possible combinations thereof. This, in turn, could provide potentially crucial information for polyvalent vaccine Ag design, identifying nonimmunogenic or antagonistic sequence variants with good binding affinity that would preferentially be excluded in vaccine immunogens. In addition, testing toggled peptides in assay systems that detect multiple effector functions may also reveal specific sequence variants that could act as superagonists or mediate wide cross-reactivity, and which could thus represent premier candidates for vaccine immunogen design.
Finally, because relatively conserved regions of the viral genome were chosen for this study, the observed benefits of using toggled peptides when compared with consensus sequence based test sets reflect conservative increases, which may be even more significant when applying this approach to more variable proteins in HV and other variable pathogens (33). However, while designing toggled peptides for highly variable targets is feasible, it will, as illustrated in Fig. 1e, likely require balancing the desired threshold for sequence diversity coverage with the complexity of the toggled peptide mixtures. Indeed, while there was no overall association between toggle complexity and the magnitude of responses (p = 0.1, Spearman one-sided test), focusing on responses to toggled peptides containing >20 variants revealed an inverse correlation between the number of variants and the detected magnitude of the response (p = 0.009), suggesting that the inclusion of additional variants above a certain threshold may reduce the strength of the response in vitro. In contrast, these more diverse toggled peptides correspond to more variable regions of HIV genome, where the consensus peptides have a lower response detection rate (12). Thus, more population-representative, complex toggled peptides may detect responses where their consensus counterpart failed to do so, suggesting that a lower magnitude may be a reasonable price to pay for a more complete view of the HIV-specific immune response. One approach to minimize toggle complexity would allow for an experimentally determined maximum number of variants in each toggle preparation to limit dilution effects. In addition, rational limitations to specific polymorphisms, for instance amino acids that revisit the same sequence space in different clades within the M group, could help developing toggled peptides suitable as global reagents while not being affected by extensive dilution effects. Such limitations to diversity (3) make biological sense given the context of what we know regarding the predictability of drug resistance mutations and some immune escape mutations (37, 38). Of note, M group toggles perfectly match some sequences from multiple clades and recombinant forms (data not shown), emphasizing the global usefulness of the toggle approach especially considering the increasing frequency with which recombinant viruses are being described.
Taken together, toggled peptides are a cost effective means to enable better probing and descriptions of the immune response across populations, and have clear applications for other variable pathogens such as hepatitis C virus, for which truly comprehensive assessments of an individuals immune response are limited by sequence diversity in the infecting viral isolate.
| Disclosures |
|---|
|
|
|---|
| Footnotes |
|---|
1 This work was supported in part by Federal funds from the National Institute of Allergy and Infectious Disease and National Institute of Health Contracts Nos. N01-AI-15442 and AI-30024 (to B.D.W.), R01-AI-067077 (to C.B.), R56-AI-071726 (to C.B.), R01-AI-054178 (to T.M.A.), and R21-AI-055421 (to K.Y.). It was also supported by internal directed research funds (LDRD) from the Los Alamos National Laboratory and a Harvard University Center for Acquired Immunodeficiency Syndrome Research grant (to D.E.K). ![]()
2 N.F., D.E.K., K.Y., C.B., and B.T.K. contributed equally to this work. ![]()
3 Address correspondence and reprint requests to Dr. Bette T. Korber, MS K710, T-10, Los Alamos National Laboratory, Los Alamos, NM 87545. E-mail address: btk{at}lanl.gov; or Dr. Christian Brander, Massachusetts General Hospital, AIDS Research Center, 149 13th Street, Room 5234, Charlestown, MA 02129. E-mail address: cbrander{at}partners.org ![]()
4 Abbreviations used in this paper: MS, mass spectrometry; OLP, overlapping peptide; GLM, generalized linear model; SFC, spot forming cell; PTE, potential T cell epitope. ![]()
Received for publication May 29, 2007. Accepted for publication August 17, 2007.
| References |
|---|
|
|
|---|
interferon-secreting CD8+ T cells in primary HIV-1 infection. J. Virol. 77: 6867-6878. This article has been cited by other articles:
![]() |
E. L. Turnbull, M. Wong, S. Wang, X. Wei, N. A. Jones, K. E. Conrod, D. Aldam, J. Turner, P. Pellegrino, B. F. Keele, et al. Kinetics of Expansion of Epitope-Specific T Cell Responses during Primary HIV-1 Infection J. Immunol., June 1, 2009; 182(11): 7131 - 7145. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. C. Matthews, A. J. Leslie, A. Katzourakis, H. Crawford, R. Payne, A. Prendergast, K. Power, A. D. Kelleher, P. Klenerman, J. Carlson, et al. HLA Footprints on Human Immunodeficiency Virus Type 1 Are Associated with Interclade Polymorphisms and Intraclade Phylogenetic Clustering J. Virol., May 1, 2009; 83(9): 4605 - 4615. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. E. Wang, B. Li, J. M. Carlson, H. Streeck, A. D. Gladden, R. Goodman, A. Schneidewind, K. A. Power, I. Toth, N. Frahm, et al. Protective HLA Class I Alleles That Restrict Acute-Phase CD8+ T-Cell Responses Are Associated with Viral Escape Mutations Located in Highly Conserved Regions of Human Immunodeficiency Virus Type 1 J. Virol., February 15, 2009; 83(4): 1845 - 1855. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Yerly, D. Heckerman, T. Allen, T. J. Suscovich, N. Jojic, C. Kadie, W. J. Pichler, A. Cerny, and C. Brander Design, Expression, and Processing of Epitomized Hepatitis C Virus-Encoded CTL Epitopes J. Immunol., November 1, 2008; 181(9): 6361 - 6370. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Miura, M. A. Brockman, C. J. Brumme, Z. L. Brumme, J. M. Carlson, F. Pereyra, A. Trocha, M. M. Addo, B. L. Block, A. C. Rothchild, et al. Genetic Characterization of Human Immunodeficiency Virus Type 1 in Elite Controllers: Lack of Gross Genetic Defects or Common Amino Acid Changes J. Virol., September 1, 2008; 82(17): 8422 - 8430. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Rousseau, M. G. Daniels, J. M. Carlson, C. Kadie, H. Crawford, A. Prendergast, P. Matthews, R. Payne, M. Rolland, D. N. Raugi, et al. HLA Class I-Driven Evolution of Human Immunodeficiency Virus Type 1 Subtype C Proteome: Immune Escape and Viral Load J. Virol., July 1, 2008; 82(13): 6434 - 6446. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |