Abstract
The genetic variations of the HIV-1 virus and its human host constitute major obstacles for obtaining potent HIV-1-specific CTL responses in individuals of diverse ethnic backgrounds infected with different HIV-1 variants. In this study, we developed and used a novel algorithm to select 184 predicted epitopes representing seven different HLA class I supertypes that together constitute a broad coverage of the different HIV-1 strains as well as the human HLA alleles. Of the tested 184 HLA class I-restricted epitopes, 114 were recognized by at least one study subject, and 45 were novel epitopes, not previously described in the HIV-1 immunology database. In addition, we identified 21 “elite” epitopes that induced CTL responses in at least 4 of the 31 patients. A majority (27 of 31) of the study population recognized one or more of these highly immunogenic epitopes. We also found a limited set of 9 epitopes that together induced HIV-1-specific CTL responses in all HIV-1-responsive patients in this study. Our results have important implications for the validation of potent CTL responses and show that the goal for a vaccine candidate in inducing broadly reactive CTL immune responses is attainable.
The major obstacle in the development of new treatment and vaccine strategies for HIV-1 is the virus’ remarkable genetic variability and ability to mutate. HIV-1 genetic variants appear rapidly within an individual, resulting in numerous closely related viral variants. Globally, the genetic variation of HIV-1 has resulted in the emergence of nine subtypes (A-D, F-H, J, and K), as well as circulating recombinant forms (CRF),3 emphasizing the need for a vaccine to induce broadly reactive immune responses. The cellular immune response exerted by CD8+ CTLs is known to be a crucial part of the immune defense against HIV-1 infection. In vivo evidence for the impact of Ag-specific CTL responses on HIV-1 evolution, viral replication, and disease progression comes from several studies (1, 2, 3, 4). Although the characteristics of a protective HIV-1-specific CTL response are still unknown, several studies indicate that recognition of promiscuous epitopes in conserved regions is preferred (5, 6, 7). Such epitopes might be less sensitive or prone to induce mutations and might also be recognized independent of the HIV-1 subtype.
Ag-specific CTLs recognize Ags presented as 8- to 11-mer peptides (epitopes) by the HLA class I molecule on the APC. The HLA system is extremely polymorphic, and has a diverse distribution among different ethnic groups (8). Thus, the genetic variation of the host contributes to an ever more challenging situation in terms of identifying a broadly reactive CTL response. Recently, however, an analysis of HLA class I molecules with significant overlap among peptide-binding specificities has allowed a functional classification of HLA class I molecules into 12 distinct supertypes (9). Instead of taking into consideration several hundred HLA alleles, assembling HLA alleles into HLA class I supertypes, based on their ability to bind and present similar epitopes (8, 9), would enable large cohort studies of the association between immune responses against a limited number of HLA class I supertype-restricted epitopes and HIV-1 disease progression (10). Importantly, such an approach should allow for identification of promiscuous HLA class I epitopes eliciting broad population coverage.
In this study, bioinformatic prediction methods were used to select a set of 184 HLA class I supertype-restricted epitopes from the Gag, Pol, Env, Nef, and Tat regions for maximum coverage of HIV-1 strains belonging to the five HIV-1 subtypes, A, B, C, D, and CRF01_AE. Identification of an anti-HIV-1-specific CTL response restricted to seven HLA class I supertypes was determined using the IFN-γ ELISPOT assay in 31 HIV-1-infected patients. Using this approach, we have verified responses against 114 of the predicted epitopes. Some epitopes were highly immunogenic and induced a response in >10% of the study subjects. Of the immunogenic epitopes, we identified nine epitopes, which together gave 96% coverage of the study population. The epitopes restricted to the seven HLA class I supertypes were identified in patients with different genetic backgrounds, and infected with different HIV-1 subtypes, thus proving the success of an advanced bioinformatics epitope prediction and selection tool to identify immunogenic elite HLA class I supertype-restricted epitopes.
Materials and Methods
Study subjects
Thirty-one-treated and untreated HIV-1-infected subjects from the Karolinska University Hospital (Solna, Sweden) were identified for the study (Table I⇓). To enable sample collection from a diverse patient population, our only inclusion criterion was a detectable plasma viral load measurement >50 RNA copies/ml obtained within 6 mo before inclusion in the study. This criterion was set to ensure detectable CTL responses (11). The study was approved by the Regional Ethical Council (Stockholm, Sweden (2005/1162)) and all subjects provided written informed consent.
Clinical and virological characteristics of the study population at sampling time point
Prediction, selection, and identification of HLA supertype-restricted HIV epitopes
The CTL epitope predictions were made based on a dataset consisting of 322 HIV-1 strains for which the Gag, Pol, Env, Tat, and Nef proteins had been completely sequenced and which were publicly available in the HIV Molecular Immunology database (www.hiv.lanl.gov). The number of strains for each of the five HIV-1 subtypes used in the prediction were: 40 subtype A (A1 and A2), 119 subtype B, 149 subtype C, 4 subtype D, and 10 subtype CRF01_AE. Epitopes restricted to either of the following HLA class I supertypes were predicted: A1, A2, A3, A24, B7, B44, and B58 using the NetCTL 1.0 method (12) (available at www.cbs.dtu.dk/services/NetCTL). In the NetCTL method, each possible nonamer in a protein is assigned a score based on a combination of prediction of proteasomal cleavage, TAP-transport efficiency, and HLA class I affinity with the highest weight assigned to the HLA class I affinity. For each of the seven HLA class I supertypes and for each of the five HIV-1 proteins from each of the 322 HIV-1 strains, between one and five (depending on the size of the protein) of the top-scoring nonamers were selected as the predicted epitopes. This resulted in a total of 5652 unique predicted epitopes. To reduce this set, a novel algorithm called EpiSelect, was used. The EpiSelect algorithm aims at selecting a given number of predicted epitopes in a way so that most strains are covered with the largest number of epitopes, while maximizing the coverage of the viral strain with the smallest number of epitopes. The algorithm is implemented as follows: for each viral strain, a coverage value is recorded as the number of selected epitopes targeting this viral strain. This number is initially zero for all viral strains. Also, a list of promiscuity values Pij is recorded for each epitope as the set of viral strains (i) covered by a given predicted epitope j. Initially, the epitope that covers the largest number of different viral strains is selected, and the coverage values are updated accordingly. The next epitope is then selected by choosing the epitope that maximizes the relation
where the sum is over the set of viral strains. Pij is the promiscuity value for epitope j targeting the viral strain i (this value is 1 if the epitope is present in the viral strain i, otherwise 0), Ci is the coverage value for the viral strain i, and δ is an offset value defining the penalty for leaving viral strains untargeted. Small values of δ will focus the epitope selection toward maximal coverage of the different viral strains, whereas large values of δ will focus the selection toward epitopes present in most different viral strains. The value for δ = 0.001 was determined empirically to achieve optimal balance between viral coverage and viral promiscuity. Next, the coverage values are updated accordingly. This procedure is repeated until the desired number of predicted epitopes has been selected. Using this algorithm, a total of 174 predicted epitopes were selected (supplementary table 1).4 The final set included an additional 10 epitopes that were known from the literature as immunogenic CTL epitopes restricted by HLA-A2 (13) or HLA-B57/58 (supplementary table 1). The peptides were synthesized by Cambridge Research Biochemicals (Cleveland, U.K.) on a submicroscale in parallel synthesis. Stocks of peptides were dissolved in DMSO and kept at −70°C at concentrations of 10 mg/ml. Final concentrations of 2 μg/ml of each peptide were used in the assays.
Pan-specific prediction of peptide-HLA binding
We predict peptide-HLA binding using the pan-specific NetMHCpan method that allows for quantitative binding predictions for peptides to all HLA class I molecules of known sequence (14). The NetMHCpan method is a data-driven method trained on quantitative peptide primary HLA sequence data. In the original publication predictions were limited to HLA-A and HLA-B loci molecules. Here, we extend the method to also include the HLA-C loci.
Immunological measurement
PBMC were isolated by Ficoll gradient centrifugation and stored in liquid nitrogen until used. The frozen PBMC were gently thawed, washed, and resuspended in RPMI 1640 Glutamax (Invitrogen Life Technologies) containing 10% FCS and 0.1 U/ml penicillin and 0.1 μg/ml streptomycin at a concentration of 1–2 × 106 cells/ml. The cell concentration and the viability were evaluated using a Nucleocounter (ChemoMetec) based on a fluorescence microscope counting propidium iodide-labeled cell nuclei and analyzed by the software Nucleo View. The cell viability was above 88%, with a mean of 93%.
The production of IFN-γ by Ag-specific T cells was detected using the ELISPOT assay (15). Flat-bottom 96-well Multiscreen IP plates (MSIP4510; Millipore) were activated by adding 15 μl 35% EtOH per well during 1 min, washed four times in PBS, and coated with an anti-IFN-γ mAb, 1-D1K (Mabtech), 5 μg/ml, at 4°C for 4 h. The plates were washed four times in PBS. The PBMCs were plated in duplicate at a concentration of 1–2 × 105 cells/well along with peptides at a final concentration of 2 μg/ml per peptide, and incubated at 37°C for 16 h. As a negative control, cells were incubated with medium only to determine the background responses for each patient. The HIV-Gag p55 peptide pool (BD Biosciences) was included as a control of the HIV-1-specific response, and a CMV, EBV, and Flu (CEF) control peptide pool (National Institutes of Health, Germantown, MD) as well as staphylococcal enterotoxin B (SEB; Sigma-Aldrich) were added as positive controls. The plates were washed and incubated with a biotinylated anti-IFN-γ mAb, 7-B6-1 (Mabtech), 1 μg/ml, for 1 h at 37°C. The wells were washed, and alkaline phosphatase-conjugated streptavidin (Jackson ImmunoResearch Laboratories) was added and the plate incubated in room temperature for 1 h in dark. Before adding the substrate, the plate soaked for 1 h in PBS with 0.1% Tween 20 (Sigma-Aldrich) to decrease background signal. The plates were developed by a 20-min incubation with the substrate Vector Blue Alkaline Phosphatase Substrate kit III (SK-5300; Vector Laboratories), and the spot-forming units (SFU) (3) were counted by computer-assisted image analysis using the software KS ELISPOT 4.9 and an Axioplan 2, Imaging system 2 microscope (Carl Zeiss). The net number of SFU per million PBMC was calculated as follows: (number of spots against relevant peptide) − (number of spots against medium only) × (106 × concentration of cells in the well).
To establish background responses in our ELISPOT assay against the HIV-1 peptides, we analyzed PBMC samples from eight healthy volunteers. The background response induced by medium alone was low with a median of 64 SFU per million PBMC (interquartile range (IQR): 37–82). Neither of the peptide pools induced responses in any of the eight HIV-negative donors (median 70, IQR: 43–116 SFU/million cells). Likewise, the background response in our 31 HIV-1-infected study subjects against medium alone was generally low (median: 36, IQR: 22–72 SFU/million PBMC). A sample was considered positive when the responses were at least three times the experimental background and above 80 SFU/million PBMC. Also, due to limitations in detecting >400 SFU per well, the maximum response in the assay was 2000 SFU/million PBMC. All study subjects had detectable CTL responses against the control Ag SEB (supplementary table 1).
HIV-1 sequencing and phylogenetic analysis
HIV-1 RNA was extracted from 200 μl plasma using the QIAamp MinElute Virus Spin kit (Qiagen). Extracted viral RNA was eluted in 30 μl of sterile water with the addition of 1 μl RNase inhibitor (Promega). To increase the viral copy numbers in patient samples with low viral load (<10.000 HIV-1 RNA copies/ml); virus was extracted from 1 ml of plasma.
cDNA synthesis was performed using the SuperScript III RT kit (Invitrogen Life Technologies) with gene-specific primer 5′-TGC TGT CAT CAT TTC TTC TAR TGT-3′ (HXB2 nt position 1836–1813) (0.15 μM), and oligo deoxythymine (oligo dT) (0.15 μM). Our primers were selected to amplify the HIV-1 Gag p17 region (HXB2 nt position 790-1431), using a nested PCR (13). Excess primers, nucleotides, and enzymes were removed using the GFX PCR DNA and Gel Band Purification kit (GE Healthcare).
The sequencing reaction was conducted using the BigDye Terminatior version 3.1 Cycle Sequencing kit (Applied Biosystems), purified through Sephadex G-50 (GE Healthcare) in an Multi Screen-HV plate (Millipore), and detected in the ABI Prism 3130xl Genetic Analyzer (Applied Biosystems). Sequences were imported and manually edited using Sequencher software (Gene Codes). For the phylogenetic analysis, sequences were aligned with reference sequences for each subtype obtained from the HIV Sequence database (www.cbs.dtu.dk/services/NetCTL) using BioEdit (Citeline). Phylogenetic trees were constructed using the software MEGA3.1 (Center for Evolutionary Functional Genomics, The Biodesign Institute, Tempe, AZ). The sequences have been submitted to GenBank (GenBank accession numbers: EU300947–EU300975).
HLA genotyping
Isolation of genomic DNA from patient PBMC samples was performed using the Qiagen EZ1 BioRobot. PCR was conducted to amplify HLA-A, -B, -C, genes using gene-specific primers for each HLA gene (16). Following PCR, samples were examined by agarose gel electrophoresis and purified using Millipore’s Montage PCR96 plates. The purified PCR products were sequenced using an ABI PRISM BigDye Terminator Cycle sequencing system and analyzed with ABI Prism 3100 Genetic Analyzer or 3700 Genetic Analyzer. High-resolution class I HLA genotyping was conducted using computer software based on a taxonomy-based sequence analysis method (17).
Statistical analyses
Statistical analysis and graphical presentations were performed with GraphPad Prism software. Correlations were calculated using Spearman’s nonparametric rank test (two-tailed), and comparisons using the Mann-Whitney nonparametric t test. Analysis with a p value of <0.05 was considered statistically significant.
Results
PBMC and plasma samples were obtained from 16 men and 15 women who had been HIV-1 infected through different routes. Our 31 study-subjects composed a genetically diverse population originating from 13 countries, carrying 18 HLA-A alleles, 24 HLA-B alleles, and 19 HLA-C alleles. They also varied broadly in terms of age, stage of infection, and treatment history, and were infected by different HIV-1-subtypes and CRF (Table I⇑). Eleven patients were infected with subtype B, 5 with subtype A1, 7 with subtype C, 1 with subtype D, 2 with CRF02_AG, two with CRF01_AE, and 1 with the complex CRF06_cpx (AJGK). A template amplification and subsequent HIV subtype analysis from plasma was unsuccessful for 2 patients due to low viral loads. The broad genetic background and distribution of different HIV-1-subtypes made the cohort well-suited for this study, aiming at identifying candidate components in a vaccine and simplify identification of correlates for an effective immune response in a global perspective.
Prediction of HLA class I supertype-restricted epitopes
To identify broadly immunogenic HIV-1-specific CTL epitopes, we first used a bioinformatics method (NetCTL) to predict HLA class I supertype-restricted epitopes. We then selected a number of predicted epitopes that together constitute a broad overall coverage among different HIV-1 subtypes using a newly developed algorithm: EpiSelect. A total of 174 predicted and 10 experimentally verified HLA class I supertype-restricted epitopes located in Gag, Pol, Env, Nef, and Tat were selected (Table II⇓, and supplementary table 1). To reach an optimal coverage among subtypes our final set included 85 variants optimized to reflect variations in 35 epitopes. The HLA-A1, -A2, -A3, -A24, -B7, -B44, and -B58 supertypes were chosen, because they together will give an optimal coverage in the human population (>98%) independent on ethnicity (8). On average, each of the 322 reference HIV-1 strains (subtypes A, B, C, D, and CRF01_AE) used for the prediction was covered by 54 epitopes (minimum 29). The peptides were collected into 24 pools, with 7–8 peptides per pool, and distributed so that each HLA class I supertype was represented once or twice. Two additional pools were added and analyzed separately: one containing only the immunodominant HLA-A2-restricted epitope 77SLYNTVATL85 (SL9), located in the HIV-Gag p17 region, and the other containing five other well-identified immunogenic HLA-A2-restricted epitopes (Gag-433FLGKIWPSHK442, Env-121KLTPLCVTL129, Env-813SLLNATAIAV822, Nef-83GALDLSHFL91, and Nef-83AAVDLSHFL91).
Number of predicted HLA supertype-restricted epitopes within the HIV-regions Gag, Pol, Env, Nef, and Tata
Immunogenicity of HLA class I supertype-restricted epitopes
Ag-specific CTL responses were determined using PBMCs from 31 HIV-1-infected individuals by detection of IFN-γ production in the ELISPOT assay (Fig. 1⇓A, and supplementary table 1). All but one subject tested responded positively to the HIV-p55 control peptide pool, CEF peptide pool, and against at least one of the peptide pools containing predicted HLA class I supertype-restricted epitopes (including pools A2 and SL9). One patient did not respond to any of the 184 selected epitopes nor did he show a positive response against the HIV-p55 control peptide pool, thus indicating a very low HIV-1-specific CTL reactivity. To identify the specificity of the responses, the individual peptides from the pools giving responses were analyzed in a second ELISPOT assay (Fig. 1⇓B). All study subjects showed a clear HIV-1 specific CTL response toward one or more peptides within each responding pool (Fig. 1⇓B, and supplementary table 1). In most patients, broad responses against several individual epitopes were confirmed; a median of six epitopes per patient (IQR 4–11) were recognized (Fig. 2⇓A).
Identification of HIV-1-specific CTL response. To identify the specificity of the responses, the production of IFN-γ by Ag-specific T cells was first detected using the ELISPOT assay against pools of peptides and control Ags (A), and second against the individual peptide within the pools (B). Most of the study subjects responded to peptides restricted to several HLA supertypes. The figure shows the results of HIV-specific CTL responses in patient 13.
Broad CTL responses detected against epitopes restricted by the seven HLA supertypes. We found HIV-specific CTL responses toward a median of six epitopes per patient (A). The immunogenicity frequency of epitopes restricted by the seven HLA supertypes (i.e., percent of epitopes recognized in at least one patient) were similar (B).
Of the tested 184 HLA class I supertype-restricted epitopes, as many as 114 (62%) were recognized by at least one study subject, and 45 were novel epitopes, not previously described in the HIV Molecular Immunology database (18). Targeted epitopes were equally distributed among the 7 HLA class I supertypes included in the study and were not skewed toward a specific HLA class I supertype (Fig. 2⇑B). The magnitude of the epitope-specific CTL responses was generally high with a median response per peptide of 587 SFU/million PBMC (IQR 319–686) (supplemental table 1). The majority of the predicted HLA supertype-restricted epitopes in Gag, Pol, Env, and Nef were shown to be immunogenic in at least one subject (Fig. 3⇓). However, only 3 (38%) of the tested Tat epitopes induced a HIV-specific CTL response (Fig. 3⇓A). Overall, the magnitude of the combined responses detected against HLA class I supertype-restricted epitopes in HIV-Gag (median 732 SFU/million PBMC) was not significantly different to that detected against the HIV-p55 control peptide pool (median 777 SFU/million PBMC) (p = 0.971, Mann-Whitney t test). Similarly to the HIV-Gag epitopes, strong responses were detected against Nef epitopes with a median response of 708 SFU/million PBMC (p = 0.704, Mann-Whitney t test) followed by Pol-specific responses at a median of 500 SFU/million PBMC (p = 0.236, Mann-Whitney t test). Although strong responses against Env epitopes were detected, the median response for all subjects (84 SFU/million PBMC) were significantly lower than against the Gag epitopes (p = 0.025, Mann-Whitney t test). The epitope-specific CTL responses detected against Tat was also significantly lower than against Gag (p < 0.0001, Mann-Whitney t test).
HLA supertype-restricted CTL responses detected against the majority of predicted Gag, Pol, Nef, and Env epitopes. Of the predicted epitopes restricted to the Gag, Pol, Env, Tat, and Nef regions, the Nef epitopes showed to be highly immunogenic in terms of percent (96%) of predicted epitopes recognized (A). Each subject responded to the median of two epitopes located in the HIV-Nef region and one epitope located in the HIV-Gag, -Pol, and -Env regions (B).
Cross-clade recognition of highly immunogenic epitopes
We next evaluated the immunogenicity of the 114 HLA class I supertype-restricted epitopes and identified 21 “elite” epitopes that induced CTL responses in at least four patients, corresponding to 13% of the study subjects (Fig. 4⇓). Taken together, we found that 90% of the HIV-specific CTL responding study population recognized one or more of these highly immunogenic epitopes. When comparing the breadth of the response detected in our study subject toward the number of “elite” epitopes to the response against the total set of 184 predicted epitopes, we found a strong significant correlation (p < 0,001, r2 = 0,760, Spearman rank correlation) (Fig. 5⇓). These results prove that by using a limited set of optimal epitopes, it is possible to obtain information on specificity, magnitude, and breadth of individual HIV-specific CTL responses.
Cross-clade recognition of HLA supertype-restricted elite epitopes. A total of 21 of the HLA supertype-restricted epitopes were highly immunogenic and induced a CTL response in at least four subjects. Shown is the subtype the responding subjects were infected with and at which frequency the epitope sequence is found among the HIV-1 subtype reference strains. 1, The color represents the frequencies of the exact epitopes sequence in the different subtypes; white: 0%, light grey: 1–24%, dark grey: 25–49%, and black: ≥50%. 2, Subtype variants of the same epitope; nd, not determined.
The breadth of responses detected against the 21 elite epitopes correlated significantly with the total breadth of responses. There was a significant correlation between the response to the 21 elite peptides and the entire set of 184 tested epitopes, suggesting that the elite peptides can be used as a tool to measure the breadth, magnitude, and functional characteristics of CTL responses against epitopes located in different HIV regions.
The elite epitopes contained seven subtype variants of three epitopes restricted by HLA-A2 (Nef), -A3 (Nef), and -B7 (Env) (Fig. 4⇑). In response to HLA-A2-restricted epitopes in Nef (83AAVDLSHFL91 and 83GALDLSHFL91), five subjects infected with subtypes A, B, and CRF02_AG cross-recognized the two variants. One subject, infected with a subtype C virus, recognized the Nef-83GALDLSHFL91 variant only. Similarly, in response to the B7-Env epitope variants, three subjects infected with subtypes A, B, and CRF01_AE showed cross-reactive responses, while one subject infected with CPX06 and one infected with subtype C recognized one variant each. In response to the three A3-Nef variants, we found that patients infected with subtypes A, B, and D, cross-recognized the 84AVDLSHFLK92 and 84ALDLSHFLK92 variants, but not the 84AFDLSFFLK92 variant more prevalent in subtypes C and CRF01_AE subtype variants of the same epitope (Fig. 4⇑).
To evaluate the ability of cross-clade recognition of the immunogenic epitopes, we conducted a phylogenetic analysis to establish the infecting viral subtype for each study subject and looked at the frequency at which the epitope sequence was present in our reference set of viral strains belonging to a certain subtype. Importantly, most of the 21 “elite” epitopes were recognized by patients infected with different subtypes, but we found some differences in cross-subtype recognition (Fig. 4⇑). Four patterns of cross-subtype recognition were found: 1) fairly conserved epitopes among all tested subtypes were recognized by patients infected with several different subtypes (e.g., A1-Nef 73QVPLRPMTY81), 2) the epitope sequence was not frequently found in the reference subtype strains but was still recognized in some patients infected by several different subtypes (e.g., A24-Env 680WYIKIFIII688), 3) epitopes more prevalent in a certain subtype were recognized in patients infected with several different subtypes (e.g., B7-Env 843IPRRIRQGF851), and 4) highly subtype-specific sequences and responses were seen against some epitopes indicating the inability of cross-clade recognition or presentation (e.g., A2-Pol 464ILKEPVHGV472). Overall, our results are very promising as the majority of the highly immunogenic epitopes were recognized in individuals infected with a broad range of HIV-1 subtypes.
Identification of a limited set of HLA-restricted epitopes for optimal coverage
As described above, the EpiSelect algorithm was first used to select a set of predicted epitopes that gave a good coverage of a large set of HIV-1 strains. After obtaining the immunogenicity data for the predicted epitopes in each of the patients in the study, we again used the algorithm; this time to select the epitopes that gave the best coverage of the patients in terms of experimentally verified immunogenicity. Fig. 6⇓ shows the 20 first peptides selected by EpiSelect and the cumulative number of responses they gave rise to in each of the 31 patients in the study. The figure shows that a set of only nine peptides can cover 30 of 31 (96%) of the patients.
The top 20 peptides selected by the EpiSelect algorithm to provide optimal coverage in terms of immunogenicity. From top to bottom, the rows indicate the order in which the peptides were selected. The columns 1–4 give the number, sequence, HLA supertype restriction, and HIV proteins sequence, respectively, for each peptide. The columns 5–34 each correspond to one of the patients in the study. Each row gives, for each patient, the accumulative number of peptides that are immunogenic in each of the patients.
HLA type and responses to predicted HLA supertype-restricted epitopes
The majority of subjects with multiple responses recognized epitopes predicted to bind to several HLA class I supertypes. To evaluate how well the predicted epitope restriction explained responses detected in our study subjects, we successfully obtained high-resolution HLA typing results from 26 subjects (Table I⇑). In total, our subjects recognized 225 epitopes of which 149 (66%) could be explained by the donor HLA allele belonging to the predicted HLA supertype. Next, we extended the analysis beyond the supertypes and investigated whether the recognized epitopes could be explained directly in terms of restriction to one of the donor HLA class I alleles. We predicted the rank of the epitope in the source protein using the pan-specific NetMHCpan prediction method (14) for the six HLA class I alleles of the donor. If the restriction allele could place the epitope within the top 10% we say we can successfully explain the epitope, if not that we have failed. By using this method, we can explain 85% of the identified 9-mer epitopes from the HLA types. These results show that by using bioinformatic tools, we have successfully been able to predict, and most importantly detect, broadly immunogenic HLA class I-restricted CTL responses.
Discussion
The HLA alleles have the highest diversity of all human genes and allow cellular immune responses in different persons to be directed against different epitopes. They are responsible for the wide variety of T cell epitopes in HIV-1 and also determine the specificity and repertoire of the CTL response. In addition to the genetic differences of the host, the high evolutionary rate of HIV-1, influenced by changes in the environmental selection pressures, greatly influences the efficacy and specificity of HIV-specific CTL responses. Together, the genetic variation of the host and the virus constitute major obstacles for obtaining and evaluating the potency of HIV-specific CTL responses. We developed and used an algorithm capable of selecting epitopes that together constitute a broad coverage of the different HIV-1 strains. In a highly diverse group of patients infected with different HIV-1 subtypes we have successfully identified a limited set of highly immunogenic epitopes, inducing cross-clade HIV-specific CTL responses.
In this study, we applied a novel algorithm, EpiSelect, to select HLA class I supertype-restricted epitopes in HIV-Gag, Pol, Env, Nef, and Tat to reach a wide-ranging coverage among different HIV-1 subtypes. The set of HLA class I supertype-restricted epitopes were predicted using a program combining predictions of HLA-I binding, TAP transport, and proteasomal cleavage (12), a proven method for screening for immunogenic HLA class I restricted CTL epitopes in viral infections and of potential utility as candidates of virus diagnostics and vaccines (19). Using this approach, we were able to detect epitope-specific CTL responses in 30 of 31 subjects, showing an HIV-specific immune response against the majority of the 184 predicted or experimentally identified epitopes, of which 39% are novel epitopes. By obtaining high resolution HLA typing, we could confirm that 66% of the responses could be attributed a HLA allele belonging to a certain HLA supertype. In addition, a total 85% of the detected epitope-specific T cell responses could be explained by the predicted binding of the epitope to one of the donors HLA allele. Our results support a recent finding that HLA supertypes often provide an oversimplification of the HLA specificity space (20). However, and more importantly, we demonstrate that the observed epitope promiscuity is indeed a direct result of overlapping HLA specificities and that these overlaps are predictable using advanced bioinformatical methods for pan-specific HLA-peptide binding. Longitudinal studies will have to be performed to evaluate the efficacy of individual responses. The design of a vaccine candidate (protein or DNA) which facilitates processing and presentation of such epitopes from full-length regions, would increase the probability of eliciting CTL responses in a global setting, especially because the genetic background of the population might be an important factor for vaccine efficacy.
The efficacy of an HIV-specific CTL response is dependent on the intricate kinetics between viral evolution causing immune escape, the fitness cost of the mutation, and recognition of escape variants. Recently, Fernandez et al. (21) showed that moderate levels of vaccine-induced wild-type specific CTLs capable of killing transiently reverting virus-infected cells, after infection with an escape mutant, were more beneficial in terms of control of viremia than the fitness cost induced by the escape mutant. Complete maintenance of escape virus seen in subjects with high-level immunodominant T cell responses against the wild-type epitope did not result in improved virological control, pointing out that vaccines inducing broad subdominant responses may be more beneficial (21, 22). Lately, studies have shown that a broad cross-recognition is associated with a slower disease progression and a better control of the virus (6, 7). Also, the high mutation rate of HIV-1 makes it impossible to identify CTL epitopes that are conserved among all subtypes, which is why the identification of cross-reactive between-clade T cell responses would be important to achieve in vaccine models (23, 24). In our study, we successfully identified several broadly immunogenic elite epitopes, including six with sequence variants recognized in a diverse patient population, which constitute promising candidates for vaccine-induced cellular immune responses. Immunogenicity analysis of the variant epitopes tested in this study show that three (A2-Nef, A3-Nef, and B7-Env) were cross-recognized by responding subjects infected with different subtypes. However, while two of three A3-Nef variants (AVDLSHFLK and ALDLSHFLK) were cross-recognized by HLA-A3-positive subjects infected with subtypes A, B, and D virus, the third variant (AFDLSFFLK) was not. The AFDLSFFLK variant is predominant in viral strains of subtypes C and CRF01_AE, and four of the six responding subjects recognizing this variant were infected with exactly these subtype variants of the same epitope. The differential targeting of the A3-Nef AFDLSFFLK variant indicates that this epitope sequence is presented, but that the differences may alter the TCR contact in such way, that another T cell repertoire is needed for recognition. Alternatively, this epitope is not efficiently presented by the same HLA alleles as the two other variants, and is thus not recognized by the same subjects (5). This notion is supported by that only 50% of subjects recognizing this epitope variant were HLA-A3 positive. Three additional broadly immunogenic epitopes (A24-Env WYIKIFIII, A24-Gag HYMLKHLVW, and B7-Pol SPAIFQSSM) had subtype variants that were cross-recognized by less than half of the responding subjects. Additional in-depth functional studies will have to be conducted to rule out whether these changes hampered binding to the HLA class I, or prohibited recognition by the TCR of the responding T cell population. Together, our results emphasize the complex interaction between the responding TCR repertoire and the configuration of the HLA-peptide complex, because some epitope variants are efficiently targeted by responding T cells, while others are not (13, 25, 26, 27). Our results are nonetheless promising as they show that many HLA class I supertype-restricted epitopes are recognized in subjects infected with different subtypes. Some of these epitopes are present at a low frequency in most subtypes, while some were more conserved between the subtypes.
In line with the results obtained in different cohorts of clade B- and C-infected subjects using overlapping peptides, we found a hierarchical targeting of our selected set of HLA class I supertype-restricted epitopes in Gag, Nef, Pol, Env, and Tat (28, 29, 30, 31, 32, 33), listed in order of immunogenicity. Almost all of the studied Nef-epitopes were recognized in at least one patient. The identification of Nef epitopes to be stimulated by a preventive HIV vaccine may be of particular interest, as the HIV-Nef region has been shown to be preferentially targeted in primary HIV-1 infection (34, 35). Also, obtaining Gag-specific T cell responses is highly relevant as the breadth of HIV-Gag-specific T cell responses has lately been associated with lower viral loads in subjects infected with subtype B or C virus (30, 32, 36, 37, 38, 39, 40). Although Env, due to its high variability, is not thought to be an optimal T cell Ag, we have identified at least two cross-clade recognized epitope sequences that could be used to optimize a future vaccine candidate, especially, because Env Ag optimization is required for obtaining Ab responses (41) and is thus likely to be included if both B and T cell immunity is to be reached. HIV-Pol responses may have been studied to a lesser extent as a T cell immunogens, although Pol responses may have important implications. New treatment regimens have had a dramatic impact on the mortality among HIV patients, but resistance to antiretroviral therapy is rapidly emerging during low adherence or suboptimal therapy (42). As a consequence, the risk for transmission of resistant HIV variants is a major threat against the last few years’ success with antiretroviral therapy (42, 43, 44, 45, 46). Importantly, we and others have shown that treatment-mediated selected mutations act as novel epitopes (47, 48, 49, 50, 51, 52). Considering the important role of the cellular immune responses in a durable control of drug-resistant HIV-1 (53), we suggest that vaccines targeting epitopes in Pol may be important.
Today, the comparison of induced immune responses by different vaccine candidates is severely hampered by the limitations of the commonly used overlapping peptide sets to identify the breadth and specificity of the response, as they invariably will underestimate or even fail to detect responses against putative autologous epitopic variants (13, 31, 54, 55). Methods have been developed for designing polyvalent T cell vaccines based on designing mosaic proteins with maximal peptide coverage of protein sequences (23, 56) or selecting CTL epitopes with maximum coverage (57). In the study by Fischer et al. (23), sets of one, three, four, or six mosaic proteins with maximal 9-mer peptide coverage were designed for each of the subtypes B and C, and for the M group. Here, we have, as suggested earlier, designed polyvalent T cell vaccines using sets of CTL epitopes with maximum coverage of the genomic variation of the pathogen (57). We compared our 21 elite epitope sequences to the sets of mosaic proteins and found that 12, 13, and 13 of the elite epitopes could be found in the subtypes B and C, and M group sets, respectively, with just one mosaic protein. Compared with the sets containing six mosaic proteins, the vast majority; 18, 16, and 21, of the 21 elite epitope sequences could be found in the subtypes B and C, and M group sets, respectively. The large overlap between the elite epitope set and the mosaic protein sets suggested by Fischer et al. (23) strongly supports the notion that sequence conservation is a key feature in defining broadly reactive T cell epitopes. However, it is important to stress that the mosaic protein construction is a whole protein approach that does not consider Ag presentation and peptide immunogenicity. Furthermore, the use of a limited set of epitopes to evaluate the function and phenotype of specific T cells is of particular relevance to gain insight into the mechanisms important for immune control.
Using a very limited set of HLA class I supertype-restricted epitopes, we have shown that it is possible to detect and evaluate both the magnitude and breadth of epitope-specific CTL responses in a genetically diverse population infected with different HIV subtypes. Our findings are of great relevance for the detection of HIV-specific CTL responses for the validation of vaccines designed to generate promiscuous CTL responses with a global coverage and to simplify the identification of the efficacy of responses restricted by a multitude of HLA alleles.
Acknowledgments
We thank Dr. Ma Lou and Chris Czarnecki at the Department of Medical Microbiology (University of Manitoba, Manitoba, Canada), for help with the high-resolution HLA typing. The CEF Control Peptide Pool was obtained through the Division of AIDS, National Institute of Allergy and Infectious Diseases, AIDS Research and Reference Reagent Program, National Institutes of Health.
Disclosures
The authors have no financial conflict of interest.
Footnotes
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
↵1 This work was supported in part by grants from the Swedish Agency for International Development Cooperation (2005-001756), the Swedish Research Council (K2007-56X-20345-01-3), the Swedish Physicians against AIDS Research Foundation, and National Institutes of Health Grants R01 AI60379 and P01 AI71713 and contract HHSN266200400083C.
↵2 Address correspondence and reprint requests to Dr. Annika C. Karlsson, Department of Virology, The Swedish Institute of Infectious Disease Control, SE-171 82 Solna, Sweden. E-mail address: annika.karlsson{at}smi.ki.se
↵3 Abbreviations used in this paper: CRF, circulating recombinant form; CEF, CMV, EBV, and flu; SEB, staphylococcal enterotoxin B; SFU, spot forming unit; IQR, interquartile range.
↵4 The online version of this article contains supplemental material.
- Received October 16, 2007.
- Accepted January 28, 2008.
- Copyright © 2008 by The American Association of Immunologists
















