|
|
||||||||




* Institut für Immunologie, Johannes-Gutenberg-Universität Mainz, Mainz;
Institut für Zellbiologie/Immunologie, Universität Tübingen, Tübingen;
Institut für Biochemie, Charité, Humboldt Universität Berlin, Berlin, Germany;
La Jolla Institute for Allergy and Immunology, La Jolla, CA 92037; and
¶ Institut National de la Santé et de la Recherche Médicale, Unité 580 and
|| Université Paris Descartes, Faculté de Médecine René Descartes, Paris, France
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
In the ER, the peptides are again subject to proteolytic attack by aminopeptidases (11, 12, 13, 14, 15, 16), which synergize with MHC class I molecules to produce the final ligand (17). Two trimming enzymes, ERAP1 and ERAP2, have been identified in the ER of human cells with complementary trimming specificities (15), while mice express a homologue of ERAP1 but not ERAP2. Remarkably, ERAP1 prefers substrates 9–16 residues long (8, 16), which corresponds to the length of peptides efficiently transported by TAP. The recent availability of ERAP1 knockout mice has made it possible to study the impact of ERAP1 on MHC ligand generation and their recognition by T cells in vivo (18, 19, 20, 21, 22). These studies have shown changes in the epitope repertoire recognized in knockout mice, although with some discrepancies as to the magnitude and extent of the knockout impact. Overall, key enzymes of the ER trimming machinery are now well defined and their selective role in the generation of MHC class I ligands is clearly established.
Any selective process in MHC class I ligand generation is likely to leave a mark in the sequence composition of the ligands themselves or in their flanking regions. Previous studies have quantitatively characterized the sequence patterns preferred by proteasomal cleavage (23, 24, 25, 26, 27, 28), TAP transport (29, 30, 31, 32, 33, 34), and MHC binding (overviews in Refs. 35, 36, 37), and these patterns are indeed frequently found for MHC class I ligands. Most of these analyses focused on the C terminus of presented ligands, as N-terminal sequence patterns are complicated by the existence of different N-terminally prolonged ligand precursors. Although complicated, an analysis of the N-terminal processing motif can provide insights into the length distribution of ligand precursors generated during protein breakdown, the relative influence of cytosolic and endoplasmic trimming events, and the impact of the amino acid composition in the N-terminal extensions of ligand precursors for efficient ligand generation along the presentation pathway.
To address these questions, we here present a combined biostatistical and experimental approach to elucidate the role of amino acids in the sequence region preceding the N terminus of known MHC class I ligands. We demonstrate that the occurrence of these amino acids is not random but on the average forms a distinct processing motif, which in length and residue composition reflects the influence of proteasomal cleavage and TAP transport on Ag processing. However, for the two residues directly preceding the N terminus of MHC class I ligands, we found systematic deviations from the proteasome and TAP preference which we suspected to reflect the influence of trimming processes by aminopeptidases. To test this hypothesis, we determined the cytosolic and endoplasmic aminopeptidase activities toward all 20 amino acids. Relating the residue composition preceding MHC class I ligands to these measured peptidase activities provides evidence that removal of the two residues next to the N terminus of the final ligand is favored to take place inside the ER.
| Materials and Methods |
|---|
|
|
|---|
We refer to MHC class I ligands as peptides that are naturally presented on the cell surface bound to MHC class I molecules, which may or may not be epitopes capable of eliciting a T cell response. We extracted a set of 1565 human MHC class I ligands having a size between 8 and 11 residues with known protein source sequences from the SYFPEITHI database (38). Our analysis was restricted to those ligands being flanked on both sides by at least 20 residues to avoid border effects, such as the possibilities that aminopeptidase activity alone could liberate the ligand from its source or that endopeptidases could not find a binding motif. Excluding those cases leaves us with n = 1543 ligands.
Algorithm for proteasomal cleavage, TAP transport, and aminopeptidase trimming
The sequence dependent efficiencies of proteasomal cleavage, TAP transport and aminopeptidase trimming can be evaluated using the scoring matrices described below. For each of these processes, we use its scores to look for patterns in the residue occurrence upstream the N terminus of MHC class I ligands. We refer to f(d) as the score associated with sequence position d relative to the N terminus of an MHC class I ligand in a given protein. The scores f(d) were then averaged over all protein sequences available: S(d) = <f(d)>proteins and the SD stdev(f(d)) = root(
(S(d)–f(d))2/(n-1)) and SEM sterr(S(d)) = stdev(S(d))/root(n) were calculated. These average scores at fixed positions were compared with average scores calculated for random positions in the same set of proteins S* = <f(d = random)>proteins. The mean score at random positions S* was calculated 1000 times, which allows the determination of its mean <S*> and SD stdev(S*). In Fig. 1 and 4, the x-axis represents different values for position d, while the y-axis depicts the difference between the average score at a fixed position S(d) and the average score at random positions <S*> in units of the SEM sterr(S(d)).
|
|
The scores fTAP(d) for the TAP transport of fragments possessing the residue at position d as the N terminus were calculated using the matrix described in Ref. 31 . Here, only the contribution of the three N-terminal residues of a peptide on TAP transport is taken into account.
For the calculation of aminopeptidase scores fAP(d) associated with position d, the turnover rates of the residues at position d were taken from the experimental data described below. Logarithmic values of the turnover rates were used. This is coherent with the proteasomal cleavage prediction scores, which essentially give log(amount produced) values, and the TAP transport predictions, which give log(IC50) values.
Purification of the cytosolic and microsomal fraction
The purification of cytosol and microsomes is based on earlier protocols (39). The seven cell lines HeLa (human cervical adenocarcinoma), MC57 (mouse fibrosarcoma), RMA (mouse thymoma), J774 (mouse macrophage cell line), EL4 (mouse T cell lymphoma), D2SC/1 (mouse dendritic cell line), and RAW309 (mouse macrophage cell line) were each cultured in 2 l RPMI 1640 (Biowhittaker) with 10% FCS (SeraPlus) at 37°C. Adherent cells were detached by a solution of 0.3 mM EDTA in PBS (pH 7.2). After pelleting (1,000 x g, 5 min, 4°C), the cells were washed once with PBS and twice with protease assay (PA)-buffer (20 mM HEPES/KOH (pH 7.6), 150 mM KCl, 5 mM MgCl2, and 0.5 mM DTT). The following steps were conducted at 0–4°C. All buffers were filtered through a 0.45 µm membrane filter (Millipore) except buffers containing sucrose, which were filtered through a 1.2 µm membrane filter. A minimal volume of PA-buffer was added to the pellet before homogenization with an Elvehjem-potter (20x, 10–15 s up or down, 2000 rpm). The microsomes and the cytosol were purified by differential centrifugation, first for 10 min at 1,000 x g, then for 10 min at 10,000 x g, and last for 2.5 h at 140,000 x g. In the first two steps, the supernatant was taken for the next centrifugation and the pellet and the floating fat were discarded. In the last step, the sample was spun through a sucrose-cushion (1.3 M sucrose in PA-buffer; ratio sample to cushion 3:1) to separate cytosol and microsomes. Cushion and membrane-fractions were entirely removed and discarded. The supernatant, consisting of purified cytosol, was frozen in aliquots at –80°C before further use. The pellet, consisting of the microsomal fraction, was resuspended in PA-buffer and the same volume of stripping-buffer (50 mM EDTA and 1 M KCl in PA-buffer) was added to remove attached proteins. After incubation for 15 min at 0–4°C, the sample was centrifuged for 1 h at 140,000 x g through a sucrose cushion (0.5 M sucrose in PA-buffer; ratio sample to cushion 3:1). Cushion and supernatant were removed quantitatively and the pellet was washed with PA-buffer without disturbing it. The purified microsomes were lysed with PA-lysis buffer (1% Triton X-100 in PA-buffer) and stored in aliquots at –80°C.
Measuring aminopeptidase activity
The activity measurements with single-residue fluorogenic substrates were made in a 96-well flat-bottom transparent plate (Greiner Bioscience). For each amino acid, one well was filled with the particular cytosolic or microsomal fraction, the respective buffer (PA-buffer for the cytosol or PA-lysis buffer for the microsomes), and BSA with a final concentration of 1 mg/ml. Samples and substrates were then adjusted to 37°C. After reaching this temperature, the reaction was started by the addition of the fluorogenic amino acid amino-4-methyl coumarin (AMC) substrate (Bachem) with a final concentration of 100 µM. Immediately following the substrate addition, the fluorescence was measured in real fluorescence units (RFUs) over time with a SPECTRAFluor Plus (Tecan) fluorometer at a wavelength of 360 nm for excitation and 450 nm for emission at 37°C.
For the activity measurements with peptide substrates, ten peptides with the sequence H-XRGYVYQGL-OH (X being either E, P, S, Q, I, R, L, F, M, or W) were ordered from Schafer-N at >80% purity. A total of 10 x 2 µl of each peptide [1 µg/µl in water] and 2 µl of purified cytosol or 20 µl of purified microsomes (see above) were mixed with PA-buffer (20 mM HEPES (pH 7.6), 150 mM KCl, and 5 mM MgCl2) to a final volume of 500 µl and aliquoted at 50 µl. Aliquots were incubated at 37°C for 0 h, 10 min, 20 min, 30 min, 60 min, 90 min, or 120 min, the reaction stopped with 1% formic acid (F.A.), and the aliquots frozen at –20°C. Capillary liquid chromatography was performed with a Waters NanoAcquity UPLC system. Mobile phase A contained 0.1% F.A. in H2O and mobile phase B contained 0.1% F.A. in acetonitrile. Mass spectrometry analysis was performed using a Waters Q-Tof Premier in positive V-mode ESI. The mass spectrometer was calibrated with a [Glu1]-fibrinopeptide solution (500 fmol/µl at 300 nl/min) delivered through the reference sprayer of the NanoLockSpray source.
Determination of initial turnover rates
The initial velocity (v0) of the turnover for each amino acid was calculated by determining the slope of the linear initial part of the curve in the RFU over time diagram. This value (RFU*min–1) was first normalized to the used volume (RFU*min–1*µl–1) and then, to be able to compare different sets, displayed as a percentage of the overall activity, consisting of all 20 summarized single activities of the set: % of overall turnover = (v0 [RFUs*µl–1*min–1] of the respective amino acid)/((
v0 [RFUs*µl–1*min–1] of all 20 amino acids of the set)/100).
| Results |
|---|
|
|
|---|
We compared the amino acid frequencies at up to 15 sequence positions upstream the true N terminus of 1543 known MHC ligands (taken out of the SYFPEITHI database (38)) with average frequencies found within the whole source proteins (Table I). A
2 analysis shows that positions –5, –4, –3, –2, and –1 (numbers indicate the distance to the N-terminal residue of the final ligand) exhibit sequence profiles that deviate strongly from the randomly expected ones (p
0.001,
2 test). Such significant deviations were not detected in a previous analysis (40), most likely due to the more limited amount of data available at the time.
|
|
Fig. 1 compares how the processing motif matches the known sequence preferences of the constitutive proteasome, the immuno proteasome, and the TAP transporter. For this analysis, we applied sequence-based prediction algorithms to calculate proteasomal cleavage scores (28) (describing the specificity of the proteasome) for the various peptide bonds within the N-terminal extension and scores for the transport of the resulting peptide into the ER by TAP (31) (describing the specificity of TAP). As explained in the Materials and Methods section, the mean of these predicted scores was compared with the mean of reference scores calculated at random sequence positions of the source proteins. At each position up to five residues distal of the ligands N terminus, the mean cleavage scores for the constitutive and immuno proteasome are at least one SEM above those found at randomly chosen sequence positions. For TAP, the average transport score of precursors carrying an extension of up to eight residues is at least one SD above its random counterpart. Thus, amino acids preceding the N terminus of MHC class I ligands exhibit an accumulation of residues favoring proteasomal cleavage sites in this sequence region as well as the transport of the N-terminally extended peptides by TAP. This finding indicates that the N-terminal processing motif found in MHC class I ligand precursors results at least partly from the sequence selectivity of the proteasome and TAP.
Processing by the proteasome and TAP alone do not explain the observed N-terminal residue distribution
It is unlikely, however, that the processing motif is only influenced by the specificity of proteasomal cleavage and TAP transport. If N-terminally prolonged precursors are generated by the proteasome and transported by TAP, they have to be trimmed to their final size by aminopeptidases before presentation. If this trimming occurs with different efficiencies for different residues, it is expected to further influence the amino acid composition found in the N-terminal sequence region of MHC class I ligand precursors.
Based on this rationale, we examined the frequency of the amino acids A, F, I, L, M, V, W, and Y, which are all residues with an above average score for both proteasomal cleavage (28) and TAP transport (31) when averaging over the sequence window used in the predictions. These amino acids are expected to be found in frequencies above average in all positions within the processing motif. As seen in Table I, this is not the case; the residues W, I, and V are found significantly below average at the position –1 (residues V highly significantly so), which is the amino acid directly preceding the N terminus of an MHC class I ligand. However, further upstream from the N terminus, at positions –7 to –4, these residues do occur at frequencies above average. This preliminary analysis suggests that aminopeptidases are involved in MHC class I ligand processing and those responsible for cleaving precursors with the one residue extensions have a low activity toward W, I, and V.
Determination of cytosolic and endoplasmic aminopeptidase specificity
To test this hypothesis, we used fluorogenic single-residue substrates for all 20 amino acids to monitor aminopeptidase activities in purified cytosol or purified solubilized microsomes, the latter reflecting the activity in the ER. The usage of single-residue substrates does not provide information about length preferences of aminopeptidases and they are not physiological substrates. However, they have the advantage that they cannot be cleaved by endopeptidases, which are incapable of digesting single-residue substrates, and thus cannot interfere with the assay. In particular, our amino acid probes cannot be cleaved by the two major cytosolic endopeptidases 20S proteasome and TPPII (Fig. 2(a)).
|
As expected, there were relevant differences between the cytosolic and the endoplasmic turnover rates (Fig. 2, (c) and (d)). One of the most prominent examples for such a difference is proline, which is one of the preferred amino acids for aminopeptidases in the cytosol, whereas there is almost a complete lack of proline-specific peptidase activity in the ER. The absence of proline activity in the microsomal preparation is a good measure for the purity of this fraction, as there is a strong activity for this residue in the cytosol. Similar to proline, the amino acid tryptophane is also trimmed much more efficiently in the cytosol. The opposite tendency is observed for the amino acids glycine, leucine, and methionine, all of which are degraded more efficiently in the ER. These differences in the specificity of cytosolic and endoplasmic aminopeptidases should also be reflected by the processing motif.
Next, we evaluated whether the digestion rates determined with nonphysiological single-residue substrates are comparable with those of oligomeric peptides. Ten 9-mer peptides differing only in their N-terminal residue were incubated with the purified cytosol (Fig. 3(a)) and microsomal fraction (Fig. 3(b)). Time courses of peptide degradation were recorded (Fig. 3) with the rates of degradation being R > W, M > L, F > I, Q > S, P > E in the cytosol and L, M > F, R, Q > I, W > S > E, P in the microsomal fraction. This hierarchy is very similar with the one determined with the single-residue substrates, which supports their use in this study.
|
To clarify the influence of aminopeptidase activities on MHC class I ligand generation and, thus, on the processing motif, we compared the residue-specific turnover rates found in the cytosol and the ER (see Fig. 2, (c) and (d)) with the amino acid frequencies observed in the N-terminal extensions of the 1543 MHC class I ligands (Fig. 4). A significantly higher turnover rate is found in the ER compared with the cytosol for positions –2 and –1 (p < 0.001). A high turnover in the ER means efficient liberation of the final MHC class I ligand, whereas a low turnover in the cytosol increases the probability of a peptide to be transported by TAP. This suggests that the final trimming of MHC class I ligand precursors at sequence positions –2 and –1 should predominantly take place in the ER. This conclusion is supported by the above mentioned finding that the residues W, I, and V are not preferred in the ER compared with the cytosol (Fig. 2, (c) and (d)).
| Discussion |
|---|
|
|
|---|
We found a significant increase in predicted proteasomal cleavage sites up to five residues distal from the MHC class I ligand N terminus and a significantly better predicted TAP transport rate of precursors carrying an extension of up to eight residues. The latter finding is in agreement with the reported maximum length of peptides efficiently transported by TAP (42). Determination of aminopeptidase activities led to the conclusion that the aminopeptidases of the ER are much more efficient than their cytosolic counterparts in removing the first two residues next to the true N terminus of the MHC class I ligand. This suggests that the final steps for trimming of most ligands take place in the ER. We did not find a significantly increased susceptibility toward aminopeptidase activity for the residues beyond sequence position –2. This can be due to equal participation of aminopeptidases in the ER and in the cytosol in the removal of these residues or it can be due to primary cleavage of longer precursors by endopeptidases.
Combining our results with data gathered in numerous previous studies, we propose the following model for Ag processing (Fig. 5). The first step is the cleavage of proteins by the proteasome into oligopeptides of up to 25 residues carrying already the correct C terminus for most MHC class I ligands. Proteasomal cuts preferentially lie within, but may also lie outside, the N-terminal processing motif formed by the sequence region –8 to –1 next to the true N terminus of the MHC class I ligand. Oligopeptides with >15 residues are likely to be further cleaved by TPPII. Either way, the proteasome alone or the proteasome in concert with TPPII generate MHC class I ligand precursors with N-terminal extensions of up to eight residues. Precursors of this length and below can be efficiently transported into the ER by TAP. Trimming at positions –8 to –3 can occur in either of the two compartments by cytosolic or endoplasmic aminopeptidases. Residues at positions –2 and –1 are predominantly removed in the ER. This final trimming event is followed immediately or is even accompanied by MHC class I binding of the ligand to protect it from further degradation (17, 43).
|
As shown above, preferred cleavage sites for the proteasome can be found anywhere in various positions within the N-terminal processing motif and the same is likely to be true for TPPII, based on its peptide length preference. Hence, cleavage at residues further away from the ligand N terminus will be less dominated by the action of aminopeptidases, which lowers the probability to find clear aminopeptidase preferences at these positions. This again suggests that the significant values at positions –2 and –1 reflect a strong preference for endoplasmic trimming in this sequence.
A study by Nielsen et al. (27) was the first to derive a length distribution of MHC class I ligand precursors from proteasomal cleavage predictions. This was done by translating cleavage scores from the NetChop algorithm into cleavage probabilities using in vitro digest data. These probabilities were then used to describe MHC class I ligand precursor generation as a stochastic process. The resulting precursor length distribution showed a maximum extension length of about seven amino acids, with
30% of all ligands having N-terminal extensions of three or more residues. This is in remarkable agreement with our results.
A number of different prediction algorithms for proteasomal cleavage are available, such as NetChop[46], Pcleavage [47], and PaProc [29]. These methods do not completely agree on what the preferred amino acids motifs for proteasomal cleavage are. Repeating the present study with different algorithms could therefore lead to different results. Our choice of the MHC-pathway matrices for proteasomal cleavage is motivated mainly by the fact that these were developed by ourselves and they are based solely on in vitro experiments characterizing proteasomes, not on the frequencies of residues found surrounding MHC class I ligands, which would bias the current analysis.
The knowledge gained by our analysis is of importance for the further development of the in silico identification of MHC class I ligands in arbitrary protein sequences. Although sequence-based predictions for proteasomal cleavage, TAP transport, and MHC binding are available, a mathematical model that includes epitope trimming is missing so far. To fill this gap, our data could help to create new algorithms incorporating the processing of the N terminus of MHC class I ligands. Our data on the specificities of trimming proteases could help to optimize the design of linkers between different epitopes to promote optimal processing and therefore optimal epitope presentation in multiepitope constructs.
| Acknowledgments |
|---|
| Disclosures |
|---|
|
|
|---|
| Footnotes |
|---|
1 This work was supported by grants from the Deutsche Forschungsgemeinschaft (Sonderforschungsbereich 490, E6, and Z3 to S.T. and H.S.), EU 6FP 503231 to H.S., Hochschulbauförderungsgesetz Programme HBFG-122-605 to H.S., and Pacific Southwest Regional Center of Excellence for Biodefense and Emerging Infectious Diseases Grant U54 AI065359 to B.P. ![]()
2 M.M.S. and B.P. contributed equally to this work. ![]()
3 Address correspondence and reprint requests to Dr. Stefan Tenzer and Dr. Hansjörg Schild, Universitat Mainz, Hochhans au Argustoplatz, Obere Zahlbacher Strasse 67, D-55131 Mainz, Germany. E-mail addresses: tenzer{at}uni-mainz.de and schild{at}uni-mainz.de ![]()
4 Abbreviations used in this paper: ER, endoplasmic reticulum; PA, protease assay; AMC, amino-4-methyl coumarin; RFU, real fluorescence units; F.A., formic acid. ![]()
Received for publication March 16, 2007. Accepted for publication December 12, 2007.
| References |
|---|
|
|
|---|
can stimulate post-proteasomal trimming of the N terminus of an antigenic peptide by inducing leucine aminopeptidase. J. Biol. Chem. 273: 18734-18742.
-induced aminopeptidase in the ER, ERAP1, trims precursors to MHC class I-presented peptides. Nat. Immunol. 3: 1169-1176. [Medline]This article has been cited by other articles:
![]() |
L. E. Valentine, J. T. Loffredo, A. T. Bean, E. J. Leon, C. E. MacNair, D. R. Beal, S. M. Piaskowski, Y. C. Klimentidis, S. M. Lank, R. W. Wiseman, et al. Infection with "Escaped" Virus Variants Impairs Control of Simian Immunodeficiency Virus SIVmac239 Replication in Mamu-B*08-Positive Macaques J. Virol., November 15, 2009; 83(22): 11514 - 11527. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Hearn, I. A. York, and K. L. Rock The Specificity of Trimming of MHC Class I-Presented Peptides in the Endoplasmic Reticulum J. Immunol., November 1, 2009; 183(9): 5526 - 5536. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Motozono, S. Yanaka, K. Tsumoto, M. Takiguchi, and T. Ueno Impact of Intrinsic Cooperative Thermodynamics of Peptide-MHC Complexes on Antiviral Activity of HIV-Specific CTL J. Immunol., May 1, 2009; 182(9): 5528 - 5536. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. L. S. Colaco Comment on "Characterizing the N-Terminal Processing Motif of MHC Class I Ligands" J. Immunol., September 15, 2008; 181(6): 3731 - 3731. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |