|
|
||||||||



,
* Department of Microbiology and Carter Immunology Center, University of Virginia, Charlottesville, VA 22908, and Departments of
Chemistry and
Pathology, University of Virginia, Charlottesville, VA 22906
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
- and
-chains immediately after synthesis in the endoplasmic
reticulum, blocks the peptide binding site, and retargets the complex
to the endosomal/lysosomal compartment (5). Endosomal
proteases digest most of the invariant chain, leaving a set of peptides
called class II-associated invariant chain peptide associated
with the class II peptide binding site (2, 4). HLA-DM
catalyzes the dissociation of the class II-associated invariant chain
peptide peptides, allowing for the subsequent binding of other peptides
present in the endosomal compartment (6, 7). Although the assembly, transport, and peptide binding of class II MHC molecules is relatively well understood, little is known about the processes and proteases that form these peptides. Although endosomes contain a variety of proteases (reviewed in Ref. 4), few have been directly implicated in the generation of class II-associated peptides. Cathepsin D has been implicated in the generation of T cell epitopes from OVA, hen egg lysozyme, sperm whale myoglobin, and insulin based on in vitro digestion to produce antigenic fragments (8, 9, 10, 11, 12). However, splenocytes from cathepsin B or cathepsin D-deficient mice process and present multiple antigenic determinants normally, including those from OVA and hen egg lysozyme (13). Recently, a newly identified asparaginyl endopeptidase was shown to generate a proteolytic intermediate in the processing of a tetanus toxin Ag (14). This study also suggested the existence of at least two processing events: an initial generation of precursor fragments from the intact protein and secondary processing to an optimal size for MHC binding and/or T cell recognition. However, it remains unclear whether processing is completed before binding to class II MHC molecules in the endosome, or whether some processing of MHC-associated peptides occurs subsequent to binding.
Binding of peptides to class II MHC molecules involves several hydrogen bonds between residues in the binding site and the peptide backbone, as well as contacts between specific peptide side chains and pockets in the binding site (15, 16, 17). In contrast to class I MHC molecules, the ends of the class II molecule binding site are open and do not constrain the length of class II-associated peptides. Indeed, sequence analysis of peptides associated with class II MHC molecules has identified peptides of lengths between 10 and 34 residues (1, 18, 19, 20). Furthermore, many of these class II-associated peptides comprise nested sets that contain the same core sequence but vary in length at the amino or C-terminal ends. However, little information has been gathered concerning the relative abundance of related peptides or the factors that determine the preferred length of class II-associated peptides.
Previous sequence analyses of class II MHC-associated peptides have relied on either Edman degradation or mass spectrometry (1, 18, 19, 20, 21, 22). The utility of Edman degradation is limited by the complexity of the peptide mixture even after HPLC fractionation. Chicz and colleagues (1, 20) used mass spectrometry to correlate masses in individual HPLC fractions with Edman data, and thus inferred the identity of several peptides from different human class II alleles. Although this improves on the use of Edman degradation alone, the mass accuracy of the instrument used does not allow for truly unambiguous identification of such peptides. Only by using tandem mass spectrometry (MS/MS)4 is it possible to select an individual peptide ion and subject it to collision-activated dissociation to generate an unambiguous sequence (19, 22).
In the present study, we used a high throughput mass spectrometric technique together with automated database searching of collision-activated dissociation spectra to determine candidate sequences for >700 different peptides bound to the human class II MHC molecule HLA-DR*0401. Next, we used Fourier Transform mass spectrometry to analyze 142 peptides that were members of nested sets and determine their relative abundances. Using this information, we provide a comprehensive view of the distribution of peptides within a nested set that are associated with this class II MHC molecule. Our analysis has also allowed us to draw inferences regarding the proteolytic processes that lead to final peptide products displayed by HLA-DR4.
| Materials and Methods |
|---|
|
|
|---|
The human B lymphoblastoid cell line PRIESS (kindly provided by B. Nepom, Virginia Mason Research Center, Seattle, WA) is homozygous for HLA-DR*0401 (DRA*0101, DRB1*0401). The hybridoma LB3.1 secretes an mAb that specifically recognizes human HLA-DR molecules (23). Both cell lines were maintained in RPMI 1640 supplemented with 10% FBS and 2 mM glutamine.
Isolation of HLA-DR*0401-associated peptides
PREISS cells (5 x 108) were lysed in 5 ml of lysis buffer (20 mM Tris-HCl (pH 8.0), 150 mM NaCl, 1% 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate, 5 µg/ml aprotinin, 10 µg/ml leupeptin, 10 µg/ml pepstatin A, 5 mM EDTA, 0.04% sodium azide, and 1 mM PMSF) for 1 h at 4°C. The lysate was clarified by centrifugation at 16,000 x g for 30 min at 4°C, and the supernatant was precleared by incubation with 100 µl recombinant protein A-Sepharose beads (Amersham Pharmacia Biotech, Piscataway, NJ) for 4 h at 4°C. After removal of the beads, the supernatant was incubated overnight at 4°C with 100 µl recombinant protein A-Sepharose beads to which 2 mg of LB3.1 Ab had been bound. The beads were subsequently washed twice in lysis buffer, four times in 20 mM Tris-HCl (pH 8.0), 150 mM NaCl, twice in 20 mM Tris (pH 8.0), 1 M NaCl, and three times in 20 mM Tris-HCl (pH 8.0). Peptides were eluted in acid and separated from beads, Ab, and class II MHC molecules by passage through a 10,000-day cutoff ultrafiltration unit (Millipore, Bedford, MA).
Mass spectrometry
An aliquot corresponding to 1 x 107
cell equivalents of the peptide mixture described above was analyzed by
nanoflow HPLC-electrospray ionization coupled directly to an ion trap
mass spectrometer (24, 25). Following data-dependent MS/MS
analysis (24, 25), spectra with characteristic features of
peptide fragmentation were searched against the nonredundant
database maintained at the National Center for Biotechnology
Information using the SEQUEST algorithm (26) to identify
candidate sequences. Selected sequences were confirmed by manual
interpretation of MS/MS spectra and their identification as members of
nested sets. A second aliquot corresponding to 1 x
106 cell equivalents of the peptide mixture was
subsequently analyzed by nanoflow HPLC coupled to electrospray
ionization on a home-built Fourier transform mass spectrometer
(27). Full scan mass spectra (300
m/z
5000) were acquired at approximately one scan
per second. Peptide ions were correlated with those identified in ion
trap experiments by relative HPLC retention time and accurate mass.
Peptide abundance was estimated based on the observed ion current.
Because peptides in nested sets are chemically similar to one another,
it is reasonable to expect that they will exhibit similar ionization
efficiencies.
| Results |
|---|
|
|
|---|
The peptides associated with immunoaffinity-purified HLA-DR*0401
molecules were extracted and analyzed by nanoflow HPLC interfaced to
electrospray ionization on an ion-trap mass spectrometer. By using a
long, shallow HPLC gradient and by acquiring mass spectra in the
data-dependent mode (24),
2000 nonredundant peptide
MS/MS spectra were acquired in a single nanoflow HPLC interfaced to
electrospray ionization on an ion-trap mass spectrometer run. A search
of these MS/MS spectra against the NCBI nonredundant database
using SEQUEST yielded 701 candidate peptide sequences. Based on
estimates of
10,000 peptides presented by class I and II MHC
molecules on the surface of a typical lymphoid cell (28),
this represents
7% of the total peptide repertoire presented by
HLA-DR*0401 on these cells. Among these 701 candidate peptide
sequences, 273 were identifiable as members of nested sets (defined as
groups of peptides that share core sequences but have distinct amino
and C-terminal ends). By manual interpretation of the MS/MS spectra, we
confirmed the sequences of 142 peptides that comprised 11 nested sets
(Table I
).
|
|
To facilitate understanding the factors contributing to the
presentation of nested set peptides, we used three predictive
algorithms to determine the likely binding frames of the peptides
in each nested set. The three binding algorithms used different
methodology, but predicted the same preferred frame in each case. For
human Ig
-chain (Ig
)140162, human
-soluble N-ethylaleimide-sensitive factor
attachment protein148163, human
lysosomal-associated multitransmembrane protein
(LAMP)234252, and
HLA-C150169, these algorithms identified single
binding frames that were contained within all peptides comprising these
sets and had relative affinities significantly higher than any
other predicted frame (Table III
). The
nested sets from HLA-C150169,
HLA-B150170,and
HLA-E148167 are derived from homologous regions
of these three class I MHC molecules, and the sequences
contained in any two shares 1518 of 19 possible residues. A single
homologous high-affinity binding frame accommodated 5/5, 14/16, and
10/11 peptides from these sets, respectively. Similarly, the nested
sets HLA-C5274,
HLA-B5172, and
HLA-A4974 were also derived from homologous
regions of these proteins, and share 20/22 residues. In fact, 12
peptides were derived from a region that is identical in HLA-B and -C
molecules; and therefore, could come from either or both of these
molecules. A common homologous high-affinity frame accounted for the
binding of 28/32 peptides from the combined HLA-B and -C sets and 27/36
of the peptides from the HLA-A4974.
|
The two nested sets derived from bovine serotransferrin precursor (BSP)554572 and human heat shock cognate 71-kDa protein (HSC71)482499 had two frames of similar predicted affinity that were shifted by one residue with respect to one another. These two frames also accommodated the same sets of peptides. In addition, no predicted alternate binding frame can account for the two peptides of the BSP554572 set that terminate at Q, and thus do not contain the complete primary predicted frame. In these cases, it cannot be determined what proportions of these peptides will be displayed by HLA-DR*0401 in one or both frames.
Distribution of peptide length and abundance within a nested set reveals holes in the presented peptide repertoire
With the information about likely binding frames in hand, we
reexamined in more detail the abundance of individual peptides within
two representative nested sets containing a minimum of 11 unique
members. The 16-member HLA-B150170 set was
dominated by four peptides (154168 (15 mer), 153168 (16 mer),
153169 (17 mer), and 152169 (18 mer)) that collectively accounted
for almost 80% of the total peptide abundance (Fig. 1
, upper panel). In
contrast, eight peptides in this set were each present at <1.5%. The
distribution of peptide lengths within this set is relatively regular
and continuous, and the dominant peptides were related to one another
by the gain or loss of single residues at either the amino or the C
termini. A small number of dominant species was also evident in the
Ig
140162 set, with the most abundant three
peptides (140158, 145158, and 145159) accounting for 64% of the
total peptide abundance (Fig. 1
, lower panel).
However, the 140158 species was not related to the other dominant
peptides by the gain or loss of a single residue at the amino terminal
end, but instead involved a difference of five residues. More
generally, Ig
-derived peptides beginning at residues 140 or 145 were
relatively frequent, while those beginning at 141144 were much less
abundant. At the extreme, no Ig
peptides beginning at
R142 were found, leading to a distribution
"gap" and two "clusters" of peptides based on their amino
termini. It is unlikely that an alternate binding frame could account
for this discontinuous distribution, since only a single binding frame
of any significance was identified (Table III
). Thus, these features
are most likely due to the specificity of the proteolytic enzymes that
produce these peptides during Ag processing.
|
The data of Fig. 1
, taken together with the differences in
dominant and mean peptide lengths among the nested sets and the
predicted binding motif information, suggested the possibility that the
proteases responsible for the generation of the peptides associated
with HLA-DR4 preferentially cleave at certain residues. Alternatively,
the HLA-DR4 binding site might sterically inhibit access of a
nonspecific exoprotease or endoprotease to the ends of an already bound
peptide, even though those ends might not be immobilized in the site.
To gain more information about these possibilities, we summed the
abundances of all peptides in each of the nested sets that either began
or ended at a particular residue. For example, 5 of the 18 peptides in
the Ig
nested set ended with N158, and their
summed abundance is 59.5 x 107 (Table I
).
Similarly, six peptides from this set begin with
K145, and their summed relative abundance is
73.1 x 107. By combining this data with the
predicted binding frames in Table II
, we examined whether the
distribution of peptide ends was related simply to binding site
proximity, or suggested specificity in proteolysis (Fig. 2
).
|
140162 set was there significant
representation of peptides that terminated at P11: the most abundant
C-terminal residues for the remaining 10 sets were usually at the P12
or P13 positions. While the paucity of peptides terminating at P11
could be due to steric inhibition of protease access, we also observed
that certain residues were preferred cleavage sites although they were
more distant from the apparent end of the binding site. Thus, peptides
of the HLA-C5274,
HLA-B5172, and
HLA-A4974 sets terminated at R at the P12 or
P16 positions with a similar frequency, and peptides terminating at one
of these two residues were more abundant than those terminating at
intervening residues. R was also found as a strong secondary C terminus
at the P13 position for peptides of the
HLA-C150169 and
HLA-B150170 sets. Similarly, Q was the
preferred C-terminal residue for the
HLA-C150169,
HLA-B150170, and
HLA-E148166 sets. While Q occupies the P12
position in the first two sets, it occupies the P13 position in the
latter, although the sequences of the binding frames for these nested
sets are otherwise highly homologous. These preferred cleavage sites
were not explainable as steric hindrance due to the use of a secondary
binding frame, since these frames all would have protected amino
terminal, rather than C-terminal residues. The enhanced representation
of certain C-terminal residues in multiple sequences at different
positions is consistent with the hypothesis that they are due to
selective protease activity rather than simply by steric constraint due
to proximity to the peptide binding site.
In contrast to the C termini, dominant amino termini were often found
immediately adjacent to the peptide binding site at the P(-3) position.
In two cases (human
-soluble NSF attachment protein
(NSFA)148163 and
HSC71482499), the dominant terminal residues
were at the P(-2) position. In neither case could this observation be
accounted for by a secondary binding frame. It is also more difficult
to discern evidence for the specificity of proteases that produce the
amino termini of the remaining nested sets. These observations are
consistent with the idea that steric constraints on protease activity
are less severe at this end of the peptide binding site, or that
substantially complete processing of peptide amino termini occurs
before stable HLA-DR4 binding.
| Discussion |
|---|
|
|
|---|
Based on hydrogen bond interactions and visibility in the x-ray crystallographic structure of HLA-DR*0401 (DRA*0101, DRB1*0401) associated with the collagen II peptide 11681180 (17), as well as other crystal structures of HLA-DR molecules (15), only 12 peptide residues interact with the peptide binding site of this molecule. Furthermore, studies using sequentially smaller peptides have shown that 11 mers bind to HLA-DR4 with affinities comparable to those 13 mers (29). Our work confirms on a large scale that the average naturally occurring class II-associated peptides is four to seven residues longer than necessary for maximal hydrogen bonding between the peptide and the class II molecule. However, perhaps most surprising was that 12 mers corresponding to this "ideal" binding site were not observed among the peptides of most nested sets, or if observed, were of low relative abundance. Although it is formally possible that these additional residues contribute to the affinity of these particular peptides, these observations suggest that most peptides interact initially with the class II binding site in the context of larger precursors, which are then proteolyzed further at both amino and C-terminal ends. The additional residues beyond the minimum necessary for binding site occupancy are either protected from proteolytic removal after the peptide is bound to the class II molecule or are a reflection of the nonrandom cleavage of peptides by endosomal proteases. However, in a small number of cases, a fraction of the peptides from individual nested sets terminate at residues that lie within the peptide binding site. Such peptide termini likely represent dominant protease cleavage sites that are used early in the degradation process.
Several observations are consistent with the idea that the final form of peptides displayed by HLA-DR*0401 reflects the action of proteases with preferences for certain amino acid side chains, rather than simply amino and carboxyl peptidase digestion of exposed ends until steric hindrance prevents further action. First, the average peptide lengths and length of the most dominant species differ among nested sets. Second, when individual species were aligned, some sets showed evidence of gaps or low frequency of occurrence of particular amino or C termini. Third, preferential occurrence of R and Q at the C terminus was observed when different nested sets were compared. This preference was only partly dependent on the absolute position relative to the peptide binding site. The specificities of proteases that have been localized to endosomes are rather broad (4), and so it may be somewhat surprising that any evidence of selectivity in proteolysis was observed in this work. However, with the exception of a recently described asparaginyl endopeptidase (14), none of these proteolytic activities have been directly implicated in the generation of class II-associated peptides other than those derived from the invariant chain. Our work may offer useful insights into the characteristics of proteases that are important in this process.
Previous work had established that nested sets were a typical feature of peptides associated with class II MHC molecules (1, 18, 19, 20, 21, 22). Indeed, 4 of the 11 sets described in the present work were previously identified with a more limited number of members (1). However, we have shown in this study that while they tend to be dominated by a relatively small number of species, many such sets are quite complex. Previous studies have demonstrated that T cells may differentially recognize different members of such sets. For example, the crystal structure of hen egg lysozyme5062 restricted by I-Ak showed that residues 61 and 62 had very weak electron density, consistent with their being considerably solvent exposed (30). However, two-thirds of the T cells that respond to this epitope recognize the peptide 4863, but not the minimal 4861 peptide (31). Therefore, nested sets may add a level of antigenic diversity that increases immunogenicity. It will be interesting to determine how different members of these nested sets are viewed by the immune system.
|
|
| Footnotes |
|---|
2 J.D.L. and F.M.W. have contributed equally to this work. ![]()
3 Address correspondence and reprint requests to Dr. Victor H. Engelhard, Carter Immunology Center, University of Virginia, P.O. Box 801386, Charlottesville, VA 22908. E-mail address: vhe{at}virginia.edu ![]()
4 Abbreviations used in this paper: MS/MS, tandem mass spectrometry; Ig
, human Ig
-chain; BSP, bovine serotransferrin precursor; NSFA, human
-soluble NSF attachment protein; LAMP, human lysosomal-associated multitransmembrane protein; HSC71, human heat shock cognate 71-kDa protein. ![]()
Received for publication June 19, 2002. Accepted for publication August 29, 2002.
| References |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
F. R. Depontieu, J. Qian, A. L. Zarling, T. L. McMiller, T. M. Salay, A. Norris, A. M. English, J. Shabanowitz, V. H. Engelhard, D. F. Hunt, et al. Identification of tumor-associated, MHC class II-restricted phosphopeptides as targets for immunotherapy PNAS, July 21, 2009; 106(29): 12073 - 12078. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Lippolis and T. A. Reinhardt CENTENNIAL PAPER: Proteomics in animal science J Anim Sci, September 1, 2008; 86(9): 2430 - 2441. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Tunheim, K. W. Schjetne, I. B. Rasmussen, L. M. Sollid, I. Sandlie, and B. Bogen Recombinant antibodies for delivery of antigen: a single loop between {beta}-strands in the constant region can accommodate long, complex and tandem T cell epitopes Int. Immunol., March 1, 2008; 20(3): 295 - 306. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. M. Zangani, M. Froyland, G. Y. Qiu, L. A. Meza-Zepeda, J. L. Kutok, K. M. Thompson, L. A. Munthe, and B. Bogen Lymphomas can develop from B cells chronically helped by idiotype-specific T cells J. Exp. Med., May 14, 2007; 204(5): 1181 - 1191. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. D. Meiring, B. Kuipers, J. A. M. van Gaans-van den Brink, M. C. M. Poelen, H. Timmermans, G. Baart, H. Brugghe, J. van Schie, C. J. P. Boog, A. P. J. M. de Jong, et al. Mass Tag-Assisted Identification of Naturally Processed HLA Class II-Presented Meningococcal Peptides Recognized by CD4+ T Lymphocytes J. Immunol., May 1, 2005; 174(9): 5636 - 5643. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. A. Munthe, A. Os, M. Zangani, and B. Bogen MHC-Restricted Ig V Region-Driven T-B Lymphocyte Collaboration: B Cell Receptor Ligation Facilitates Switch to IgG Production J. Immunol., June 15, 2004; 172(12): 7476 - 7484. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. W. Purcell and J. J. Gorman Immunoproteomics: Mass Spectrometry-based Methods to Study the Targets of the Immune Response Mol. Cell. Proteomics, March 1, 2004; 3(3): 193 - 208. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |