|
|
||||||||
Faculty of Life Sciences, Bar-Ilan University, Ramat-Gan, Israel.
| Abstract |
|---|
|
|
|---|
light chain genes. We show that,
assuming that there are no selection mechanisms responsible for
abolishing cells expressing two light chains, the repertoire of newly
generated B lymphocytes exiting the bone marrow must contain a
significant fraction of such
double-productive B
cells. | Introduction |
|---|
|
|
|---|
:
ratio among B
lymphocytes or the degree of allelic exclusion vs allelic inclusion
(the simultaneous expression of two alleles of the same receptor chain
gene in the same cell) have received much attention
(6, 7, 8, 9, 10, 11). Theoretical evaluation of models were suggested
as explanations of repertoire characteristics, followed by experimental
testing of the predictions of these models. These studies have yielded
a detailed picture of how gene rearrangement interacts, causally and
temporally, with the processes of receptor-based selection and how all
these factors in turn influence the observed population dynamics of
developing lymphocytes (12, 13, 14). There are, however, many
pieces still missing in this fascinating puzzle.
The question of allelic exclusion is one of the most elusive problems
related to lymphocyte repertoire development (15).
Allelically included cells, in which both receptors are expressed on
the cell surface, are potentially dangerous because the lower
expression levels of each of the two receptors may lead to escape of an
autoreactive receptor from central tolerance (16). Protein
chains of B and T cell receptors vary in the degree of allelic
exclusion or inclusion they exhibit. Gene rearrangement is linked to
transcription, and chromatic remodeling markers such as DNA methylation
and histone acetylation distinguish rearranged from unrearranged
alleles (17, 18, 19), although the exact role of these markers
in induction of allelic exclusion remains to be elucidated
(20, 21, 22). Expression of the pre-B cell receptor
(BCR)3 on the cell surface has
been shown to be required for the induction of heavy chain allelic
exclusion (23), although neither of the components of the
surrogate light chain
5 or VpreB, is required in itself for
induction of allelic exclusion (24, 25), and a similar
view is held for T cells (26).
A related problem is whether gene segment choice for rearrangement is random or biased, e.g., whether downstream (D-proximal) V segments or upstream J segments are preferably rearranged first. Rearrangement bias may be related to the accessibility of the particular DNA segment to the recombination machinery (27). Data from different systems are conflicting, and largely obscured by differences between the recombination signal sequences of individual gene segments (28, 29, 30, 31, 32, 33, 34), which affect the cleavage efficiency of the V elements by the recombination-activating gene complex (35). Stepwise activation of the heavy chain V gene locus may also play a role (36).
Our approach to the complex problem of lymphocyte repertoire
development is to formulate mathematical and computational models of
the processes of gene rearrangement, receptor-based selection, and the
overall population dynamics of developing lymphocytes and to use these
models to evaluate the proposed explanations for experimental
observations. The first study in this series (37) dealt
with the apparent contradiction between continuous rearrangement in
the Ig light chain and the concept of allelic exclusion, and showed
that the two may be reconciled by a certain degree of order in gene
segment choice of rearrangement and by the fact that the number of
light chain rearrangement attempts a developing B cell is allowed to
perform is limited to two to three attempts. The same characteristics
also served to give a full explanation of the observed
:
ratio
among mature B lymphocytes. The second study (38) dealt
with T lymphocyte development and showed how the observed fraction of T
cells containing productive TCR
rearrangements on both alleles can
be explained by the relative permissiveness of thymic positive
selection, acting on cells with a much higher potential for secondary
rearrangements than that of B lymphocytes.
This paper uses similar models to address the following questions
(which are explained in more detail in the following
paragraphs). (1) Is the feedback model sufficient to explain
heavy chain allelic exclusion? (2) Is heavy chain rearrangement
simultaneous or ordered (first one allele, then the other)? (3) Is
light chain rearrangement ordered (first one allele, then the other),
or can successive rearrangements be made on either light chain allele
at random? (4) What determines the usage of light chain J segments in
rearrangement? And, finally, what is the degree of allelic exclusion in
light chains? We use the first (to our knowledge) computational
model of Ig gene rearrangement that accounts for both heavy and light
chain gene rearrangement and for all segments of all Ig genes
(39). The model and simulation methodology are described
in detail below.
| Heavy chain |
|---|
|
|
|---|
If we denote the probability for a rearrangement to be productive by
Pproduct, and the probability that a heavy chain
pairs well with the surrogate light chain by
Ppairsurr, then the probability of success in
the first rearrangement attempt is Pproduct x
Ppairsurr. Failure in rearrangement of the first
allele or failure in the pairing of the resulting chain with the
surrogate light chain would preclude this rapid positive selection,
enabling the cell to complete the rearrangement on the second allele.
(In the heavy chain, only one rearrangement attempt is possible on each
allele, because rearrangement deletes all D segments except for the one
participating in the rearrangement process.) This model, which assumes
that the two alleles are rearranged one after the other with sufficient
time for selection between completion of rearrangement of the first
allele to that of the second, can be referred to as the ordered model
of heavy chain gene rearrangement (Fig. 1
A). It may be contrasted with
a synchronous model, in which rearrangement on the two alleles proceeds
in parallel (Fig. 1
B), or with a more likely hybrid model in
which a cell has a certain probability of performing the two
rearrangements in parallel. In this case, eventual testing will be
based on the status of both alleles, and the probability that a cell
passes the test becomes P(pass due to either allele) =
1 - P (fail on both alleles) = 1 - (1
- Ppairsurr)2 =
2Ppairsurr - Ppairsurr2.
|
45% and that
the fraction of mature B cells with productive rearrangements on both
heavy chain alleles will be
9%. Even taking into account the
existence of V pseudogenes does not reduce this predicted value by more
than a factor of 2. In some of the latter cells, the proper light
chain, which will later be rearranged and expressed by the cell, may
pair well with both heavy chains, resulting in phenotypic heavy chain
allelic exclusion in most (23) but not all cells. Contrary to the latter prediction, however, only about one in 103104 mature splenic B lymphocytes were found to express two heavy chains in the cytoplasm (23, 44). The same level of allelic exclusion was found even in mice triallelic for the Ig heavy chain locus (9). Hence, either there is an active mechanism preventing the simultaneous expression of two heavy chain alleles in most cells or there is some additional selection process, possibly in the peripheral lymphoid tissues, favoring cells that express only one heavy chain allele (23), as has been shown for B cells bearing two complete transgenic receptors (45).
Kitamura and Rajewsky (23) have also found that when one
of the two heavy chain alleles has a disrupted µ membrane exon, such
that it cannot be expressed on the cell membrane and transduce a
selecting signal, the fraction of heavy chain double-productives
(HCDPs) is larger than expected for this situation. The
double-productive (DP) fraction in newly generated B lymphocytes in the
bone marrow was expected to be 12% but was much higher, between 20 and
25%. This was explained as an artifact. However, as we show in
Results this result could be completely accounted for
by the inability of the disrupted µ chain to be expressed on the cell
surface with the surrogate light chain, which creates a situation
equivalent to that of semisynchronous rearrangements. In this regard,
it is interesting that
1% of human T cells express two TCR
chains on their surface (46). Yet, in this case, as with
murine peripheral B cells, the effects of peripheral selection cannot
be separated from those of putative allelic exclusion mechanisms.
Thus, the following two questions have not as yet been completely answered. 1) Is the rapid selection (feedback) model sufficient to explain heavy chain allelic exclusion, or must other mechanisms, such as competition between two chains for light chain binding (16) or peripheral preference of single-receptor cells, be evoked to explain peripheral B cell allelic exclusion? 2) Does heavy chain rearrangement proceed simultaneously on both alleles or in an ordered manner (i.e., first one allele is rearranged, then the other)? Additionally, the exact value of Ppairsurr, the probability that a heavy chain pairs well with the surrogate light chain, has not been sufficiently established. It has been estimated as one-half (31, 43), but this value was based on the pairing of only 14 of 33 heavy chains with the surrogate light chain, which gives about 0.4. We set out in this study to examine the degree of sequentiality or synchronicity in heavy chain gene rearrangement and to make some progress towards the evaluation of Ppairsurr. We have dealt with this issue by extending our previous computational model (37) to include all gene rearrangements on the heavy chain (which has not been dealt with at all in our previous studies) as well as the light chain (39). Below, we calculate the predicted fraction of DPs in newly generated B cells (if there is no selection against DPs, and no active mechanism silencing one heavy chain allele), for values of Ppairsurr between 0.1 and 1, and values of Ptestpair, the probability of being tested for heavy chain functionality after the first rearrangement, between zero (a strictly synchronous model) and 1 (a strictly ordered model). Comparisons of the results of our calculations to experimental data enable us to draw several conclusions concerning heavy chain rearrangement order and allelic exclusion.
| Light chain |
|---|
|
|
|---|
chains (reviewed in Ref. 38).
Experimental studies have in many cases used mice transgenic for
autoreactive receptors, which shed light on tolerance mechanisms but do
not give an estimate of the degree of allelic inclusion in normal
situations (see, e.g., Refs. 16 and 47). To
approach this issue, one must address two relevant subquestions: 1) Is
light chain rearrangement ordered (first one allele, then the other) or
can successive rearrangements be made on either light chain allele at
random? 2) What determines the usage of light chain J segments in
rearrangement? In the present study, we directly address light chain
allelic inclusion by using the first computational model which
addresses and enumerates B cells maturing with two productively
rearranged light chain genes (39).
An older study (37) dealt only with light chain gene
rearrangement, taking into account only J segments because they are the
limiting factor in Ig light chains. Results of simulations,
incorporating various assumptions on the probabilistic choices cells
make during rearrangement, were compared with published data on the
:
ratio in mature B cells, on the fractions of mature B cells
with rearrangements on one or two alleles, and on the relative
frequencies of J
segments in these rearrangements. These comparisons
led to the following conclusions. First, to explain the observed
:
ratio of
20:1 in mature murine B cells, it is necessary to
assume both a preference of choosing
over
in rearrangement and
a certain probability of cell death after each nonproductive
rearrangement. This death probability limits the number of
rearrangement attempts a cell may go through, which was suggested as an
explanation for light chain allelic exclusion. Second, a bias toward
using upstream vs downstream J
segments in rearrangement was
indicated. Although this seems to allow the cell to perform a larger
number of rearrangement attempts before exhausting J
segments on
both alleles (relative to the random case), the probability of death
after each rearrangement failure does not allow the cell to maximize
the usage of its pool of eight J
and four J
segments. Third, the
possibility that the cell prefers to remain on the same allele was
examined. The conclusion supporting such a preference was later shown
to be incorrect (39, 48) and was not supported by
experimental data (49) or by our studies on TCR gene
rearrangement (38).
Although the above study evaluated many of the parameters that play a
role in determining the fate of a developing B lymphocyte, it was not
conclusive on the point of light chain allelic exclusion for several
reasons: 1) allelic exclusion was built into the older simulation
(cells with two productive light chain rearrangements were killed by
default), and hence the degree of allelic inclusion as function of the
parameters could not be evaluated; 2) only the two extreme cases of
J
bias were studied, either no bias at all or an almost absolute
sequentiality in segment usage, although the actual magnitude of this
bias probably lies somewhere between the two extremes; 3) the
contribution of V segments was not considered. Although it is obvious
that the numbers of J segments is much more limiting than the much
higher number of V segments, it remained to be proved that V segment
choice does not limit the rearrangement process, and it remained to be
seen whether there are biases in V segment choice
(28, 29, 30, 31, 32, 33).
Given that many new data have recently become available (Ref. 50 ; reviewed in Refs. 15, 16, 17, 18, 19, 20, 21, 22, 23, 24), we set out in our recent and present studies to clarify whether the extent of allelic exclusion in the BCR light chain can or cannot be explained by probabilistic factors alone. Our approach is to simulate gene rearrangement and the subsequent selection of developing B cells under various values of the underlying probabilities and generate the predicted composition of the mature B cell repertoire under each set of probabilities. Wherever experimental data are available, we compare our results with these data and draw conclusions on the underlying mechanisms. Where data are not available, our results point at the parameters that are worth measuring.
| Methods: Simulation and Parameters |
|---|
|
|
|---|
segment usage in
-expressing cells, have reached their
conclusions based on very small numbers of cells (e.g., Refs.
15, 16, 17, 18, 19, 20, 21, 22, 23). In contrast, the simulations in our previous
studies have shown that these repertoire characteristics are highly
variable between samples of small numbers of cells (37);
if any repertoire characteristic is plotted against the number of cells
(or heavy chain clones) simulated, an order of 104 or even
105 cells must be simulated to obtain a reliable result. In
our studies, simulation results (e.g., the
:
ratio) were always
plotted as function of the number of cells or the number of lineages
simulated to find the order of magnitude of cell numbers for which the
results have reached a fixed point (other than a small amount of noise
due to the inherent stochasticity). Numbers of cells of the order of
105 or numbers of clones of the order of 103
were usually adequate (Fig. 2
|
-expressing cells with productive
rearrangements on both
alleles. On the other hand, this model
enables us to evaluate the statistical reliability of small sets of
experimental data by simulating small rather than large numbers of
cells and calculating the mean and SD of several repeats of the
simulation. We can thus deduce the ranges of simulation parameters that
give results sufficiently close to the experimental data. The results
shown in this paper are, in most cases, means and SDs of 5
simulations of 500 cells each (of which usually between 50 and 100
cells survive and mature, depending on parameter values). These results
were usually the same as those obtained from simulations of
105 cells; however, their variability was much higher (see
below).
The following is a detailed description of our computer simulation of
Ig heavy and light chain gene rearrangement and receptor-based
selection, first published in Ref. 39 . In each run of the
simulation, we create many clones of cells and follow their
development. A "cell" is an object, which is characterized by the
set of probabilities determining what the cell will do next (divide,
die, rearrange its heavy or light chain genes and so on). Each cell
contains a "genome" composed of six loci (two heavy chain alleles,
two
and two
light chain alleles). A heavy chain locus is
composed of three arrays of gene segments corresponding to the V, D,
and J segments. A light chain locus is composed of two arrays of gene
segments representing V and J segments. Each segment is characterized
by the probability that it will be chosen for rearrangement in the next
rearrangement attempt. Deleted segments, or segments currently
rearranged, are thus assigned a rearrangement probability of zero, and
the probabilities for rearrangement of the remaining segments are
normalized accordingly after each rearrangement.
This model is obviously a simplified picture of the actual heavy and
light chain loci, the structures of which are known in great detail,
especially for the murine
locus (51). First, we
neglect the issue of V gene orientation, because the numbers of J
segments, rather than of V segments, are the limiting factor, as will
be shown below. Second, we neglect the existence of pseudogenes. Taking
pseudogenes into account is equivalent to using a value lower than
1/3 for the probability for a rearrangement to be productive.
Reducing this value does not significantly change the results shown
here (data not shown).
The simulation flow for each cell is as follows (Fig. 3
).
|
Cells are born into the simulation having all heavy and light chain genes in the unrearranged (germline) configuration. Probabilities for each segment to be chosen for rearrangement in the first attempt are assigned according to the rearrangement bias (e.g., a bias toward rearranging V-proximal J segments first). If there is no bias, then the probabilities are equal for all segments in a given locus.
Heavy chain rearrangement (Fig. 3A
)
The cell first selects randomly the heavy chain allele to rearrange, with equal probabilities assigned to both alleles. Then, the cell selects the V, D, and J segments that will be rearranged on this allele. This selection is also probabilistic and uses the rearrangement probabilities assigned to each segment, as described above. The next decision is whether the rearrangement was productive; the probability for a rearrangement to be productive is denoted by Pproduct. If the rearrangement is nonproductive, the cell cannot continue rearranging on the same allele. Hence, the cell attempts to rearrange the other allele. If the rearrangements on both alleles were nonproductive, the cell dies, and a new cell is born.
Synchronicity vs order in heavy chain rearrangement (Fig. 3A
)
The parameter Ptestpair determines the probability that, after the first attempt of heavy chain gene rearrangement and if this rearrangement is productive, the cell will test the resulting heavy chain for pairing with the surrogate light chain. If the cell, rather than performing this test after the first rearrangement, first makes a rearrangement attempt on the second allele, then the two rearrangements can be regarded as synchronous. Thus, Ptestpair = 1 corresponds to the standard, strictly ordered model, whereas Ptestpair = 0 corresponds to a fully synchronous model. We have examined intermediate values of this parameter as well (see below).
After a productive heavy chain rearrangement (or two simultaneous rearrangements), the simulation tests the heavy chain(s) for pairing with the surrogate light chain. Success here is, again, a probabilistic event, with a probability of success Ppairsurr per heavy chain, as explained above. If the cell passes this test, its probability for further rearrangement and its probability of death are set to zero, and its probabilities for cell divisions or further differentiation are set to positive values that are also parameters of the program.
If heavy chain rearrangement succeeds, there is a probability that the cell will perform a number of subsequent cell divisions, in which a clone of cells with the same heavy chain rearrangement is created (52). Each cell from this clone is now followed during its subsequent (independent) development. Cell divisions in themselves do not have any effect on the rearrangement process. Nevertheless, we have included cell divisions in our simulation as part of the attempt to create a relatively realistic model. This would enable us to estimate parameters such as the fraction of B cells that actually mature (see below), and the influence of cell divisions, if any, on the variability of the generated repertoire (future work). A cell that has ceased to divide proceeds to the light chain rearrangement stage.
Light chain isotype selection (Fig. 3B
)
At each light chain rearrangement step, the isotype to be
rearranged,
or
, is first selected, according to the selection
probability P
, i.e., the cell chooses
with probability P
, or
with
probability (1 - P
).
Light chain allele selection (Fig. 3B
)
After the isotype has been selected, then one or the other
allele is randomly selected for rearrangement. The cell may either
rearrange the previously rearranged allele or switch to the opposite
allele, depending on the probability parameter
Pswitch. If Pswitch =
0, then the cell rearranges the previously rearranged allele,
provided there are J
(and
V
) segments remaining on it. We call this
situation absolute allele preference. If Pswitch
= 0.5, then the choice of the next allele to be rearranged is
completely random (no allele preference), with equal probabilities to
choose either allele, and independent of previous rearrangements. If a
cell has run out of J
segments on one allele,
then it automatically switches to the other allele. If a cell is
entirely out of J
segments on both alleles,
then it switches to
rearrangement, and vice versa, until all
segments are exhausted. The cell dies if there are no more light chain
segments available for rearrangement.
The possibility of simultaneous light chain rearrangements on different alleles and/or isotypes is not included in the simulations; evidence suggests that this is a rare event (17, 42), and inclusion of these possibilities in the simulations would complicate them much further. The implications of this choice on the interpretation of our results are discussed below.
Rearrangement of a
allele
Once a
allele has been selected, one
V
segment and one J
segment are chosen for rearrangement, using the probabilities assigned
to each segment. Each rearrangement is followed by renormalization of
rearrangement probabilities of the allele in question, i.e., a
rearranged J
segment is deleted (as are all
unrearranged J
segments 5' relative to the
rearranged segment) and cannot be chosen again by the program. Similar
deletions occur for V
segments. We thus neglect the
possibility of inversional rearrangements, in spite of the high
fraction of V genes placed in the inversional orientation observed in
the murine
locus (51), under the assumption that the
only difference between deletional and inversional rearrangements is
that the latter do not delete the intermediate V gene segments. Given
that the number of V segments is not limiting (an assumption that we
have verified in the simulations; see below), neglecting inversional
rearrangements does not have any significant effect on our results and
conclusions.
Rearrangement of a
allele
If
is selected, then one of the two
alleles is chosen at
random, with equal probabilities for the two alleles. V
and J
segments on the current allele are chosen at
random. Failure in
rearrangement leads to deletion of the
rearranged segment only and has no effect on any other segments.
Light chain testing and negative selection
Once a V-J light chain rearrangement is made, the program determines whether the rearrangement is productive (V and J are in the same reading frame), with a probability Pproduct. If the rearrangement is productive, the program determines whether the resulting light chain can pair with the existing heavy chain, with a probability Ppairlight. If so, the program determines whether the resulting BCR is autoreactive (anti-self), with a probability Pas. Rearrangement is repeated until a productive, heavy-light chain-matched, nonautoreactive BCR is produced, or until the cell dies (see below).
Cell fate after selection
Results of rearrangement are classified into one of three categories:
1. Cells containing an anti-self rearrangement. These cells are assigned a high death probability, denoted by Pdas. If, however, a cell does not die after such a rearrangement (an event that occurs with a probability (1 - Pdas)), it may try another rearrangement (receptor editing; Refs. 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55). In reality, the fate of these cells may be affected by the site of antigen encounter (56), the number of such encounters (57), receptor expression levels (58, 59) and the affinity of the receptor for the negatively selecting Ag (47).
2. Cells containing only out of frame V-J joins, heavy-light-mismatched heavy-light chain pairs, or germline light chain alleles. These cells are assigned a moderate death probability, PDL. If such cells do not die, they also continue rearranging their light chain genes.
3. Cells that have arrived at a productive, heavy-light matched, nonautoreactive rearrangement are allowed to mature. The simulation tallies the composition of the repertoire of those cells that have been allowed to mature.
The simulation and a users manual are available from the authors on request (the current version is a C program, which can run on any computer equipped with a C compiler. We used a Unix computer (Silicon Graphics).
Parameters
Table I
summarizes the default
parameter values used in our simulations. Parameter definitions, and
justifications of the choice of default values in case the parameters
were not varied in the simulations, are as follows.
|
2. Pproduct: The probability for a V-J rearrangement to be productive is one-third, because this is the probability that V will be joined to J in the correct reading frame. Incidentally, Pproduct = 1/3 for V-D-J rearrangements as well, because in most cases the D segment can be read in all three reading frames. We neglect here the effect of the existence of pseudogenes (51), which would only slightly decrease Pproduct.
3. Ptestpair: The probability to perform a test of pairing with the surrogate light chain after the first heavy chain rearrangement, which is one of the parameters under study here (see below). Ptestpair = 0 corresponds to the fully synchronized model. Ptestpair = 1 corresponds to the fully ordered model. Intermediate values were used as well.
4. Ppairsurr: The probability of pairing of the heavy chain with the surrogate light chain (31, 43). This parameter is studied below in detail.
5. Ppairlight: Probability of a light
chain to pair with the heavy chain. Ppairlight
was chosen to be 0.8, because previous studies (60) have
shown that this is the minimum value required to give a ratio of
:
> 2; we confirmed this choice in our previous study
(39), where this value gave
+/0
0.68, confirming past choices (Refs. 37 and
60 and data not shown).
6. PDH: The probability of cell death after an unsuccessful heavy chain gene rearrangement (this parameter is relevant only in the strictly ordered model). Death due to failed rearrangements may result from unresolved double-stranded DNA breaks.
7. PDL: The probability of cell death
after an unsuccessful light chain rearrangement or failure in light
chain pairing with the heavy chain. The default value was found by
comparing simulation results to experimental data on the fraction of
cells with only one
allele rearranged
(
+/0), which has been found to be
68%
(7, 61). In our previous studies, this fraction was
obtained for
-expressing B cells only with
PDL = 0.2, with no significant dependence
on the values of P
or
J
bias (Ref. 39 and data not
shown). Hence, we chose PDL = 0.2 as the
default value for the rest of the study.
8. Pdiv: The probability of divisions after heavy chain selection. This probability is defined per division; i.e., after a successful heavy chain gene rearrangement or after a cell division, the cell will divide again with a probability Pdiv or stop dividing, differentiate further, and rearrange its light chain genes, with a probability (1 - Pdiv). The default value was taken from our previous study (37).
9. P
: The probability of choosing
over
light chain for rearrangement. The default value was
determined by comparing simulation results to experimental data on the
ratio between cells expressing
and cells expressing
light
chains. The observed minimal
:
ratio in murine B lymphocytes is
between 5:1 and 10:1 in immature B cells and 10:120:1 in peripheral
mature B cells (62, 63). We previously found that high
values of this parameter (P
0.95
are necessary to obtain high
:
ratios in our simulations. In the
current simulations, with PDL = 0.2, the
value of P
= 0.975 (39)
gave a
:
ratio of >10. This reconfirmed our conclusion that a
strong preference for choice of
over
for a light chain gene
rearrangement is necessary to explain the observed
:
ratios;
however, it is not necessary to assume that the order of isotypes to be
rearranged is preprogrammed (50, 63, 64).
10. Pswitch: The probability of switching
between light chain
alleles or
alleles.
11. J
bias: The strength of the J segment
bias, that is, the preference of the cell to rearrange V-proximal J
segments. The exact definition is:
![]() |
i) is the probability
to choose J
i for rearrangement (the
simulation takes into account the deletion of already used segments and
intermediate segments). Hence, the smaller the value of
J
bias, the stronger is the bias. In the
previous study, this parameter was either set to
J
bias = 1, which corresponds to a
completely unbiased gene segment choice, or to
J
bias = 0.001, which gives a very
strong, practically absolute, preference toward choice of upstream
segments. Here, we used several intermediate values as well.
12. V
bias: The strength of V segment
bias, defined in a way similar to that of
J
bias, with the bias favoring rearrangement
of J-proximal V segments.
13. Pas: The probability of a cell to be anti-self with its current receptor. The value of Pas has been independently estimated to be about 2/3 by others (65, 66).
14. Pdas: The probability of cell death if the cell was found to be anti-self. The default value was taken from our previous study (37).
The program follows each new cell, or cells produced by cell division after heavy chain pairing with the surrogate light chain, until it either matures (leaves the bone marrow after surviving all selection stages) or dies, and repeats the process for the predetermined number of cells. The program generates as output the distribution of genotypes among the cells that have matured.
| Results |
|---|
|
|
|---|
In this section, we examine the issues of heavy chain allelic exclusion and pairing with the surrogate light chain. The heavy chain composition of the mature B cell repertoire is assumed to be independent of the parameters governing light chain gene rearrangement and selection, for which the default values were used in all the simulations described in this section.
Cells with rearrangements on both heavy chain alleles.
We plotted the fraction of newly generated B cells with two heavy chain
gene rearrangements (HC-DRs), regardless of whether or not the
nonexpressed rearrangement is productive, as a function of
Ptestpair, the probability to undergo heavy
chain selection after the first heavy chain gene rearrangement, and as
function of Ppairsurr, the probability that a
heavy chain pairs well enough with the surrogate light chain to be
recognized as functional by the cell (Fig. 4
A). The fraction of these
"double-rearranged" cells decreases rapidly with
Ptestpair, as expected, because the higher the
value of Ptestpair, the higher is the
probability that the cell will not perform a second rearrangement due
to success in the first attempt. However, the fraction of
"double-rearranged" cells is almost independent of
Ppairsurr, especially for low values of
Ptestpair.
|
The results presented here (and below, unless stated otherwise) are from simulations of 500 cells each. This is the total number of cells simulated, including cells that died during the simulation in addition to those that matured. The fraction of surviving cells depends on the various parameters but usually does not exceed 1020%. Hence, these simulations are equivalent to experiments in terms of the number of cells sampled. Shown are the mean and SD of the results of five simulations for each parameter set. It is evident that the variability between simulations, which we regard as an estimate for the variability between experimental repetitions, is large, but not so much so as to mask the effects of parameter changes.
Cells with productive rearrangements on both heavy chain alleles.
Next, we plotted the fraction of cells with two productive heavy
chain gene rearrangements (HCDPs) as a function of
Ptestpair and Ppairsurr
(Fig. 4
B). Experimental estimates of this fraction vary
between 0.01 and 5%, depending on the measurement method (9, 23). As our simulation results show, even in a rigorously
ordered model, with Ppairsurr values
0.5, the
expected fractions of HCDPs are higher than those measured in
peripheral B cells. To obtain an HCDP fraction below 5%,
Ppairsurr must be higher (almost all heavy
chains pair with the surrogate light chain
(Ppairsurr = 1). The latter case is
unlikely, given that it has already been shown in mice that the pairing
probability is lower than or close to one-half (31, 43).
Moreover, even in a strictly ordered model, which also includes a
probability PDH for cell death after failure in
the first heavy chain gene rearrangement, the fractions of HCDPs are
higher than measured in peripheral B cells (Fig. 4
C).
The fraction of HCDPs in pre-B cells was
7% (40),
higher than most estimates of the same fraction in splenic B cells.
Unless this difference is completely attributable to measurement error,
it may suggest that some additional selection mechanisms act against
cells expressing two heavy chains, even in the cytoplasm, as will be
discussed below. On the other hand, HCDP fractions of
25%, as had
been measured for heavy chain mutant (µMT) mice
(23), can be obtained in our simulations for values of
Ptestpair below 0.5 and values of
Ppairsurr of 0.5 or lower. Hence, the measured
value is neither an artifact nor an indication of a priority for the
targeted allele to be rearranged first, as suggested, but can be
explained as follows. The targeted allele is rearranged first only half
the time, but because the heavy chain produced from this allele cannot
be expressed on the cell surface, in these cases there is no testing of
pairing; i.e., in 50% of the cases, the cells behave as if
rearrangement of the two alleles was synchronous, which is the meaning
of Ptestpair = 0.5.
Light chain
J
usage.
As observed in our previous study (37), if there is no
bias toward upstream J
segments
(J
bias = 1), then the fractions of
alleles with rearrangements to downstream J
segments actually exceed the fractions of
alleles with
rearrangements to upstream J
segments (Fig. 5
). This is due to an "accumulation
effect": there are many more pathways leading to cell maturation with
a rearrangement to J
4 or
J
5 than to J
1 or
J
2 (37). Evidence supporting
this argument was found in human leukemias and myelomas, in which
J
usage in
-expressing cells (i.e., cells
in which several
rearrangements were performed before the cells
resorted to rearranging
genes) was indeed skewed towards
J
4 and J
5
(67). On the other hand, the extreme case of
J
bias = 0.001, which was used in the
previous study, leads to an unrealistic distribution of
J
segment usage among mature cells, with
almost 80% of the alleles having rearrangements to
J
1. We find that
J
bias = 0.5 gives results that are most
similar to experimental observations (63, 68). Our results
thus still support a bias in J
segment choice
for rearrangement, but this bias is finite, so that
J
segment choice need not be deterministic
(as a bias of 0.001 effectively is).
|
usage depended only slightly on
PDL (not shown). Obviously, the higher the value
of PDL, the smaller is the chance that a cell
will perform more than one rearrangement, and hence
J
usage becomes more skewed toward upstream
J
segments. However, this effect is marginal
compared with that of the bias. Similar results were obtained in
simulations of 100,000 cells (not shown).
A cell is allowed, on average, only two rearrangement attempts.
Our results show an inverse correlation between the death probability
PDL and the average number of rearrangements per
cell (Fig. 6
). When
PDL = 0.5, a cell usually manages to
perform only one rearrangement, which is unlikely to be the case.
However, even for very low PDL values, a cell
does not perform, on average, more than four rearrangements. For
PDL = 0.2, which we consider most
plausible, the average is about two rearrangements per cell. Similar
results were obtained in simulations of 105 cells
(39). It is remarkable that even in mice transgenic for
autoreactive receptors, the average number of rearrangements per cell
was only about three (16, 47). This result is hardly
influenced by changes in the values of other parameters, in particular
Ppairlight, Pas, and
Pdas (not shown). Hence at least one of the
mechanisms responsible for allelic exclusion is the fact that the cell
is not given the opportunity to perform too many secondary
rearrangements.
|
rearrangement is only part of the explanation for the
:
ratio.
Rearrangement order within
increases the potential number of
rearrangements the cell can perform and thus decreases the probability
of exhausting the possibilities for
rearrangement and moving on to
rearrangement. At the same time, the low number of rearrangements
per cell means that few cells survive to the point where the
possibilities for
rearrangement are exhausted and
rearrangements are initiated. These combined factors contribute to the
:
ratio by decreasing the chance that a cell will arrive at
rearranging
.
Our results do not support allele preference.
We used Pswitch, the probability of switching
the allele to be rearranged, to reexamine the issue of allele
preference: when Pswitch = 0.5, the choice
between the two alleles in each rearrangement attempt is random;
when Pswitch = 0, the cell continues to
rearrange on the same allele until there are no more
segments for
rearrangement on that allele, and only then does the cell try to
rearrange the other
allele. We ran the simulation with
PDL between 0.01 and 0.4, using several values
of J
bias, and the two values of
Pswitch (Fig. 7
).
For Pswitch = 0, the fraction of cells that
rearranged only one
allele was much higher than the experimentally
observed values of 68% (7, 61). The only exception
occurred for very low values of PDL, which we
consider very unlikely based on the data presented above. Hence, our
results do not support the assumption of allele preference in
rearrangement; rather, the cell may rearrange any
allele it
chooses, and the choice is random. These results did not depend on
variations of other parameter values (not shown) and were similar to
those obtained in simulations of 105 cells
(39).
|
alleles may be as high as
50%.
V segments and V bias.
The results reported above on J
usage and the
average number of rearrangements per cell were qualitatively similar to
those obtained in the J segment only study with similar
parameter values (37), indicating, as expected, that V
segments are not a limiting factor in the rearrangement process. To
further verify that putative biases in V segment choice for
rearrangement are also inconsequential, we introduced a bias in the
choice of V segments toward downstream (J-proximal) V segments,
similarly to the bias introduced for J segments, with values between
1.0 and 2/3 (probability to choose each consecutively more
upstream segment is 2/3 that of the choice probability of the
preceding segment). This change did not result in any significant
alteration of the above results (data not shown).
Finally, all the results presented up to this point were insensitive to the values of the probability of light chain pairing with the heavy chain, Ppairlight, the probability for an expressed BCR to be anti-self, Pas, and the probability for an anti-self cell to be deleted, Pdas, unless otherwise noted.
The fraction of
double expressors: implications for light
chain allelic exclusion/inclusion.
Allelic exclusion, or rather to which extent it is broken, can be
measured directly in the present model (as in the T cell model
(38)) by enumerating the cells that mature with two
productively rearranged light chain alleles and counting how many of
those cells have a potentially autoreactive BCR. This is in contrast to
the previous model of BCR gene rearrangement (37), which
did not directly address the question of allelic inclusion in B cells,
because in that model, cells with two productive rearrangements were
automatically deleted and not allowed to mature. That is, allelic
exclusion was "built into" the model. Hence, we set out in the
present study, where these limitations were not imposed, to check
whether cells with two productively rearranged and expressed light
chains can mature and in what frequency. These simulations were done
with a large number of cells (105) to arrive at more
precise estimates of the small fractions of double-productive
cells.
Our simulations give a fraction of such
double-positive cells
(
DPs) of
45% for most parameter values (Fig. 8
). This is consistent with recent
experimental measurements (69). As expected, there are
more
DPs with Pswitch = 0.5 than in the
case of Pswitch = 0, because switching
means that a productive rearrangement on the previously rearranged
allele may be left undeleted. Similarly, there are more
DPs with
random J
choice, because it increases the
chance that the last J
rearrangement on a
given allele will be to J
5, which may end up
being undeleted (at least in our model, which neglects recombination
signal deletion of the whole locus) even if the other allele ends up
being rearranged last. Most of these cases are due to a productive
light chain rearrangement generating a chain that did not pair well
with the heavy chain, but there are always a few cases of "escape"
of cells with one anti-self receptor. Obviously, the proportion of
these cases increases with the probability for an expressed BCR to be
anti-self, Pas, and decreases with the
probability for an anti-self cell to be deleted,
Pdas, as shown here. Interestingly, the fraction
of
DPs is highly sensitive to Ppairlight,
with the actual fractions of "escaping" anti-self cells
increasing when Ppairlight is increased. This
may be due to the fact that with higher values of
Ppairlight, more cells undergo the negative
selection test more than once and have a larger chance of escaping
negative selection.
|
| Discussion |
|---|
|
|
|---|
In this study, we have also addressed questions concerning order vs
stochasticity in light chain rearrangement, namely, whether light chain
rearrangement is ordered (first one allele is rearranged, then the
other) or can successive rearrangements be made on either light chain
allele at random; what determines the usage of light chain J segments
in the rearrangement process; and what is the degree of allelic
exclusion in
light chains in normal (rather than transgenic) B
lymphocytes. Concerning simultaneous vs serial light chain
rearrangements, the evidence suggests that in most cases, light chain
genes differ in their DNA methylation and histone acetylation, and
hence replication status, before rearrangement, and thus rearrangement
may be initiated at different times on the two alleles. However,
simultaneous light chain rearrangements on two alleles is a possibility
that has not yet been ruled out. This is obviously related to the
question of whether cells switch the rearranged allele between
rearrangements. If cells cannot switch (Pswitch
= 0), then simultaneous light chain rearrangements cannot occur.
If cells can switch, then simultaneous light chain rearrangements might
occur, which might increase the fractions of double-rearranged or DP
cells in our simulations. Because we have not included this possibility
in the current model due to the added complexity this would cause, our
estimate for PDL is a minimum estimate (because
simultaneous light chain rearrangements will result in lower fractions
of
-expressing cells with only one
allele rearranged).
Because our results on J
usage are not very
sensitive to the value of PDL, our conclusions
in this regard will be unchanged. Our estimates of the fractions of
-expressing cells with productive
rearrangements on both alleles
are, obviously, also minimal estimates, given that these fractions
would be larger if then simultaneous light chain rearrangements had
occurred. This only strengthens the point we make here, i.e., that if
allelic exclusion is only the result of the low probability for
rearrangement and selection success, then the appearance of
-expressing cells with productive
rearrangements on both alleles
is inevitable.
Our results show an inverse correlation between the death probability PDL and the average number of rearrangements per cell, with PDL = 0.2, the most plausible value, giving about two rearrangements per cell. Hence, at least one of the mechanisms responsible for allelic exclusion is the fact that the cell is not allowed the opportunity to perform too many secondary rearrangements. A probability of death of 0.2 per rearrangement failure is quite high and thus provides the explanation for the considerable loss of cells throughout the process of B lymphocyte development. Additionally, we found that the most likely value probability of a light chain pairing with the heavy chain of the cell, Ppairlight, is 0.8. The implication is that most light chains can pair well with most heavy chains, although success in pairing is not always guaranteed. It will be interesting to see whether future experimental measurements confirm this predicted value.
The results presented here support a bias in
J
segment choice for rearrangement, as
previously suggested (37); however, this bias is finite,
so that J
segment choice need not be
deterministic as previously proposed. Conversely, our results do not
support the assumption of allele preference in rearrangement; rather,
the cell may rearrange any
allele it chooses, and the choice is
random. This conclusion is supported by recent experimental studies
(49). This might seem to be in conflict with the recent
demonstrations that features such as DNA demethylation
(17) and histone acetylation (18, 19) mark or
possibly even determine the identity of the allele which is rearranged
first. However, there is no conflict here, because, although
demethylation of the
locus may indeed determine the location of the
first rearrangement, it does not seem to directly activate V(D)J
recombination (22), and when the process is activated,
both alleles may very quickly become subject to rearrangement.
Experimental studies are divided on whether there is any bias in heavy or light chain V segment choice (28, 29, 30, 31, 32, 33, 34). If there is a bias in V segment choice, it is probably masked to a large extent by the much larger differences in rear