In optically stimulated luminescence (OSL) dating, statistical age models for equivalent dose (De) distributions are routinely estimated using the maximum likelihood estimation (MLE) method. In this study, a Markov chain Monte Carlo (MCMC) method was used to analyze statistical age models, including the central age model (CAM), the minimum age model (MAM), the maximum age model (MXAM),
Keywords
- OSL dating
- statistical age models
- equivalent dose
- MCMC sampling
In optically stimulated luminescence (OSL) dating, the assessment of certain parameters of interest (
Maximum likelihood estimation (MLE), which provides point estimates of parameters and associated standard errors, is routinely adopted to estimate parameters used in these statistical age models by maximizing the logged-likelihood function (Galbraith
In contrast, the Markov chain Monte Carlo (MCMC) methods construct a Markov chain while converging to the desired sampling distribution by starting from random initials and seeking to characterize posterior distributions for parameters of interest (Gelman
Such MCMC methods have increasingly been employed to analyze OSL data for Bayesian inferences (e.g., Millard, 2004; Sivia
The first objective of this study is to estimate statistical model parameters (such as the CAM, the MAM,
Bayesian models utilize stratigraphic relationships between samples to increase the precision of dates and to provide a sophisticated protocol to combine multiple age estimates into a meaningful chronological framework (Ramsey, 1995, 2008). By incorporating the stratigraphic principle that shallower sediments in the stratum sequence are younger than deeper sediments into age-depth models, Bayesian inference can reduce the uncertainty of dates wherein age probability distributions overlap. Age-depth modelling using Bayesian inference was initially developed to analyze radiocarbon dating data (Ramsey, 1995; Bayliss and Ramsey, 2004). Over the past decade, the application of Bayesian inference to model depth-age relationships using luminescence data have also gained widespread usage (e.g., Rhodes
During the routine application of statistical age models, such as the CAM and MAM, characteristic De values of individual samples are estimated separately (e.g., Schmidt
The second objective of this study is to apply statistical age models to refine age-depth relationships using De datasets from multiple luminescence samples employing an alternative Bayesian inference model. In contrast to the method by Cunningham and Wallinga (2012), in this study characteristic De values are sampled directly from the probability density function that describes the statistical model (see the following section for more detail) through MCMC sampling. The joint probability density function for multiple OSL ages is constructed by utilizing a combination of statistical models and measured De and dose rate datasets. Both the CAM and MAM are used to flexibly analyze fully and partially bleached OSL dates, respectively, using MCMC in order to combine different types of statistical age models in a mathematically rigorous, geologically reasonable, and philosophically satisfying manner. The performance of the model is then tested on measured and simulated De distributions.
Galbraith
Optimal parameters can be obtained using either an iterative numeric procedure or by direct numerical maximization of the joint-likelihood determined from paired observations
One of the most widely used statistical models for tackling partially-bleached samples is the MAM (Galbraith
Galbraith
Three quantities (
The maximum age model (MXAM) is an analogue of the MAM and is firstly applied by Olley
The characteristic dose of the MXAM3 can be estimated according to the MAM3 by using a “mirror image” of the original De distribution as an input (Olley
Larger De estimate typically have larger absolute standard errors; namely, larger De estimates will vary more (Arnold and Roberts, 2009). Consequently, the frequency distribution of De estimates will tend to be positively skewed. The Lognormal distribution provides a simple model to describe variables wherein their standard deviations increase in proportion to their means. It is for this reason that the aforementioned statistical age models use log-transformed De datasets as inputs. Datasets in log-scale are obtained by calculating natural logarithms of De estimates and transforming their absolute standard errors into RSEs. This is because the RSE of a De estimate approximates the standard error of a De estimate on a natural logarithm scale (Galbraith and Roberts, 2012). As a result, these statistical age models are unsuitable for measured zero or negative De values that are consistent with zero or positive De values within a level of uncertainty. However, such types of De distribution are frequently encountered for either very young or modern samples. As a consequence, Arnold
The statistical age models described in Section 2 are in fact equivalent dose models. This is because they estimate the characteristic De instead of the burial age (e.g., Guérin
Each parameter is assigned a prior distribution that represents the prior knowledge of it before some evidence is considered. For
Galbraith and Roberts (2012) reiterated that an estimate of additional uncertainty (
In this section, an age-depth model to combine De datasets from multiple samples with stratigraphic constraints was considered by using the statistical age models described in Section 2. Although Bayesian age-depth models are expected to improve the precision of dates, such precision may be misleading if potential systematic uncertainties have not been included by the individual samples (Zeeden
This study adopted the method described by Combès and Philippe (2017) to build the relationship between the characteristic dose
For parameters
The objective being to obtain sampling distributions of parameters p(θ|D) based on the joint-likelihood p(
The input, likelihood, transformation, and distribution used to implement MCMC sampling for individual samples using the CAM and the MAM3 are illustrated in
Fig. 1
Statements used to conduct MCMC sampling according to the CAM and the MAM3 (log-scale) using De sets from individual samples. The MXAM3 can be applied in a similar manner to the MAM3 by changing the likelihood function.

Samples from the same sequence may substantially vary in De distributions due to differences in provenance, depositional environment, microdosimetry,
Fig. 2
Statements used for conducting MCMC simulation simultaneously according to different age models (log-scale) using De sets from three samples whose ages were constrained such that a1<a2<a3. The three samples were analyzed using the MAM3, CAM, and MXAM3, respectively. The protocol can be easily adapted for MCMC sampling without order constrains for burial ages.

First, the validity of the MCMC sampling protocol (as demonstrated in
Fig. 3
Measured multi-grain De distributions for four aeolian samples from Gulang County at the southern margin of the Tengger Desert, China (Peng et al., 2016b). N denotes the number of measured aliquots. Note that in Peng et al. (2016b) an additional uncertainty of σb=5% was added in the quadrature to the RSEs of the measured De values to account for sources of errors that were not considered in the measurements. However, no additional uncertainty was added to the De distributions shown here.

Comparisons of burial dose and age estimated from MLE and MCMC for the various statistical age models using De sets from four measured aeolian samples. Quantities estimated using the MCMC sampling protocol shown in Fig. 1 are marked in bold. Quantities estimated using the MLE are inside parentheses.
GL2-1 | 3.26 ± 0.25 | ||||||
GL2-2 | 2.84 ± 0.21 | ||||||
GL2-3 | 3.23 ± 0.23 | ||||||
GL2-4 | 3.40 ± 0.24 |
These four samples were also analyzed simultaneously using the MCMC sampling protocol (as demonstrated in
Fig. 4
Posterior distributions of burial ages obtained from the CAM for the four aeolian samples taken from the same sedimentary section obtained using the MCMC sampling protocol, as shown in Fig. 2. The blue-coloured density plots denote results obtained by constraining the burial ages in ascending order. The grey-coloured density plots denote results obtained by imposing no constraints on the order of burial ages. The value inside parentheses denotes the calculated RSD of burial age.

A summary of burial dose and age estimated from the CAM for De sets from four measured aeolian samples using the MCMC sampling protocol shown in Fig. 2 without and with order constraints. The systematic error for the dose rate measurement shared by all samples was set to σdc=0.1.
GL2-1 | 3.26 ± 0.25 | 34.55 ± 0.91 | 10.75 ± 0.96 | 34.23 ± 0.86 | 9.72 ± 0.52 |
GL2-2 | 2.84 ± 0.21 | 32.71 ± 1.00 | 11.68 ± 1.06 | 32.19 ± 0.92 | 10.28 ± 0.49 |
GL2-3 | 3.23 ± 0.23 | 30.58 ± 1.04 | 9.59 ± 0.85 | 31.03 ± 1.00 | 10.59 ± 0.52 |
GL2-4 | 3.40 ± 0.24 | 32.24 ± 1.29 | 9.60 ± 0.85 | 33.13 ± 1.26 | 11.13 ± 0.67 |
This section shows simulation results of a series of De distributions with known burial ages to further verify the performance of the MCMC sampling protocols. The first three samples (#1–#3) were assumed to be partially-bleached, the next three samples (#4–#6) fully-bleached, and the last three samples (#7–#9) contain fully-bleached grains that have been subsequently mixed with younger, intrusive grains. These samples were assumed to be collected from the same sedimentary sequence but at different depths. Their “true” burial doses ranged from 18 to 42 Gy with a step of 3 Gy, and their “true” burial ages ranged from 7 to 11 ka with a step of 0.5 ka. The “true” average dose rate can be obtained by dividing the “true” burial dose by the “true” burial age of a sample. The random error arising from dose rate measurement was assumed to be 7%, in order to simulate a realistic annual dose rate. Namely, the “true” dose rates are noised using a Normal distribution with a mean equivalent to the “true” average dose rate and a relative standard deviation (RSD) equal to 7%. The generated random dose rates were used as the measured dose rates when determining burial ages using MLE and MCMC. The systematic error representing the error in the calibration of the dose rate measurement device shared by all samples was assumed to be zero (
Lognormal distributions were used to model the natural dispersion in dose distributions arising from dose rate variation with an RSD of 5% (
Fig. 5
Distribution of OSL sensitivity for the 127 grains of a sample taken from Lake Mungo, Australia. The red line denotes the probability density curve obtained by fitting the OSL sensitivity data using a Gamma distribution. The fitted Gamma distribution has a shape parameter of α=0.565 and a rate parameter of β=0.191. Q[x%] denotes the x% sample quantile of OSL sensitivity.

Fig. 6
Simulated De distributions of nine samples from a sedimentary sequence. Each De set contains 100 De values (and associated standard errors). The shaded bars in each plot indicate the burial dose within 2σ error. The first three samples (#1–#3) are partially-bleached, the next three samples (#4–#6) are fully-bleached, and the last three samples (#7–#9) contain fully-bleached grains that have been subsequently mixed with younger, intrusive grains. The nine samples are assumed to be collected from the same section but at different depths, and their estimated burial dose and age are summarized in Fig. 7 and Table 3.

The simulated De sets were analyzed individually using MLE, analyzed simultaneously using MCMC without order constraints, and analyzed simultaneously using MCMC with order constraints. The first, next, and last three samples were analyzed using the MAM3, CAM, and MXAM3, respectively. To facilitate the model in the analysis of the MAM3 and the MXAM3, a
Fig. 7
Posterior distributions of burial ages of the nine simulated samples from a sedimentary section obtained using the MCMC sampling protocol, as shown in Fig. 2. Samples #1–#3, #4–#6, and #7–#9 were analyzed using the MAM3, CAM, and MXAM3, respectively. The blue-coloured density plots are results obtained by constraining burial ages in ascending order. The grey-coloured density plots are results obtained by imposing no constraints on the order of burial ages. The first and second values inside parentheses denote the calculated RSD and RE of the burial age, respectively.

A summary of burial dose and age estimated from various statistical age models for nine simulated De sets taken from the same sedimentary sequence. Results obtained using MLE and results obtained using the MCMC sampling protocol as shown in Fig. 2 without and with order constraints are presented. The systematic error on the dose rate measurement shared by all simulated samples was assumed to be σdc=0.
#1 | MAM3 | 7 | 18 | 18.35 ± 1.17 | 7.95 ± 0.75 | 17.85 ± 1.17 | 7.82 ± 0.76 | 16.58 ± 1.02 | 6.80 ± 0.44 |
#2 | 7.5 | 21 | 21.16 ± 0.63 | 6.95 ± 0.53 | 20.96 ± 0.79 | 6.96 ± 0.57 | 21.17 ± 0.65 | 7.23 ± 0.40 | |
#3 | 8 | 24 | 24.47 ± 1.22 | 8.41 ± 0.72 | 23.64 ± 1.57 | 8.22 ± 0.81 | 23.25 ± 1.43 | 7.84 ± 0.42 | |
#4 | CAM | 8.5 | 27 | 26.91 ± 0.33 | 8.24 ± 0.59 | 26.92 ± 0.34 | 8.32 ± 0.61 | 26.93 ± 0.33 | 8.31 ± 0.39 |
#5 | 9 | 30 | 30.40 ± 0.38 | 8.50 ± 0.60 | 30.41 ± 0.39 | 8.59 ± 0.63 | 30.44 ± 0.38 | 8.83 ± 0.42 | |
#6 | 9.5 | 33 | 32.79 ± 0.46 | 10.59 ± 0.76 | 32.81 ± 0.47 | 10.71 ± 0.79 | 32.66 ± 0.45 | 9.62 ± 0.42 | |
#7 | MXAM3 | 10 | 36 | 35.77 ± 1.61 | 9.13 ± 0.76 | 36.43 ± 1.86 | 9.39 ± 0.84 | 37.18 ± 1.81 | 9.95 ± 0.45 |
#8 | 10.5 | 39 | 39.26 ± 1.37 | 9.32 ± 0.73 | 39.70 ± 1.69 | 9.53 ± 0.79 | 40.62 ± 1.87 | 10.39 ± 0.54 | |
#9 | 11 | 42 | 43.65 ± 2.55 | 10.63 ± 0.97 | 44.61 ± 2.52 | 10.96 ± 1.00 | 45.43 ± 2.53 | 11.48 ± 0.85 |
Although a properly constructed MCMC protocol enables one to draw samples successively from a convergent Markov chain, deciding the point to terminate sampling is sometimes difficult as well as the point to confidently conclude when the algorithm has converged to the desired stationary distribution. One simple and direct way to inspect the convergence of a chain is to check the degree of mixing of the chain by using a trace plot (
Fig. 8
Convergence diagnostics for posterior samples of the CAM burial age from sample GL2-1. The plots were automatically generated using function

The Gelman-Rubin convergence diagnostic (Gelman and Rubin, 1992) is one of the most widely used methods for monitoring the convergence of MCMC outputs. Several parallel chains are simulated using various initial states when applying this method, and a shrink factor is used as a measurement of the difference between within-chain and between-chain variance (which is similar to the analysis of variance). A shrink factor value equal to or less than 1 indicates adequate convergence, while a factor value significantly above 1 indicates a lack of convergence. Results from Gelman-Rubin convergence diagnostics for posterior samples of CAM burial ages from sample GL2-1 is shown in
A summary of Gelman-Rubin convergence diagnostics for measured and simulated samples obtained using the MCMC sampling protocol, as shown in Fig. 2 with order constraints. n_eff is the effective sample size, while Rhat (i.e., the shrink factor) is a statistic measure of the ratio of the average variance of samples within each chain to the variance of the pooled samples across chains. If all chains are at equilibrium, the Rhat will be 1. If these chains have not converged to a common distribution, the Rhat statistic will be greater than 1.
GL2-1 | 16000 | 0.9998 | 5541.74 | 1.0002 | #1 | 13865.37 | 1.0001 | 9987.93 | 1.0000 |
GL2-2 | 16000 | 0.9999 | 8220.18 | 1.0003 | #2 | 16000 | 1.0001 | 11473.55 | 1.0000 |
GL2-3 | 16000 | 0.9997 | 7595.60 | 1.0002 | #3 | 9405.95 | 1.0004 | 12466.84 | 1.0003 |
GL2-4 | 16000 | 0.9998 | 8864.01 | 1.0003 | #4 | 16000 | 1.0003 | 13161.07 | 0.9999 |
#5 | 16000 | 0.9999 | 12830.44 | 0.9999 | |||||
#6 | 16000 | 0.9999 | 14048.98 | 0.9998 | |||||
#7 | 16000 | 0.9999 | 13073.69 | 0.9998 | |||||
#8 | 7780.82 | 1.0003 | 12665.56 | 1.0001 | |||||
#9 | 16000 | 0.9999 | 16000 | 0.9999 |
This study adopted an MCMC method to obtain sampling distributions on parameters of interest in statistical age models, including the MAM3, CAM, and MXAM3, using De distributions from individual samples and multiple samples with order constraints. It demonstrated that the estimates obtained by MLE and MCMC from measured samples were consistent within errors for various age models (
For all MCMC simulations, initial states did not need to be specified and were generated automatically using random number generators, suggesting that the MCMC sampling method employed in this study was not sensitive to the choice of the initial state, and that the algorithm itself was free from the influence of local modes and was able to converge to the desired distribution with a finite number of iterations (
When applying the MCMC sampling method to multiple samples with stratigraphic constraints using statistical age models, samples are not analyzed on a standalone basis; rather, they are combined simultaneously in a single model implementation so that between-sample age-depth relationships can also be included. This shows that the burial age of measured samples was internally consistent with the stratigraphy, and the posterior standard deviations of burial ages significantly decreased (
The sequence of De sets simulated in Section 4 (Simulated De sets) was determined to be free of systematic error arising from dose rate measurements. However, the systematic error term that induces dependences between observations and the associated age-depth model should always be taken into account when analyzing measured samples (e.g., Rhodes
This study employed an MCMC sampling method for the statistical analysis of De datasets, using statistical age models, including the CAM, MAM3, and MXAM3. The method was tested using measured, and simulated De sets, and consistent results in agreement with expected values were obtained. The development of numerical programs provides a statistical toolbox for the calculation of OSL ages using MCMC sampling for both individual samples and multiple samples with (or without) order constraints by taking into account systematic and individual errors. The MCMC method employed in this study can be used in addition to other Bayesian models in tackling luminescence-related chronological data and can be flexibly adapted to compute OSL ages of both well- and poorly-bleached sedimentary samples.
Fig. 1

Fig. 2

Fig. 3

Fig. 4

Fig. 5
![Distribution of OSL sensitivity for the 127 grains of a sample taken from Lake Mungo, Australia. The red line denotes the probability density curve obtained by fitting the OSL sensitivity data using a Gamma distribution. The fitted Gamma distribution has a shape parameter of α=0.565 and a rate parameter of β=0.191. Q[x%] denotes the x% sample quantile of OSL sensitivity.](https://sciendo-parsed-data-feed.s3.eu-central-1.amazonaws.com/606af92a3999800867736d79/j_geochr-2015-0114_fig_005.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Date=20220815T151539Z&X-Amz-SignedHeaders=host&X-Amz-Expires=18000&X-Amz-Credential=AKIA6AP2G7AKP25APDM2%2F20220815%2Feu-central-1%2Fs3%2Faws4_request&X-Amz-Signature=8f9d7d134100e8f6880a056a08b6e506d8933e6685837e26f765edf6a9504b90)
Fig. 6

Fig. 7

Fig. 8

A summary of burial dose and age estimated from various statistical age models for nine simulated De sets taken from the same sedimentary sequence. Results obtained using MLE and results obtained using the MCMC sampling protocol as shown in Fig. 2 without and with order constraints are presented. The systematic error on the dose rate measurement shared by all simulated samples was assumed to be σdc=0.
#1 | MAM3 | 7 | 18 | 18.35 ± 1.17 | 7.95 ± 0.75 | 17.85 ± 1.17 | 7.82 ± 0.76 | 16.58 ± 1.02 | 6.80 ± 0.44 |
#2 | 7.5 | 21 | 21.16 ± 0.63 | 6.95 ± 0.53 | 20.96 ± 0.79 | 6.96 ± 0.57 | 21.17 ± 0.65 | 7.23 ± 0.40 | |
#3 | 8 | 24 | 24.47 ± 1.22 | 8.41 ± 0.72 | 23.64 ± 1.57 | 8.22 ± 0.81 | 23.25 ± 1.43 | 7.84 ± 0.42 | |
#4 | CAM | 8.5 | 27 | 26.91 ± 0.33 | 8.24 ± 0.59 | 26.92 ± 0.34 | 8.32 ± 0.61 | 26.93 ± 0.33 | 8.31 ± 0.39 |
#5 | 9 | 30 | 30.40 ± 0.38 | 8.50 ± 0.60 | 30.41 ± 0.39 | 8.59 ± 0.63 | 30.44 ± 0.38 | 8.83 ± 0.42 | |
#6 | 9.5 | 33 | 32.79 ± 0.46 | 10.59 ± 0.76 | 32.81 ± 0.47 | 10.71 ± 0.79 | 32.66 ± 0.45 | 9.62 ± 0.42 | |
#7 | MXAM3 | 10 | 36 | 35.77 ± 1.61 | 9.13 ± 0.76 | 36.43 ± 1.86 | 9.39 ± 0.84 | 37.18 ± 1.81 | 9.95 ± 0.45 |
#8 | 10.5 | 39 | 39.26 ± 1.37 | 9.32 ± 0.73 | 39.70 ± 1.69 | 9.53 ± 0.79 | 40.62 ± 1.87 | 10.39 ± 0.54 | |
#9 | 11 | 42 | 43.65 ± 2.55 | 10.63 ± 0.97 | 44.61 ± 2.52 | 10.96 ± 1.00 | 45.43 ± 2.53 | 11.48 ± 0.85 |
A summary of Gelman-Rubin convergence diagnostics for measured and simulated samples obtained using the MCMC sampling protocol, as shown in Fig. 2 with order constraints. n.eff is the effective sample size, while Rhat (i.e., the shrink factor) is a statistic measure of the ratio of the average variance of samples within each chain to the variance of the pooled samples across chains. If all chains are at equilibrium, the Rhat will be 1. If these chains have not converged to a common distribution, the Rhat statistic will be greater than 1.
GL2-1 | 16000 | 0.9998 | 5541.74 | 1.0002 | #1 | 13865.37 | 1.0001 | 9987.93 | 1.0000 |
GL2-2 | 16000 | 0.9999 | 8220.18 | 1.0003 | #2 | 16000 | 1.0001 | 11473.55 | 1.0000 |
GL2-3 | 16000 | 0.9997 | 7595.60 | 1.0002 | #3 | 9405.95 | 1.0004 | 12466.84 | 1.0003 |
GL2-4 | 16000 | 0.9998 | 8864.01 | 1.0003 | #4 | 16000 | 1.0003 | 13161.07 | 0.9999 |
#5 | 16000 | 0.9999 | 12830.44 | 0.9999 | |||||
#6 | 16000 | 0.9999 | 14048.98 | 0.9998 | |||||
#7 | 16000 | 0.9999 | 13073.69 | 0.9998 | |||||
#8 | 7780.82 | 1.0003 | 12665.56 | 1.0001 | |||||
#9 | 16000 | 0.9999 | 16000 | 0.9999 |
A summary of burial dose and age estimated from the CAM for De sets from four measured aeolian samples using the MCMC sampling protocol shown in Fig. 2 without and with order constraints. The systematic error for the dose rate measurement shared by all samples was set to σdc=0.1.
GL2-1 | 3.26 ± 0.25 | 34.55 ± 0.91 | 10.75 ± 0.96 | 34.23 ± 0.86 | 9.72 ± 0.52 |
GL2-2 | 2.84 ± 0.21 | 32.71 ± 1.00 | 11.68 ± 1.06 | 32.19 ± 0.92 | 10.28 ± 0.49 |
GL2-3 | 3.23 ± 0.23 | 30.58 ± 1.04 | 9.59 ± 0.85 | 31.03 ± 1.00 | 10.59 ± 0.52 |
GL2-4 | 3.40 ± 0.24 | 32.24 ± 1.29 | 9.60 ± 0.85 | 33.13 ± 1.26 | 11.13 ± 0.67 |
Comparisons of burial dose and age estimated from MLE and MCMC for the various statistical age models using De sets from four measured aeolian samples. Quantities estimated using the MCMC sampling protocol shown in Fig. 1 are marked in bold. Quantities estimated using the MLE are inside parentheses.
GL2-1 | 3.26 ± 0.25 | ||||||
GL2-2 | 2.84 ± 0.21 | ||||||
GL2-3 | 3.23 ± 0.23 | ||||||
GL2-4 | 3.40 ± 0.24 |