 Original Article
 Open Access
 Published:
Extension of emission expectation maximization lookalike algorithms to Bayesian algorithms
Visual Computing for Industry, Biomedicine, and Art volume 2, Article number: 14 (2019)
Abstract
We recently developed a family of image reconstruction algorithms that look like the emission maximumlikelihood expectationmaximization (MLEM) algorithm. In this study, we extend these algorithms to Bayesian algorithms. The family of emissionEMlookalike algorithms utilizes a multiplicative update scheme. The extension of these algorithms to Bayesian algorithms is achieved by introducing a new simple factor, which contains the Bayesian information. One of the extended algorithms can be applied to emission tomography and another to transmission tomography. Computer simulations are performed and compared with the corresponding unextended algorithms. The totalvariation norm is employed as the Bayesian constraint in the computer simulations. The newly developed algorithms demonstrate a stable performance. A simple Bayesian algorithm can be derived for any noise variance function. The proposed algorithms have properties such as multiplicative updating, nonnegativity, faster convergence rates for bright objects, and ease of implementation. Our algorithms are inspired by Green’s onesteplate algorithm. If written in additiveupdate form, Green’s algorithm has a step size determined by the future image value, which is an undesirable feature that our algorithms do not have.
Introduction
This work is inspired by Green’s onesteplate (OSL) expectationmaximization (EM) algorithm [1, 2]. Green’s algorithm became popular because it is userfriendly and easy to implement. It has a wide range of applications, such as in positron emission tomography (PET) and single photon emission computed tomography (SPECT) [3,4,5,6,7]. Green’s algorithm also has applications in other fields, such as the minimization of the penalized Idivergence [8]. Furthermore, Green’s algorithm may diverge [9]. This study improves Green’s algorithm, making it more stable and more applicable for various noise models.
Green’s algorithm is a maximum a posterior (MAP) algorithm, using imagedomain constraints for emission tomography. Other MAP algorithms exist [10,11,12,13,14,15]. In ref. [12], a method of projection onto convex sets (POCS) was proposed to enforce data fidelity, totalvariation (TV) minimization, and image nonnegativity. In addition, a GPU algorithm was proposed in ref. [13] to combat the long computation time in combined EM and TV minimization. Filtered backprojection (FBP) reconstruction was proposed for use as the initial image for penalized weighted leastsquares (PWLSTV) reconstruction [12]. Furthermore, in ref. [13] monotonic algorithms for transmission tomography penalized likelihood image reconstruction were developed based on paraboloidal surrogate functions. A similar idea using surrogate functions was reported in refs.[16, 17].
Most recently, we developed a family of emissionEMlookalike algorithms [10]. These were iterative algorithms in the form of multiplicative image updating, which intrinsically enforced image nonnegativity. The unique feature of this family was that the scaling factor was formed by the forward projection of the reconstructed image at the previous iteration, which is a unique feature in the “Estep” in an EM algorithm. Each member of the family had its own noise model. This work will extend this family of emissionEMlookalike algorithms to Bayesian algorithms, by introducing a new factor. The three main features of the proposed algorithms comprise multiplicative updating with a nonnegativity constraint, weighting by a projection noise model, and the incorporation of Bayesian constraints.
Many MAP algorithms in image reconstruction, especially in transmission tomography, employ the POCS methodology, which is an alternating optimization method. This breaks the objective function into different parts and optimizes each part separately. Our proposed method optimizes the objection function with all constraints considered simultaneously.
Methods
Modification of iterative Green’s OSL algorithm
We first provide a brief review of Green’s algorithm, before extending it. The iterative Green’s OSL algorithm can be expressed as [1, 2].
where \( {x}_{i,j}^{(n)} \) is the reconstructed image pixel (i, j) at the nth iteration, p_{k} is the kth raysum measurement, a_{(i,j)k} is the contribution of the pixel x_{i,j} to the measurement p_{k}, β is a control parameter, and \( {U}_{i,j}^{(n)} \) is the derivative of a penalty function V with respect to the image pixel \( {x}_{i,j}^{(n)} \) at the nth iteration, i.e.,
Using the approximation 1/(1+x)≈1x when │x│<<1, the first factor \( {x}_{i,j}^{(n)}/\left[\sum \limits_k{a}_{\left(i,j\right)k}+\beta {U}_{i,j}^{(n)}\right] \) in algorithm (1) can be approximated as \( \frac{x_{i,j}^{(n)}}{\sum \limits_k{a}_{\left(i,j\right)k}}\left(1\frac{\beta }{\sum \limits_k{a}_{\left(i,j\right)k}}{U}_{i,j}^{(n)}\right) \). Here, \( \sum \limits_k{a}_{\left(i,j\right)k} \), is in general not a constant. If \( \beta /\sum \limits_k{a}_{\left(i,j\right)k} \) is not a constant, then the constraint U is not uniformly enforced throughout the image. To improve the algorithm, we simply discard \( \sum \limits_k{a}_{\left(i,j\right)k} \) in \( \beta /\sum \limits_k{a}_{\left(i,j\right)k} \). Thus, our proposed modification of Green’s algorithm is
We will gain further insight into this modification by rewriting both the original Green’s algorithm (1) and the modified algorithm (3) in the additiveupdate form (that is, in the form of gradient descent). The additive form can be expressed as
where
is the noiseweighting factor for the Poisson noise model and
is the step size for projection data fidelity minimization. In algorithm (4), λ_{1} is the step size for Bayesian constraint minimization. For the original Green’s algorithm (1),
whereas for the revised algorithm (3),
The most significant difference between algorithms (7) and (8) is that the factor \( {\lambda}_1^{(original)} \) in (7) depends on the future image \( {x}_{i,j}^{\left(n+1\right)} \), while the factor \( {\lambda}_1^{(revised)} \) in (8) depends only on the current image \( {x}_{i,j}^{(n)} \).
It is required that the image x_{i,j} is nonnegative. It can be observed from algorithm (3) that if \( \beta {U}_{i,j}^{(n)}>1 \), then the sign of x_{i,j} will alternate. Therefore, a necessary condition for the image to be nonnegative is \( \beta {U}_{i,j}^{(n)}<1 \). This intrinsic nonnegativity constraint is guaranteed by the requirement that \( \beta {U}_{i,j}^{(n)}<1 \) if the initial image is positive. This can be readily observed by noticing that every factor in algorithm (3) is nonnegative.
One way to prevent this from occurring is to introduce a sigmoid function φ, and to replace \( \beta {U}_{i,j}^{(n)} \) by \( \phi \left(\beta {U}_{i,j}^{(n)}\right) \). There are many ways to define a sigmoid function φ. For example, one may choose \( \phi (x)=x/\sqrt{1+{x}^2} \).
In deriving the Green’s algorithm using prior information [1], it is necessary to evaluate the derivative of the energy function V, which carries the prior information. This energy function is defined for the updated image, which is not yet available. In Green’s algorithm, an approximation is performed to evaluate this derivative of the energy function, using the current image to replace the future image. This approximation is termed “onesteplate”.
The derivation of the EMlookalike algorithms in ref. [10] was based on the noise variance model, unlike the conventional approach based on a random variable distribution function. Our derivation only considered two items: (1) the noise variance in the projections and (2) the nonnegativity constraint for the image.
The derivation of the MAP in this study can been considered as an approximation of Green’s MAP algorithm using 1/(1+x)≈1x when │x│<<1. The proposed algorithms are in the form of “(1βU) × (EMlookalike).” When β = 0, this form is exactly the EMlookalike form. The factor (1βU) is new in this work, to minimize a Bayesian function V whose gradient is the function U. By driving U to zero, the Bayesian function V is minimized. The additive form algorithm (4) reveals that the proposed algorithms minimize the objective function
where the functions U and V are related as
For a different noise model, we can simply change the noise weighting w_{k} as in ref. [10].
This study builds on ref. [10], by considering a new energy function V and forcing its gradient U to zero. This point can be intuitively appreciated from the additive form algorithm (4).
From algorithm (6), we observe that the MLEM algorithm’s step size λ_{2} is scaled by the image pixel value \( {x}_{i,j}^{(n)} \) at the nth iteration. As a result, brighter objects converge faster than darker objects.
From algorithm (5), we observe that the weighting factor w_{k} is the reciprocal of the estimated mean value of the kth raysum at the nth iteration. Note that w_{k} will change with different noise models.
From algorithm (7), we observe that λ_{1} depends on the image value of the next iteration. This feature is undesirable, because it may cause the algorithm diverge. This undesirable feature has been removed from the revised algorithm, as shown in algorithm (8), where λ_{1} depends only on the current image value.
The parameter λ_{2} is intrinsically determined by the conventional MLEM algorithm. The parameter λ_{1} is affected by the parameter β. For any penalty function V, the parameter β is chosen by trialanderror. When in doubt, a smaller positive β value should be chosen.
If the true solution with \( {\sum}_{\hat{i},\hat{j}}{a}_{\left(\hat{i},\hat{j}\right)k}{x}_{\hat{i},\hat{j}}={p}_k \) and U_{i,j} = 0 exits, then it is straightforward to verify that the true solution is a fixed point of the proposed algorithm (3). In fact, letting \( {\sum}_{\hat{i},\hat{j}}{a}_{\left(\hat{i},\hat{j}\right)k}{x}_{\hat{i},\hat{j}}^{(n)}={p}_k \) and \( {U}_{i,j}^{(n)}=0 \), the righthand side of (3) becomes \( {x}_{i,j}^{(n)} \).
Modified algorithm for no weighting
We now consider a hypothetical imaging system, where the noise in the measurements is identically distributed with the same variance. In this case, noise weighting should not be utilized in the image reconstruction algorithm. The MLEM lookalike algorithm for this hypothetical case is given as [10].
Using our strategy of introducing a simple new factor \( \left(1\beta {U}_{i,j}^{(n)}\right) \), the Bayesian algorithm associated with algorithm (11) is proposed as
Modified algorithm for the transmission noise model
The variance of the transmission tomography sinogram is proportional to the exponential function of the sinogram’s mean value [11]:
An MLEM lookalike algorithm for the transmission data is derived in ref. [10] as
It is straightforward to modify algorithm (14) to a Bayesian algorithm, by introducing a new factor \( \left(1\beta {U}_{i,j}^{(n)}\right) \) as follows:
In general, a Bayesian algorithm can be readily obtained from a multiplicativeupdate image reconstruction algorithm by introducing a new factor \( \left(1\beta {U}_{i,j}^{(n)}\right) \). The resulting Bayesian algorithm remains multiplicative.
The TV penalty function
Any penalty function V can be employed in the proposed algorithm (3). Some constraints encourage smoothing, such as the maximum entropy constraint [18], because their main goal is denoising. Maximum entropy algorithms tend to oversmooth images, and as a result sharp edges are not maintained. Thus, maximum entropy algorithms are not popular for CT image reconstruction. On the other hand, TVtype constraints can reduce noise and maintain sharp edges when the parameters are suitably chosen. Here, we select the TV norm for a feasibility evaluation:
where x_{i,j} is a pixel value in a twodimensional (2D) image. The associated derivative (2) is given as
Here, the small value of ε is introduced to prevent the denominator being zero. In this study, ε = 0.0001 is adopted.
Computer simulations
Two sets of computer simulations were conducted, using emission and transmission noise models, respectively. The simulation setup for the emission data is as follows.
There were 180 projection views over 360°. The images were reconstructed in an array of size 128 × 128 (pixels). A parallelhole collimation was assumed for the data generation. The detector had 128 detection bins, and the bin size was the same as the image pixel size.
A 2D circular phantom with a diameter of 120.32 pixels was employed in the simulations. The phantom, based on SPECT imaging, contained two small cold disks and two small hot disks, all with a diameter of 25.6 pixels, as shown in Fig. 1. The image intensity of the large circular disk was defined as 1 unit. The cold disks had an intensity value of 0.5, and the hot disks had an intensity value of 1.5. The projections were generated analytically, without using discrete pixels, and noisy projections were generated using the Poisson noise model. The total number of counts was approximately 2 × 10^{6}.
The computer simulation setup for the transmission data was as follows. A parallelbeam imaging geometry was assumed. The image array was of size 512 × 512, the number of views was 400 over 180°, and the number of detection channels was 512. The transmission phantom looked similar to the emission phantom (Fig. 1), except four times larger. The pixel length was 0.5 mm. Furthermore, the attenuation coefficient was 0.0193 mm^{− 1} for the large disc, 0.0269 mm^{− 1} for the small circular bright regions, and 0.0083 mm^{− 1} for the small circular dark regions.
The transmission CT noise model was adopted for the sinogram data with very low counts, where the sinogram variance was proportional to the exponential function of the sinogram value. Two xray influxes were considered: I_{0} = 100 and I_{0} = 10,000.
Three regions were selected in the image for TVnorm noise evaluation. Note that the TV norm can measure the image fluctuation. These regions are depicted in Fig. 1. The average of the TV norms in these regions was employed as a figureofmerit for noise evaluation. Furthermore, a line profile was provided for each reconstructed image. The location of the line profile is indicated in Fig. 1. As an additional figureofmerit, the meansquarederror (MSE) was also calculated between the reconstruction and true profiles, and this is reported in the figures.
Results
Emission data simulation results
Three algorithms were used to reconstruct the images: the conventional MLEM algorithm (by setting β = 0 in either (1) or (9)), Green’s OSL algorithm (1), and the proposed algorithm (9). The results are depicted in Figs. 2, 3, and 4, respectively, for the three algorithms. The proposed algorithm and Green’s OSL algorithm yield similar performances.
The parameter β in the revised algorithm (3) is approximately equal to β in the original Green’s algorithm (1) divided by the backprojection value of the constant 1. Roughly speaking, the β value in the original Green’s algorithm is the β value in the revised algorithm times the number of view angles. In our example, β = 1.2 for the original Green’s algorithm and β = 0.01 for the revised algorithm, and the number of view angles is 180. Thus, the regularization in Fig. 4 is a little stronger than that in Fig. 3.
Transmission data simulation results
Two algorithms were used to reconstruct the images: the EMlookalike transmission algorithm (14) and proposed algorithm (15). For each algorithm, images were reconstructed with two noise levels. The results are presented in Figs. 5, 6, 7, and 8, for the two algorithms and two noise levels.
Finally, for comparison purposes we implemented the POCS algorithm proposed in ref. [12] and used it to reconstruct the transmission images. The results are presented in Figs. 9 and 10, for the lower and higher noise cases, respectively. We observe that our proposed simultaneous optimization algorithm performs better than the POCS algorithm proposed in ref. [12] in this task, in terms of the TV norm and MSE results.
It can be observed that the central region of the phantom appears darker in the Fig. 7. We hypothesize that noise may affect the convergence rate in an iterative algorithm. If a system of linear equations is more consistent, then the convergence rate may be faster. If the data is noisier and the system is less consistent, then the convergence rate may be slower.
We point out that when large 512 × 512 images are displayed as small binneddown images, as in Figs. 5–10, image details are lost. At iteration 10,000, all algorithms are considered converged. We zoom in on the upperright images in Figs. 5, 6, and 9 in Fig. 11. Here, one can better observe the differences between them. It is observed that the proposed Bayesian algorithms are effective in noise regularization, and stable as the iteration number increases.
The iterative POCS algorithm in our patient study provides better (yet noisier) spatial resolution than the proposed algorithm. The spatial resolution of an image reconstructed by the proposed iterative algorithms depends on the iteration number as well as the Bayesian penalty function. Usually, a larger iteration number gives a better spatial resolution, but a noisier reconstruction. The tradeoff between the spatial resolution and image noise is a main decision factor in selecting the iteration number. Suitable selection of the Bayesian penalty function, i.e., the constraints, plays an important role in the quality of the final reconstruction.
Conclusions
Our proposed algorithms are inspired by Green’s OSL EM algorithm. The main novelty of this study is to propose a general methodology that extends EMlookalike algorithms into MAP algorithms through a new multiplication factor (1βU). We claim that our approach can be extended to any multiplicative updating reconstruction algorithm, where image nonnegativity is built in. Thus, the proposed algorithms also have an intrinsic nonnegativity constraint. The proposed algorithms are simple to implement, and they simultaneously optimize all constraints (instead of using POCS).
We implemented the POCS algorithm presented in ref. [12] for transmission tomography, and we utilized the TV norm and MSE to evaluate the reconstructions. We observed that our proposed simultaneous optimization algorithm outperforms the POCS algorithm proposed in ref. [12] for our experiments.
Availability of data and materials
Not applicable.
Abbreviations
 EM:

Expectation maximization
 FBP:

Filtered backprojection
 MAP:

Maximum a posterior
 ML:

Maximum likelihood
 MSE:

Meansquarederror
 OSL:

Onesteplate
 PET:

Positron emission tomography
 POCS:

Projection onto convex sets
 PWLS:

Penalized weighted leastsquares
 SPECT:

Single photon emission computed tomography
 TV:

Total variation
References
Green PJ (1990) On use of the EM algorithm for penalized likelihood estimation. J Roy Stat Soc: Ser B 52(3):443–452. https://doi.org/10.1111/j.25176161.1990.tb01798.x
Green PJ (1990) Bayesian reconstructions from emission tomography data using a modified EM algorithm. IEEE Trans Med Imag 9(1):84–93. https://doi.org/10.1109/42.52985
Panin VY, Zeng GL, Gullberg GT (1999) Total variation regulated EM algorithm [SPECT reconstruction]. IEEE Trans Nucl Sci 46(6):2202–2210. https://doi.org/10.1109/23.819305
Ellis S, Reader AJ (2017) Longitudinal multidataset PET image reconstruction. In: abstracts of 2017 IEEE nuclear science symposium and medical imaging conference, IEEE, Atlanta, GA, USA, 2128 October 2017. https://doi.org/10.1109/NSSMIC.2017.8532657
Berker Y, Schulz V, Karp JS (2016) Discrete iterative algorithms for scattertoattenuation reconstruction in PET. In: abstracts of 2016 IEEE nuclear science symposium, medical imaging conference and roomtemperature semiconductor detector workshop, IEEE, Strasbourg, France, 29 October6 November 2016. https://doi.org/10.1109/NSSMIC.2016.8069455
Mair BA, Zahnen J (2006) A generalization of Green’s onesteplate algorithm for penalized ML reconstruction of PET images. In: abstracts of 2006 IEEE nuclear science symposium conference record, IEEE, San Diego, CA, USA, 29 October1 November 2006. https://doi.org/10.1109/NSSMIC.2006.356454
Zahnen JA (2006) Penalized maximum likelihood reconstruction methods for emission tomography. Dissertation, University of Florida. http://etd.fcla.edu/UF/UFE0015871/zahnen_j.pdf
Choi K, Lanterman AD (2007) Phase retrieval from noisy data based on minimization of penalized Idivergence. J Opt Soc Am A 24(1):34–49. https://doi.org/10.1364/JOSAA.24.000034
Fessler JA, Hero AO (1995) Penalized maximumlikelihood image reconstruction using spacealternating generalized EM algorithms. IEEE Trans Imag Proc 4(10):1417–1429. https://doi.org/10.1109/83.465106
Zeng GL (2018) Technical note: emission expectationmaximization lookalike algorithms for xray CT and other applications. Med Phys 45(8):3721–3727. https://doi.org/10.1002/mp.13077
Zeng GL, Wang WL (2016) Noise weighting with an exponent for transmission CT. Biomed Phys Eng Exp 2(4):045004. https://doi.org/10.1088/20571976/2/4/045004
Sidky EY, Pan XC (2008) Image reconstruction in circular conebeam computed tomography by constrained, totalvariation minimization. Phys Med Biol 53(17):4777–4807. https://doi.org/10.1088/00319155/53/17/021
Yan M, Chen JW, Vese LA, Villasenor J, Bui A, Cong J (2011) EM+TV based reconstruction for conebeam ct with reduced radiation. In: Bebis G, Boyle R, Parvin B, Koracin D, Wang S, Kyungnam K et al (eds) Advances in visual computing. Springer, Berlin, Heidelberg, pp 1–10. https://doi.org/10.1007/9783642240287_1
Hu ZL, Gao J, Zhang N, Yang YF, Liu X, Zheng HR, Liang D (2017) An improved statistical iterative algorithm for sparseview and limitedangle CT image reconstruction. Sci Rep 7(1):10747. https://doi.org/10.1038/s4159801711222z
De Pierro AR (1995) A modified expectation maximization algorithm for penalized likelihood estimation in emission tomography. IEEE Trans Med Imag 14(1):132–137. https://doi.org/10.1109/42.370409
Erdogan H, Fessler JA (1999) Monotonic algorithms for transmission tomography. IEEE Trans Med Imag 18(9):801–814. https://doi.org/10.1109/42.802758
Erdogan H, Fessler JA (1999) Ordered subsets algorithms for transmission tomography. Phys Med Biol 44(11):2835–2851. https://doi.org/10.1088/00319155/44/11/311
Elfving T (1989) An algorithm for maximum entropy image reconstruction from noisy data. Math Comput Model 12(6):729–745. https://doi.org/10.1016/08957177(89)903580
Acknowledgements
Not applicable.
Funding
This research is partially supported by NIH (No. R15EB024283).
Author information
Affiliations
Contributions
All authors read and approved the final manuscript.
Authors’ information
Gengsheng Zeng is with Department of Engineering, Utah Valley University and Department of Radiology and Imaging Sciences. Ya Li is with Department of Mathematics, Utah Valley University.
Corresponding author
Ethics declarations
Competing interests
None of the authors have any competing interests in the manuscript.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Zeng, G.L., Li, Y. Extension of emission expectation maximization lookalike algorithms to Bayesian algorithms. Vis. Comput. Ind. Biomed. Art 2, 14 (2019). https://doi.org/10.1186/s4249201900274
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s4249201900274
Keywords
 Image reconstruction
 Tomography
 Iterative reconstruction algorithm