A New Method for Efficient Symbolic Propagation in Discrete Bayesian Networks

E. Castillo, J.M. Gutiérrez, and A.S. Hadi
Networks. Vol. 28, 31-43.

ABSTRACT.

The paper presents a new efficient method for uncertainty propagation in discrete Bayesian networks in symbolic, as opposed to numeric, form, when some of the probabilities of the Bayesian network are treated as parameters. The algebraic structure of the conditional probabilities of any set of nodes, given some evidence, is characterized as ratios of linear polynomials in the parameters. We use this result to obtain these symbolic expressions efficiently by calculating the coefficients of the polynomials involved, using standard numerical algorithms. The numeric canonical components method is proposed as an alternative to symbolic computations, gaining in speed and simplicity. It is also shown how to avoid redundancy when calculating the numeric canonical component probabilities using standard message-passing methods. The canonical components can also be used to obtain lower and upper bounds for the symbolic expressions associated with the probabilities. Finally, we analyze the problem of symbolic evidence, which allows answering multiple queries regarding a given set of evidential nodes. In this case, the symbolic expressions obtained for the probabilities are shown to be ratios of non-linear polynomials. Consequently, symbolic inference is practical only with a small set of symbolic evidential nodes. The methodology is illustrated by examples.

Bayesian networks are powerful tools both for graphically representing the relationships among a set of variables and for dealing with uncertainties in expert systems. A key problem in Bayesian networks is evidence propagation, that is, obtaining the posterior distributions of variables when some evidence is observed. Several efficient methods for propagation of evidence in Bayesian networks have been proposed in recent years. Exact methods exploit the independence structure contained in the network to efficiently propagate uncertainty (see, for example, Kim and Pearl (1983), Lauritzen and Spiegelhalter (1988), Jensen, Olesen, and Andersen (1990), Pearl (1988), and Shachter, Andersen, and Szolovits (1994)). Stochastic simulation methods constitute an interesting alternative in highly connected networks, where exact algorithms may become inefficient (Pearl (1986), Henrion (1988), Shachter and Peot (1990a), Fung and Chang (1990), Bouckaert, Castillo, and Gutiérrez (1996)). Recently, search-based approximation algorithms, which search for high-probability configurations through a space of possible values, have emerged as an alternative to the above methods in special cases such as Bayesian networks with extreme probabilities (Poole (1993), Santos and Shimony (1994), Li and D'Ambrosio (1995)).

However, all exact and approximate methods require that the joint probabilities of the nodes be specified numerically, that is, all the parameters must be assigned numeric values. In practice, exact numeric specification of these parameters may not be available, or it may happen that the subject matter specialists can specify only ranges of values for the parameters rather than their exact values. In such cases, there is a need for symbolic methods which are able to deal with the parameters themselves, without assigning them numeric values. Symbolic propagation leads to solutions which are expressed as functions of the parameters in symbolic form. Thus, the answers to general queries can be given symbolically in terms of the parameters, and the answers to specific queries can then be obtained by substituting the parameter values into the symbolic solution, without the need to redo the propagation. Furthermore, symbolic propagation allows one to study the sensitivity of the results to changes in parameter values with little additional computational effort.

Recently, two main approaches have been proposed for symbolic inference in Bayesian networks. The symbolic probabilistic inference algorithm (SPI) (Shachter, D'Ambrosio, and DelFabero (1990b), Li and D'Ambrosio (1994)) is a goal-directed method which performs only those calculations that are required to respond to queries. Symbolic expressions can be obtained by postponing the evaluation of expressions and maintaining them in symbolic form. On the other hand, Castillo, Gutiérrez, and Hadi (1995a, 1995b, 1996) perform symbolic calculations using slightly modified versions of standard numerical propagation algorithms, first replacing the values of the initial probabilities by symbolic parameters and then using computer packages with symbolic computational capabilities (such as Mathematica and Maple) to propagate uncertainty. As opposed to the SPI algorithm, this method is not goal-oriented, but it allows us to obtain symbolic expressions for all the nodes in the network.

However, both methods suffer from the same problem: they need special programs, or extra effort to implement the necessary code, to carry out the symbolic computations. Furthermore, computing and simplifying symbolic expressions is a computationally expensive task, and it becomes increasingly inefficient when dealing with large networks or large numbers of symbolic parameters. In this paper we present an efficient approach to symbolic propagation that takes advantage of the polynomial structure of the probabilities of the nodes to avoid symbolic computations. The main idea of the method is to obtain the symbolic expressions through a numerical algorithm that computes the coefficients of the associated polynomials. All the computations are then carried out numerically, avoiding the computationally expensive symbolic manipulations. The main findings of this paper are the following:

The algebraic structure of the initial or updated conditional probabilities of events, given the evidence, is characterized: when several probabilities in the Bayesian network are treated as parameters, these conditional probabilities are ratios of polynomials that are linear in each one of the parameters.

Taking advantage of this structure of the probabilities, a new method is introduced which obtains the symbolic expressions in the parameters by performing only numerical computations. Each symbolic query can be answered by performing numeric computations (numeric canonical components) using any of the standard methods mentioned above. Furthermore, we show that, when using message-passing algorithms to obtain the numerical values associated with the canonical components, some of the messages are common to several components. Consequently, important savings in computation time can be obtained by avoiding redundant calculations.

The upper and lower bounds for the symbolic expressions for the probabilities are attained at canonical cases. Thus, calculating these values does not require extra computational effort, because it reduces to the simple operation of finding the maximum and minimum values and can be done during the process of calculating these expressions. These bounds provide useful information about the sensitivity of the node probabilities to certain parameters in the network.

When symbolic evidence is introduced in the network, the symbolic expressions for the conditional probabilities become ratios of non-linear polynomials in the symbolic probabilities and symbolic evidential parameters. As a result, symbolic propagation of symbolic evidence can be performed efficiently only in cases with a small number of symbolic evidential nodes.
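The first two findings can be illustrated with a minimal sketch. It is a simplified illustration under stated assumptions, not the authors' canonical components algorithm: the two-node network A -> B, the parameter names t1 and t2, the fixed value 3/10, and all function names are hypothetical, and brute-force enumeration stands in for the standard numerical propagation methods. The sketch evaluates the numerator and denominator of P(A=0 | B=0) numerically at the 0/1 extreme values of the parameters and recovers the polynomial coefficients from those values alone.

```python
from fractions import Fraction
from itertools import product

# Hypothetical network A -> B with binary nodes (not taken from the paper).
# Symbolic parameters: t1 = P(A=0), t2 = P(B=0 | A=0).
# P(B=0 | A=1) is an assumed fixed numeric value.
P_B0_GIVEN_A1 = Fraction(3, 10)


def joint(a, b, t1, t2):
    """Return P(A=a, B=b) for numeric values of the two parameters."""
    p_a = t1 if a == 0 else 1 - t1
    p_b0 = t2 if a == 0 else P_B0_GIVEN_A1
    p_b = p_b0 if b == 0 else 1 - p_b0
    return p_a * p_b


def numerator(t1, t2):
    """Unnormalized posterior P(A=0, B=0); stands in for a numeric propagation."""
    return joint(0, 0, t1, t2)


def denominator(t1, t2):
    """Probability of the evidence, P(B=0)."""
    return joint(0, 0, t1, t2) + joint(1, 0, t1, t2)


def multilinear_coefficients(f, n_params):
    """Recover the coefficients of a polynomial that is linear in each
    parameter from its values at the 0/1 corner points.

    Keys are exponent tuples, e.g. (1, 0) is the monomial t1.
    """
    corners = list(product((0, 1), repeat=n_params))
    values = {t: Fraction(f(*t)) for t in corners}
    coeffs = {}
    for s in corners:
        total = Fraction(0)
        for t in corners:
            # Only corners whose 1-entries are a subset of those of s contribute.
            if all(ti <= si for ti, si in zip(t, s)):
                total += (-1) ** (sum(s) - sum(t)) * values[t]
        coeffs[s] = total
    return coeffs


def evaluate(coeffs, params):
    """Evaluate the recovered polynomial at numeric parameter values."""
    total = Fraction(0)
    for exponents, c in coeffs.items():
        term = c
        for e, v in zip(exponents, params):
            if e:
                term *= v
        total += term
    return total


num = multilinear_coefficients(numerator, 2)
den = multilinear_coefficients(denominator, 2)

# Answer a specific query by plugging values into the symbolic ratio,
# without redoing the propagation.
theta = (Fraction(1, 2), Fraction(1, 5))
posterior = evaluate(num, theta) / evaluate(den, theta)
print(num)
print(den)
print(posterior)
```

In this toy case the recovered numerator is t1 t2 and the recovered denominator is 3/10 - (3/10) t1 + t1 t2, so the posterior at t1 = 1/2, t2 = 1/5 is 2/5. Exact fractions are used so that the recovered coefficients can be compared without rounding error.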

For example, consider the following Bayesian network defined using some symbolic parameters:

The symbolic marginal probabilities of the nodes obtained by using the above symbolic method are

References:

Bouckaert, R. R., Castillo, E., and Gutiérrez, J. M. (1996), "A Modified Simulation Scheme for Inference in Bayesian Networks," International Journal of Approximate Reasoning.

Castillo, E., Gutiérrez, J. M., and Hadi, A. S. (1995a), "Parametric Structure of Probabilities in Bayesian Networks," Lecture Notes in Artificial Intelligence, Springer-Verlag, 946, 89-98.

Castillo, E., Gutiérrez, J. M., and Hadi, A. S. (1995b), "Symbolic Propagation in Discrete and Continuous Bayesian Networks," in Mathematics with Vision: Proceedings of the First International Mathematica Symposium, (V. Keranen and P. Mitic, eds.), Computational Mechanics Publications, 77-84.

Castillo, E., Gutiérrez, J. M., and Hadi, A. S. (1996), Expert Systems and Probabilistic Network Models, Springer-Verlag, New York.

Fung, R. and Chang, K-C. (1990), "Weighing and Integrating Evidence for Stochastic Simulation in Bayesian Networks," in Uncertainty in Artificial Intelligence 5, Machine Intelligence and Pattern Recognition Series, 10, (Henrion et al. Eds.), North Holland, Amsterdam, 209-219.

Henrion, M. (1988), "Propagating Uncertainty in Bayesian Networks by Probabilistic Logic Sampling," in Uncertainty in Artificial Intelligence 2, (J.F. Lemmer and L. N. Kanal, Eds.), North Holland, Amsterdam, 317-324.

Jensen, F. V., Olesen, K. G., and Andersen, S. K. (1990), "An Algebra of Bayesian Belief Universes for Knowledge-Based Systems," Networks, 20, 637-659.

Kim, J. H. and Pearl, J. (1983), "A Computation Model for Causal and Diagnostic Reasoning in Inference Systems," in Proceedings of the 8th International Joint Conference on AI, Los Angeles, 190-193.

Lauritzen, S. L. and Spiegelhalter, D. J. (1988), "Local Computations with Probabilities on Graphical Structures and Their Application to Expert Systems," Journal of the Royal Statistical Society (B), 50, 157-224.

Li, Z., and D'Ambrosio, B. (1994), "Efficient Inference in Bayes Nets as a Combinatorial Optimization Problem," International Journal of Approximate Reasoning, 11, 1, 55-81.

Li, Z., and D'Ambrosio, B. (1995), "A Framework for Ordering Composite Beliefs in Belief Networks," IEEE Transactions on Systems, Man, and Cybernetics, 25, 2, 243-255.

Martos, B. (1964), "Hyperbolic Programming," Naval Research Logistics Quarterly, 32, 135-156.

Pearl, J. (1986), "Evidential Reasoning Using Stochastic Simulation of Causal Models," Artificial Intelligence, 32, 245-287.

Pearl, J. (1988), Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, Morgan Kaufmann, San Mateo, CA.

Poole, D. (1993), "Average-case Analysis of a Search Algorithm for Estimating Prior and Posterior Probabilities in Bayesian Networks with Extreme Probabilities," in Proceedings of the 13th International Joint Conference on Artificial Intelligence, 13, 1, 606-612.

Santos, E., and Shimony, S. E. (1994), "Belief Updating by Enumerating High-Probability Independence-Based Assignments," in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence, 506-513. Morgan Kaufmann Publishers, San Francisco.

Shachter, R. D. and Peot, M. A. (1990a), "Simulation Approaches to General Probabilistic Inference on Belief Networks," in Uncertainty in Artificial Intelligence 5, Machine Intelligence and Pattern Recognition Series, 10 (Henrion et al. Eds.), North Holland, Amsterdam, 221-231.

Shachter, R. D., D'Ambrosio, B., and DelFabero, B. (1990b), "Symbolic Probabilistic Inference in Belief Networks," in Proceedings Eighth National Conference on AI, 126-131.

Shachter, R. D., Andersen, S. K. and Szolovits, P. (1994), "Global Conditioning for Probabilistic Inference in Belief Networks," in Proceedings of the Tenth Conference on Uncertainty in Artificial Intelligence, 514-522. Morgan Kaufmann Publishers, San Francisco.