Rodney M. Goodman
Dr. John Lindal
What is a rule
based network?
ITRule uses the
rules it has discovered to build a parallel inference network rather
like a neural network. However, unlike a neural network which is
an implicit "black box" predictor, the ITRule
Rule Based Network has an explicit architecture and operation.
The architecture is explicit because links in the network correspond
to rules. The inference is explicit because the weights on the links
correspond to the "weight of evidence" associated with
each rule. That is, our belief in the truth of the rule RHS, given
that the rule LHS has fired.
The explicit nature of the Rule Based Network allows all its decisions
to be audited by humans, and if necessary shown to a third party
or judge to prove that it is operating in the desired manner.The
Rule Based Network is a powerful new extension of a simple first-order
Bayesian classifier. The network is capable of acting as a classifier
or, much more powerfully, outputting probability or "confidence"
estimates for each output decision. This enables a higher level
decision maker (such as a human) to make the final decision. The
advantage here is that situations in which a completely unknown
input is presented can be identified by low confidence on all the
outputs. This alerts the system to the fact that more training is
necessary to derive rules to handle the new situation.
A Medical Example:
Cancer Database
A common technique in breast cancer diagnosis is
to obtain a fine needle aspirate (FNA) from a patient under examination.
The FNA sample is evaluated under a microscope by a physician who
makes a diagnosis. All patients evaluated as malignant, and some
of those labeled as benign, later undergo biopsy which confirms
or disconfirms the original diagnosis - the other patients diagnosed
as benign undergo later re-examination to provide a true measurement
of their condition. Since biopsy is roughly eight times as costly
as the FNA technique, it is important that unnecessary biopsies
be kept to a minimum. Hence, it is important that any automated
decision maker outputs the confidence in its decisions, so that
the physician can make a final decision. Also it is important that
the automated system can explain its decisions in meaningful terms
to the physician - hence an ideal domain for a Rule Based Network.
The data is in the form of nine subjectively evaluated
characteristics of the FNA sample for each of 535 patients. These
features describe general characteristics of the FNA sample as seen
under a microscope, such as uniformity of cell size, marginal adhesion
and mitoses (W.H. Wolberg and O.L. Mangasarian, "Multisurface
method of pattern separation for medical diagnosis applied to breast
cytology," Proceedings of the National Academy of Sciences,
U.S.A., 87, 9193-9196, 1990.) Ground truth in the form of
class labels (benign or malignant) was obtained by biopsy.
The rule based network learned by ITRule
is shown in Figure 1(pdf). After training on only the first
50 examples, the network is able to predict the remaining 485 examples
with 95% accuracy. Furthermore, when ITRule makes a wrong
decision the confidence outputs are always very low - thus showing
that the network really does not have the data to make a good decision
in these cases.
Rule |
Info. |
Strength |
IF cell size uniformity = 1 AND mitoses = 1THEN diagnosis
benign |
0.297 |
5.9 |
IF bare nuclei = 1 AND normal nucleoli = 1 THEN diagnosis
benign |
0.289 |
6.2 |
IF bare nuclei = 1 AND epithelial cell size = 2 THEN diagnosis
benign |
0.271 |
8.0 |
IF bare nuclei = 10THEN diagnosis malignant |
0.231 |
-4.4 |
IF clump thickness = 10 THEN diagnosis malignant |
0.145 |
-5.7 |
IF cell size uniformity = 10 THEN diagnosis malignant |
0.111 |
-5.3 |
IF normal nucleoli = 10THEN diagnosis malignant |
0.103 |
-5.2 |
IF marginal adhesion = 10 THEN diagnosis malignant |
0.085 |
-4.2 |
IF cell size uniformity = 5THEN diagnosis malignant |
0.057 |
-4.5 |
IF epithelial cell size = 10THEN diagnosis malignant |
0.056 |
-3.8 |
IF bland chromatin = 8 THEN diagnosis malignant |
0.045 |
-4.2 |
More Info on Rule
Based nets:
R.M. Goodman, P. Smyth, C.M. Higgins & J. Miller, “Rule-Based
Neural Networks for Classification and Probability Estimation,
"Neural Computation", Vol. 4, No. 6, pp. 781-804, November
1992.
top
back to ItRule page