Relational Structure Representation

Next: Work in Relational Matching Up: Matching under Relational Constraints Previous: Matching under Relational Constraints

5.1

Relational Structure Representation

The features, properties and relations can be denoted compactly as a relational structure (RS) [Fischler and Eschlager 1973,Ambler et al. 1973,Cheng and Huang 1984,Radig 1984,Li 1992a,Li 1992b]. An RS describes a scene or a (part of) model object. The problem of object recognition is reduced to that of RS matching.

Let us start with a scene RS. Assume there are m features in the scene. These features are indexed by a set of sites. The sites constitute the nodes of the RS. Each node is associated with it a vector composed of a number of unary properties or unary relations, . A unary property could be e.g. the color of a region, the size of an area, or the length of a line. Each pair of nodes () are related to each other by a vector composed of a number of binary (bilateral) relations, . A binary relation could be e.g. the distance between two points, or the angle between two lines. More generally, among n features , there may be a vector of n-ary relations. This is illustrated in Fig.5.1. An n-ary relation is also called a relation, or constraint, of order n. The scope of relational dependencies can be determined by a neighborhood system on . Now, the RS for the scene is defined by a triple

where and H is the highest order. For H=2, the RS is also called a relational graph (RG). The highest order H cannot be lower than 2 when contextual constraints must be considered.

The RS for a model object is similarly defined as

where , , and so on. In this case, the set of labels replaces the set of sites. Each element in indexes one of the M model features. In addition, the ``neighborhood system'' for is defined to consist of all the other elements, that is,

This means each model feature is related to all the other model features. The highest considered order, H, in is equal to that in . For particular n and k (; ), represent the same type of constraint as ; for example, both represent the angle between two line segments.

Figure 5.2: Discrete mapping involving the NULL label (numbered 0). All scene nodes (sites) not modeled by the considered object should be matched to this special label.

Relations of various orders impose unary, binary, ... , H-ary constraints on the features. Inter-contextual constraints are represented by the second or higher order relations. Due to these constraints, a scene, an object or a view of an object is seen as an integrated part rather than as individual features. The higher the order of relations is, the more powerful the constraints are, but the higher the complication and expenses are in the computation.

There can be multiple model RS's in a model base. A model RS describes a whole model object or a part of it. When an RS is used to describe a part (for example a view) of an object, the whole object may be described by several RS's and these RS's may be related by some inter-RS constraints.

Now introduce a virtual model composed of a single node . It is called NULL model. This special model represents everything not modeled by , such as features due to all the other model objects and the noise. So the actual label set in matching the scene to the model plus NULL contains M+1 labels. It is denoted by

After the introduction of the NULL , the mapping from to is illustrated in Fig.5.2.

Figure 5.3: Cup images and segmentation based on H-K surface curvatures. Top: a cup image and its H-K map. Bottom: a transformed version of the cup image and the HK map. Some noise is introduced during the transformation due to quantization. From (Li 1992a) with permission; © 1992 Academic Press.

Figures 5.3 and 5.4 demonstrate two cup images and their segmentations based on the H-K surface curvatures [Besl and Jain 1985]. Suppose we are matching the RG in Fig. 5.4(b) to that in (a). Based on the constraints from the unary properties of surface curvature and region area, and binary relations of distance between regions, the correct matching from (b) to (a) is , and the rest to NULL .

Model-based matching can be considered as finding the optimal mapping from the image RS to the model RS (or vice versa). Such a mapping from one RS to another is called a morphism, written as

which maps each node in to a node in

and thus maps relations to relations

A morphism is called an isomorphism if it is one-to-one and onto. It is called a monomorphism if it is one-to-one but not onto. It is called a homomorphism if it is many-to-one. We do not allow one-to-many mappings because they contradict the definition of functions and, more crucially, increase the difficulties in finding the optimal solutions.

Figure 5.4: Relational graphs built from the cup HK segmentation maps. The texture of the nodes denotes different surface types and the links represent the adjacency relations. (a) The RG for the orginal upright cup image. Node 1 corresponds to the body of the cup and node 2 to the handle. (b) The RG for the transformed cup. Node 5 corresponds to the body, node 1 to the handle, and the rest to NULL . (c) Legend for the correspondences between texture types and H-K surface types. From (Li 1992a) with permission; © 1992 Academic Press.

When the properties and relations in the considered RS's include numerical values, the mappings are numerical morphisms which are more difficult to resolve than symbolic morphisms. Since such morphisms do not preserve relations in the exact, symbolic sense, they may be called weak morphisms -- a term extended from the weak constraint models [Hinton 1978,Blake 1983,Blake and Zisserman 1987].

The goodness of a numerical morphism is usually judged by an objective function such as an energy. It is not very difficult to find a correct one-to-one mapping (isomorphism) between two identical RS's. The requirement that the unary properties of two nodes and the binary relations of two links must be exactly the same in order to be matched to each other provides a strong constraint for resolving the ambiguities. For the case where the two RS's have different numbers of nodes and the matched properties and relations are not exactly the same, the matching is more difficult because it cannot exploit the advantage of the strong constraint of the exact equalities. Only ``weak constraints'' are available. The subsequent sections use MRFs to establish the objective function for weak homomorphisms for inexact, partial matching.

Next: Work in Relational Up: Matching under Relational Previous: Matching under Relational