Fifth Speaker: Keith Holyoak

Course Syllabus

Further Information

Invited Speakers

Article Summaries

Listserv Archive

The Neural Theory of Language Project

metaphor

blending and conceptual integration

xxxxx

Questions and Summary of John Hummel and Keith Holyoak. 1997. Distributed Representations of Structure: A Theory of Analogical Access and Mapping.

discussed by Dan Bauer

Hummel and Holyoak present a computational theory of analogy formation which they have implemented in a model called LISA (Learning and Inference with Schemas and Analogies). Their central motivation is to integrate two major processes of analogy formation-- memory access and structural mapping-- while preserving both flexibility and sensitivity to structure.
Analogy makes use of complex relations and structures which emerge from recombination of simpler elements, but also requires flexiblity and generalization. Traditional symbolic systems maintain structure but are inflexible; connectionist systems are just the reverse. Past hybrid models have lacked a natural interface between the two. The LISA model attempts to reconcile the two approaches and unify access and mapping with both structure sensitivity and flexiblilty.
In this summary I will focus primarily on the architecture of LISA, since I believe a concrete picture of the implementation will make it easier to understand the theory's broader ramifications. The model has relatively few components, but it is complex because of its subtle and dynamic behavior, and I will make some simplifications for conceptual clarity.
OVERVIEW OF THE MODEL
LISA can be divided roughly into two interacting systems: a "working memory" (WM) and "long-term memory" (LTM). LTM is a layered network of "structural" units, and the bottom structural layer connects to WM's single layer of semantic units.
Concepts and relations (in LTM) are represented as trees of structural units of three types: propositions, subpropositions, and objects/predicates. Propositions (like "John loves Mary") are broken into subpropositions which isolate the roles of each agent or patient (like "John loves" and "Mary is loved"). These in turn are broken into objects ("John", "Mary") and predicates ("loves", "is loved"). There may also be additional levels of propositions within objects ("Sam knows John loves Mary").
Each proposition tree in LTM is a potential "analog"-- the source or target of an analogy.
The semantic units of WM connect to and allow distributed representations of each object or predicate at the bottom of an LTM proposition tree. The more similar two objects/predicates are, the more semantic units they will share.
WM also includes a set of "mapping" links between LTM structure units of the same type (eg. predicate-predicate, proposition-proposition).
The basic dynamics of the system are as follows:
Activity starts in LTM, in a particular proposition unit chosen as the target analog, or "driver". Flashes of activity spread alternately down various competing branches of the driver's structure units and activate patterns of semantic units in WM. These semantic units activate "similar" objects and predicates, and activation spreads back up competing branches of other "recipient" analogs. Recipients which are most strongly activated ("retrieved" from LTM) at any moment are considered the best "source" analogs for the original target.
When structure units of the same type are active concurrently, the WM "mapping" weight between them strenghtens; when structure units are uncorrelated, the connecting weight is weakened.
After one branch of the target (eg. "John loves") has been active for some time and brought out the best source analog (eg. "Bill likes") it shuts off, and another target branch ("Mary is loved") takes over and spreads activation to a new recipient ("Susan is liked"). Over time, as the components of the target ("John loves Mary") map to analogous components of the source ("Bill likes Susan"), WM builds up a flexible but structure-preserving analogy.
MAJOR THEORETICAL ISSUES
While some limitations of the LISA model appear to be flaws, many actually parallel and therefore illuminate certain limitations of human reasoning, including processing order and memory constraints.
--Processing Order and Coherence: The quality of the analogical mappings in LISA depends on the order in which the target/driver's proposition and subproposition branches are activated. Stronger mappings result when successive propositions tend to overlap in their objects and predicates, and when important propositions appear early and often. These are some of the same qualities which make written text coherent, suggesting that the human mind prefers the same types of sequences. In fact, LISA's default strategy is to select propositions in the order and frequency in which they appear in a text describing the target analog.
--Memory Constraints: In LISA, the set of structure and semantic units which are active at any particular moment can be considered "active memory" which intuitively corresponds to the system's "current thought". The number of structure trees which can be active at any moment (called the "phase set") can be modified, and this "phase set" determines the size of active memory. The greater the active memory, the more subpropositions which can be interrelated, and the more complex the analogy which can be extracted.
This seems to parallel the role of active memory in human reasoning.
--Representational Flexibility (N-ary constraint): Previous analogy-mapping models (like ACME & SME) were able to map N-ary predicates only onto other N-ary predicates In LISA, because source analogs emerge indirectly from the semantic distributed representations, there can be reasonable mappings between predicates with different numbers of arguments.
=====================
MY RESPONSE:
In many respects, LISA embodies an elegant unification of access and mapping. I find particularly appealing its treatment of mapping as an emergent correlation of activity rather than brute-force predicate matching.
LISA seems to benefit from many of the advantages of distributed representations while maintaining the ability to portray complex structures.
Nevertheless, perhaps because I do not fully understand the model, I am left with an uneasy concern:
1) The authors claim on p436: "...the structure units do not directly encode meaning. Rather, they work together to impose particular patterns of synchrony on the semantic units; it is only the latter which encode meaning."
I would argue: Where does the meaning in the semantic units come from, if not from the structure units? A set of semantic units mean "dog" only because they are attached to the Dog structure unit, and because some of them are also attached to the Animal, Bark, and Tail structure units.
Imagine an unanticipated set of semantic units which together suggest a viable concept (e.g. animal + small + cute +...) but which are not explicitly grouped by any structure unit. This concept could never take on a role in an analog without an explicit structural unit to babysit it! The "meaning" could never participate in the retrieval and mapping process. Based on this, I would say the semantic units have meaning only because of the structures.
And if meaning is introduced only through explicit structures, then no meanings, mappings, or analogies can emerge which have not been implicitly anticipated and loaded into the model.
2) This is really the same point seen from a different direction:
It is true that LISA's mappings are not limited to predicates with the same number of arguments. However, there are plenty of other intuitive correspondences which do not seem to be mappable. For example, can IsHungry(John) map onto Wants(Food, John)? Neither Wants nor Food alone shares much semantic similarity with IsHungry. but even if their meanings could be combined, how would Hungry map simultaneously onto both object and predicate?
It seems that for two potential analogs to map onto each other, their representations must be arranged in advance; Wants(Food,John) must be re-represented as Hungry(John), or the reverse. In this case, LISA's representations are already implicitly loaded with analogous structures! Therefore it is not surprising that LISA finds the analogies.
I believe, if I understand the model correctly, that LISA's dependency on predicate structures actually undermines most of the strength of the distributed semantic representations, and that it still suffers many of the same flaws of traditional symbolic systems.
=======================
Dan Bauer
Cognitive Science, UCSD
dsbauer@cogsci.ucsd.edu





Questions and Summary of Hofstadter's review of Mental Leaps

discussed by Misty Karin
Cognitive Science, UCSD

Summary of Douglas Hofstadter's review of
Holyoak & Thagard _Mental Leaps: Analogy in Creative Thought_ (in _AI Magazine_ fall 1995 issue) and its relation to LISA.

This critique by Hofstadter discusses the 1994 book _Mental Leaps_ and the analogy-making model, ACME, which is discussed in the book. His main complaint cuts right to the core of HolyoakUs treatment of analogy. For Hofstadter, the gist, or "essence," of a situation is the basis for its representation and is the connecting element in analogy. Extracting the gist of a situation is the key to making an analogy. Hofstadter pointed to a complete lack of focus on the essence of the situation in _Mental Leaps_ and charged that Holyoak and Thagard failed to recognize the gulf between human perception of a situation which has no hard and fast distinctions and the view of a situation as a set of predicate calculus formulas full of distinct objects, attributes, and relations.
Holyoak and Thagard emphasize the presence of a one-to-one correspondence between the elements of analogous situations. They also see causal relationships as a central anological connection. But this focus on causality overlooks the importance of the complex situational themes present in the gist. Also, causality is ubiquitous to all events - thus it is a meaningless tool for classification. Hofstadter instead points to a one-to-one correspondence at the level of the gist, not the elements, of analogous situations.
ACME was designed to make analogies by mapping formalized situations onto each other. It worked by mapping analogies based on the structural similarity of situational predicates. ACME did not have any semantic knowledge of the elements being mapped together. The information given to the model was a formalization of the gist already extracted and translated into meaningless symbols:
Saddam launched the Gulf War by invading Kuwait.
president-of(Saddam, Iraq) -> p(S, I)
invade(Iraq, Kuwait) -> i(I, K)
Hitler launched W.W.II by occupying Austria.
furher-of(Hitler, Germany) -> f(H, G)
occupy(Germany, Austria) -> o(G, A)
The gist of these situations is a countryUs leader acting to start a war. This gist is pre-extracted for ACME. All the model does is play a matching game with the symbolic elements it is fed.
With their new analogy-maker, LISA, Hummel and Holyoak use a connectionist hybrid network to incorporate semantics into their analogical model. The architecture of LISA is described in Dan's summary, so I will not duplicate the detailed discription here. The major improvement over ACME is that predicate elements are now connected to semantic units which supply a rudimentary meaning to the proposition. A pair of analogous propositions will share many semantic units, and infrences can be made about one proposition from the semantic relationship with its analog.
Hummel and Holyoak claim that LISA solves the problem of maintaining structure in a distributed representation, but they fail to address the major criticism posed by Hofstadter in his critique of the earlier model. LISA still uses formalized predicates, and still focuses on correspondences between propositional elements. Hofstadter would claim thatthey are still modeling a theory of analogy-making which ignores both gist extraction and concept formation.
Questions for clarification:
There is an example of analogical inference given:
Analog 1: father(Abe, Bill) Analog 2: father(Adam, Bob)
brother(Charles, Abe) brother(Cary, Adam)
uncle(Charles, Bill)
LISA was able to infer the predicate uncle(Cary, Bob). Is the model flexible enough to make the inference if the predicate arguments are not laid out so neatly for the system?
What if analog 2 were given as father (Adam, Bob); Brother (Adam, Cary)?
What if the propositions for analog 2 were given as mother, sister,
aunt? This is a simple analogy for us, but how would LISA handle it?
Hummel & Holyoak (1996) state that "analogical reasoning plays an important role in schema induction." Yet the schema is basically the same as the gist which Hofstadter holds as the basis for analogy. Could you address this conflict between your views of analogy?

| Syllabus | Speakers | Article Summaries | xxxxx | xxxx |


Contact: Gilles Fauconnier
Web Design: Omar Alhassoon