M. Eng. Thesis Proposal

Push Singh

February 1997

For my masters thesis, I propose to write a program that can do limited planning in common sense domains. My approach to building the system will be drawn from Marvin Minsky’s "Society of Mind" theory of intelligence (Minsky, 85). I will develop Minsky’s paranome idea, a scheme where a diversity of specialized problem solving mechanisms, or "agencies", operate on parallel arrays of frames. This approach is motivated by the apparent diversity of common sense domains, which suggests that no planner with a single representation and problem solving strategy can efficiently solve problems in all of these realms. Accepting that to be true, systems that plan in common sense domains must use a diversity of mechanisms.   The paranome idea lets us bring multiple viewpoints to bear on a single problem, as it allows for different agencies to focus on the same thing in different ways, where a "thing" could be an object, an event, a difference, a cause – really anything at all. If a particular agency gets stuck while solving a problem, it can call for help from other agencies. These other agencies can look at the stuck agency’s problem in their own way, and perhaps see a solution that was not visible from the stuck agency’s restricted perspective.

I plan to explore these ideas by building three agencies and coordinating them to solve several planning problems in the context of a simple children’s story about a birthday party. I plan to use the Cyc knowledgebase to provide the background knowledge necessary to solve these problems. Cyc’s knowledge is organized into microtheories, consistent collections of knowledge related to some area of concern, like "people" or "space". These different microtheories can provide the background knowledge for different agencies. I will compare my multi-agency planner with current single agency planners.

What’s wrong with current planners?

The main inadequacy of current planners with respect to solving problems in common sense domains is that they almost all use a single representation scheme. There are two general categories of planners, rule-based and case-based (see footnote 1). Rule-based planners operate by searching through the space of possible plans, using control rules and evaluation functions to decide which actions to consider next during search. Case-based planners operate by matching and adapting existing plans to the current problem. But in either case, the representation of the problem is chosen in advance, and fixed throughout the planning process. A planner must choose a representation language that lets it express the differences between its current situation and its goals, such that those differences can be reduced by available actions. Given the enormous range of situations and goals, and hence differences, that arise in common sense problem solving, a common sense planner must have an extremely rich and diverse representation language. The language of Cyc, the only existing system that can express a wide range of common sense knowledge, has many thousands of terms.

The trouble begins when the planner must choose how to describe the current situation or difference in its representation language. Initially it will describe the situation in terms of those aspects that differ from its goals, but if at some point during the planning process it encounters a problem for which no solution is at hand, it must resort to searching an enormous space of possible descriptions to find a viewpoint that renders the solution obvious. Consider that even in the simplest situations the range of possible descriptions is enormous. In a typical act of giving, such as "Mary gave John a present", I can focus on Mary’s hair color, John’s appreciation of the present, the physical motion of the present as it changed hands, the transfer in ownership of the present, and a vast range of other possibilities. Current planners get around this problem either by expressing everything about the situation in a single undifferentiated union of possible descriptions, which can be done if the problem is simple enough, or they choose a particular description building strategy that restricts the descriptions to a certain narrow type. Clearly, neither of these solutions is adequate for common sense planning.

How can we add multiple representations?

In the multiple agency system I will build, each agency sees the world through the filter of a particular limited understanding of reality. One agency may know how to solve problems in social realms, where things are described in terms of who-knows-who and hierarchies of power; another may knows about physical realms, where things are described in terms of their shape, substance, and strength; another about economic realms, where things are described in terms of their value, their expected change in value as time passes, etc. What are the benefits of this approach, as compared to single agency systems?

Footnote 1. I do not include connectionist approaches to planning because they are typically variants of rule-based or case-based planners that either use activation spreading instead of systematic search, or use threshold units in matching instead of symbolic structures.

Push Singh
push@mit.edu


Some Self-Criticisms

To someone who is trying to develop an idea, nothing is more valuable than a set of well-stated criticisms. Here are some that I came up with while working on these ideas (some are a bit "straw man".)

Push Singh
push@mit.edu