Software agents have even more of a credibility problem than human nonacquaintances do. In addition to overcoming the problems of unfamiliarity and general technophobia, users must establish the extent of an agent's capabilities and intelligence before conducting business with it. While this is not as much of an issue for entertainment applications, it is important for any information dissemination or advice-giving tasks and crucial in sales tasks.
One of the primary purposes of rapport-building in general is to establish that our interlocutor is, in some way, "just like us," that is, shares our basic values and beliefs, as well as sharing the ability to communicate. One way for a software agent to overcome the credibility problem is to embody the agent in the interface, that is, provide it with a human appearance and lifelike behavior. The Rea project at the MIT Media Lab (Cassell, et al, 1999) has as its goal the construction of an embodied, multi-modal real-time conversational interface agent. Rea implements the social, linguistic, and psychological conventions of conversation to make interactions as natural as face-to-face conversation with another person. Rea's task domain is real estate sales, a domain intentionally selected to provide opportunities for both task-oriented and social conversational frames of interaction. Within a typical sales encounter, it is important for Rea to engage in some amount of small talk to build rapport with the user before going into the "sales" conversational frame. The rapport may convince users that Rea does in fact understand their communicative behaviors, as well as convincing them that she knows what she is talking about in the real estate domain.
A second mechanism for establishing credibility and rapport is small talk. The effectiveness of small talk on the user's perception of a software agent has been demonstrated in text-only "chatterbot" systems such as ELIZA (Weizenbaum, 1966). There are many documented cases in which users have established on-going relationships with such systems, even when they knew that the system did not understand what they were typing (Turkle, 1995). Thus, a software agent that is both embodied and capable of small talk will have a much better chance of establishing a rapport with a user than a software agent that has neither of these features. This rapport, in turn, may actually allow the system to function more effectively, by enabling the user to rely on his/her natural social and linguistic skills.
A real-estate sales situation is a unique type of service encounter, in which the customer may feel uncomfortable (wanting to avoid commitment, i.e., threats to negative face) and the salesperson is not only trying to make the customer feel at ease for the purpose of information gathering but also to attempt to develop a long-term relationship with them. The basic questions that need to be addressed by a computational model of small talk in this context are:
Schneider (1988) did the first extensive study on the genre of small talk, at least as it occurs in several common situations involving chance meeting of strangers. His review of the literature provided two basic functions for small talk; one addresses positive face needs (fulfilling a socialization function, as in Malinowski's phatic communion) and one which addressed negative face needs (avoiding the embarassment or "danger" of silence with a stranger). Schneider transcribed over 30 conversations and performed a detailed characterization of small talk conversations. The typical small talk conversation consists of a series of conversational segments, each with the following sequence of moves:
That casual conversation should be used to build rapport is also well documented in the popular press (e.g., RoAne, 1997), and literature in sociobiology (e.g., Dunbar, 1996) and social psychology (e.g., Laurenceau JP, Barrett LF, Pietromonaco, 1998). Equilibrium models of intimacy in social psychology are particularly interesting in that they account for the fact that individuals seek to achieve a fixed level of intimacy in a given interaction, and if this equilibrium is disturbed (e.g., by topic change in the conversation) they will seek to restore the overall balance of intimacy through other channels (e.g., eye contact, smiling, and physical proximity).
More recent chatterbot systems use various indices in addition to pattern matching to determine the system's response. For example, the system described in (Mauldin, 1994) used a current topic index which is updated based on an activation network. Rousseau and Hayes-Roth (1997), describe a system in which response indices include the character's affective state.
Chatterbot systems also utilize several "tricks" to give the illusion of intelligence and fluency (Mauldin, 1994), including:
To solve the problem of when the agent should transition into and out of small talk, I introduce a scalar measure of interpersonal closeness (IPC), which starts at zero (representing complete strangers) and increases with the perceived closeness between the agent and the customer. Certain communicative speech acts (such as asking about personal information) can have a minimum required IPC specified as part of their felicity conditions, and thus can only be executed when the customer and agent have achieved some level of familiarity with each other.
I also introduce small talk as a discrete frame of interaction (in addition to task, greet, and farewell frames). Within the small talk frame, IPC is gradually increased the longer the interlocutors engage in small talk. Task-oriented request speech acts can only be executed within the task frame. Given this framework, a dialog planner can plan to engage in small talk in order to reach its task-oriented goals, and can interleave task talk and small talk as needed to reach its goals as efficiently as possible.
The question of how dialog should be organized within the small talk frame is provided by Schneider's characterization of small talk segments and moves. The agent normally issues the initial query in each segment, allows the user to respond, provides evaluative feedback, and the performs a random amount of idling behavior.
Mixed initiative interaction is provided for by allowing the customer to ask the initiating query within the small talk frame, and transitioning immediately to the task frame if they introduce a task-oriented topic.
Interaction begins in the greet frame then automatically transitions into the small talk frame (per ritual convention). Interaction proceeds in the small talk frame until IPC has been increased to the point at which task-oriented speech acts can be executed or the customer mentions a task topic, at which time the task frame is started. Within the task frame, task-oriented speech acts are executed until either the agent's goals are satisfied, causing a transition to the farewell frame, or no more speech acts can be performed at the current IPC level, causing a transition back into the small talk frame.
Within the small talk frame, the dialog is conducted with segments and moves modeled according to Schneider. The agent normally starts a new segment and performs the odd-numbered moves, but the user is also free to do so, and the agent will perform the even-numbered moves. As each segment is concluded IPC is incremented. ELIZA-style pattern-response pairs are used to generate evaluative (third move) and query (second move) responses.
An example of the user taking initiative in the small talk frame is shown in Attachment 2, and an example of the user causing an immediate switch into the task frame through introduction of a task topic is shown in Attachment 3.
Integrating small talk functionality into an embodied conversational character provides numerous challenges and a fertile area of research. Some of the more difficult and interesting problems include:
Cassell, J., Bickmore, T., Billinghurst, M., Campbell, L., Chang, K., Vilhjálmsson, H., Yan, H. (1999). "Embodiment in Conversational Interfaces: Rea", to appear in CHI99, Pittsburgh, PA.
Coupland, J. and Coupland, N. (1992). "How are you?: Negotiating phatic communion." Language in Society 21, pp. 207-230.
Dunbar, R. (1996). Grooming, Gossip, and the Evolution of Language. Harvard University Press.
Garner, A. (1980). Conversationally Speaking. Lowell House, Los Angeles.
Goffman, E. (1983). Forms of Talk. University of Pennsylvania Press.
Jakobson, R. (1960). "Linguistics and poetics." In Sebeok 1960. Pp. 350-77.
Jefferson, G. (1978). "Sequential Aspects of Storytelling in Conversation". Studies in the Organization of Conversational Interaction, pp. 219-248. Academic Press.
Laurenceau J., Barrett L., and Pietromonaco P. (1998). "Intimacy as an interpersonal process: The importance of self-disclosure, partner disclosure, and perceived partner responsiveness in interpersonal exchanges." Journal of Personality and Social Psychology, 74: (5) 1238-1251.
Malinowski, B. (1923). "The problem of meaning in primitive languages." Supplement to C. Ogden and I. Richards The meaning of meaning. London: Routledge and Kegan Paul. Pp. 146-152.
Mauldin, M. (1994). "Chatterbots, Tinymuds, and the Turing Test: Entering the Loebner Prize Competition. Proceedings of AAAI 94.
Polanyi, L. (1985). Telling the American Story: A Structural and Cultural Analysis of Conversational Storytelling. Ablex Publishers, Norwood, New Jersey.
Reeves, B. and Nass, C. (1996). The Media Equation: How People Treat Computers, Television, and New Media Like Real People and Places. Cambridge University Press.
RoAne, S. (1997). What do I Say Next? Talking your way to business and social success. Warner Books, New York.
Schneider, K. (1988). Small Talk: Analysing Phatic Discourse. PhD Thesis, Philipps-Universitat Marburg, West Germany.
Tannen, D. (1984). Conversational Style: Analyzing talk among friends. Ablex Publishing, Norwood, New Jersey.
Turkle, S. (1995). Life on the Screen: Identity in the Age of the Internet. Touchstone, New York.
Weizenbaum, J. (1966). Eliza -a computer program for the study of natural
language communication between man and machine. Communications of the ACM,
9, 26-45.
--------------- initiate Frame ---------------
R: Hello.
U: Hi there.
--------------- smalltalk Frame ---------------
R: Isn't this a great neighborhood?
U: I think it's wonderful.
R: you think it s wonderful?
U: yes.
R: Isn't this living room fabulous?
U: I think it's pretty.
R: you think it s pretty?
U: yep.
R: yep?
--------------- task Frame ---------------
R: Is this home in the right location for you?
U: Yes.
R: Do you like this style of home?
U: Yes, it's nice.
R: Is this home big enough for you?
U: Yes, I think so.
--------------- smalltalk Frame ---------------
R: Where are you living now?
U: Cambridge.
R: cambridge?
U: Not far.
R: How big of a family do you have?
U: Just myself.
R: just yourself?
U: yes.
R: Do you have little ones on the way?
U: nope.
R: nope?
U: no.
--------------- task Frame ---------------
R: Do you have the down payment?
U: yes
R: Do you have any income?
U: yes, sufficient.
--------------- smalltalk Frame ---------------
R: Boston is great place to live, don't you think?
U: It's ok.
R: it s ok?
U: Yea.
R: How about this weather?
U: Wonderful.
R: wonderful?
U: Yes.
R: yes?
--------------- task Frame ---------------
R: Do you want to buy this home?
U: Yes.
** Game over. Rea won. **
--------------- farewell Frame ---------------
R: Goodbye.
--------------- smalltalk Frame ---------------
R: How about this weather?
U: It's gorgeous.
R: it s gorgeous?
U: Don't you think it's a wonderful Spring?
R: Maybe.
U: Yea?
R: yea.
U: I just love the flowers, don't you?
R: Yes.
U: Yea?
R: yea.
U: : Hmm.
R: Boston is great place to live, don't you think?
U: Yes, it is.
R: yes it is?
U: Yea.
R: yea?
R: How big of a family do you have?
U: Just me.
R: just you?
U: yes.
--------------- task Frame ---------------
--------------- initiate Frame ---------------
R: Hello.
U: hi
--------------- smalltalk Frame ---------------
R: Isn't this a great neighborhood?
U: Yes, it's great.
R: yes it s great?
U: I'm looking for a house with a pool.
--------------- task Frame ---------------
…