Preventing False Inferences 1
Aravind Joshi and Bonnie Webber
Department of Computer and Information Science
Moore School/D2
University of Pennsylvania
Philadelphia PA 19104
Ralph M. Weischedel 2
Department of Computer & Information Sciences
University of Delaware
Newark DE 19716
I Introduction
In cooperative man-machine interaction, it is taken as necessary that a system truthfully and informatively respond to a user's question. It is not, however, sufficient. In particular, if the system has reason to believe that its planned response might lead the user to draw an inference that it knows to be false, then it must block it by modifying or adding to its response. The problem is that a system neither can nor should explore all conclusions a user might possibly draw: its reasoning must be constrained in some systematic and well-motivated way.
Such cooperative behavior was investigated in [5], in which a modification of Grice's Maxim of Quality is proposed:

Grice's Maxim of Quality:
Do not say what you believe to be false or for which you lack adequate evidence.

Joshi's Revised Maxim of Quality:
If you, the speaker, plan to say anything which may imply for the hearer something that you believe to be false, then provide further information to block it.
This behavior was studied in the context of interpreting certain definite noun phrases. In this paper, we investigate this revised principle as applied to question answering. In particular, the goals of the research described here are to:

1. characterize tractable cases in which the system as respondent (R) can anticipate the possibility of the user/questioner (Q) drawing false conclusions from its response and can hence alter or expand its response so as to prevent it happening;

2. develop a formal method for computing the projected inferences that Q may draw from a particular response, identifying those factors whose presence or absence catalyzes the inferences;

3. enable the system to generate modifications of its response that can defuse possible false inferences and that may provide additional useful information as well.

1This work is partially supported by NSF Grants MCS 81-07290, MCS 83-05221, and IST 83-11,100.

2At present visiting the Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104.
Before we begin, it is important to see how this work differs from our related work on responding when the system notices a discrepancy between its beliefs and those of its user [7, 8, 9, 18]. For example, if a user asks "How many French students failed CSE121 last term?", he shows that he believes inter alia that the set of French students is non-empty, that there is a course CSE121, and that it was given last term. If the system simply answers "None", he will assume the system concurs with these beliefs, since the answer is consistent with them. Furthermore, he may conclude that French students do rather well in a difficult course. But this may be a false conclusion if the system doesn't hold to all of those beliefs (e.g., it doesn't know of any French students). Thus while the system's assertion "No French students failed CSE121 last term" is true, it has misled the user (1) into believing it concurs with the user's beliefs and (2) into drawing additional false conclusions from its response.3 The differences between this related work and the current enterprise are that:
1. It is not assumed in the current enterprise that there is any overt indication that the domain beliefs of the user are in any way at odds with those of the system.

2. In our related work, the user draws a false conclusion from what is said because the presuppositions of the response are not in accord with the system's beliefs (following a nice analysis in [10]). In the current enterprise, the user draws a false conclusion from what is said because the system's response behavior is not in accord with the user's expectations. It may or may not also involve false domain beliefs that the system attributes to the user.

3It is a feature of Kaplan's CO-OP system [7] that it points out the discrepancy by saying "I don't know of any French students".
In this paper, we describe two kinds of false conclusions we are attempting to block by modifying an otherwise true response:

- false conclusions drawn by standard default reasoning, i.e., by the user/listener concluding (incorrectly) that there is nothing special about this case;

- false conclusions drawn in a task-oriented context on the basis of the user's expectations about the way a cooperative expert will respond.
In Section II, we discuss examples of the first type, where the respondent (R) can reason that the questioner (Q) may inappropriately apply a default rule to the (true) information conveyed in R's response and hence draw a false conclusion. We characterize appropriate information for R to include in his response to block it. In Section III, we describe examples of the second type. Finally, in Section IV, we discuss our claim regarding the primary constraint posed here on limiting R's responsibilities with respect to anticipating false conclusions that Q may draw from its response: that is, it is only that part of R's knowledge base that is already in focus (given the interaction up to that point, including R's formulating a direct answer to Q's query) that will be involved in anticipating the conclusions that Q may draw from R's response.
II Blocking Potential Misapplication of Default Rules
Default reasoning is usually studied in the context of a
logical system in its own right or an agent who reasons
about the world from partial information and hence
may draw conclusions unsupported by traditional logic.
However, one can also look at it in the context of
interacting agents. An agent's reasoning depends not
only on his perceptions of the world but also on the
information he receives in interacting with other agents.
This information is partial, in that another agent
neither will nor can make everything explicit. Knowing
this, the first agent (Q) will seek to derive information
implicit in the interaction, in part by contrasting what
the other agent (R) has made explicit with what Q
assumes would have been made explicit, were something
else the case. Because of this, R must be careful to
forestall inappropriate derivations that Q might draw.
The question is on what basis R should reason that Q may assume some piece of information (P) would have been made explicit in the interaction, were it the case. One basis, we contend, is the likelihood that Q will apply some standard default rule of the type discussed by Reiter [15] if R doesn't make it explicit that the rule is not applicable. Reiter introduced the idea of default rules in the stand-alone context of an agent or logical system filling in its own partial information. Most standard default rules embody the sense that "given no reason to suspect otherwise, there's nothing special about the current case". For example, for a bird what would be special is that it can't fly - i.e., "Most birds fly". Knowing only that Tweety is a bird and no reason to suspect otherwise, an agent may conclude by default that there's nothing special about Tweety and so he can fly.
This kind of default reasoning can lead to false conclusions in a stand-alone situation, but also in an interaction. That is, in a question-answer interaction, if the respondent (R) has reason for knowing or suspecting that the situation goes counter to the standard default, it seems to be common practice to convey this information to the questioner (Q), to block his potentially assuming the default. To see this, consider the following two examples. (The first is very much like the "Tweety" case above, while the second seems more general.)
A. Example 1
Suppose it's the case that most associate professors are tenured and most of them have Ph.D.s. Consider the following interchange:

Q: Is Sam an associate professor?
R: Yes, but he doesn't have tenure.
There are two things to account for here: (1) given the information was not requested, why did R include the "but" clause, and (2) why this clause and not another one? We claim that the answer to the second question has to do with that part of R's knowledge base that is currently in focus. This we discuss more in Section IV. In the meantime, we will just refer to this subset as RBc.
Assume RBc contains at least the following information:

(a) Sam is an associate professor.
(b) Most associate professors are tenured.
(c) Sam is not tenured.

(b) may be in RBc because the question of tenure may be in context. Based on RBc, R's direct response is clearly "Yes". This direct response, however, could lead Q to conclude falsely, by default reasoning, that Sam is tenured. That is, R can reason that, given just (b) and his planned response "Yes" (i.e., if (c) is not in Q's knowledge base), Q could infer by default reasoning that Sam is tenured, which R knows with respect to RBc is false. Hence, R will modify that planned response to block this false inference, as in the response above.
In general, we can represent R's reasoning about Q's reaction to a simple direct response "Yes, B(c)", given Q believes "Most Bs F", in terms of the following default schema, using the notation introduced in [15]:

    told(R,Q,B(c)) & (Most x)[B(x) ⊃ F(x)] & ¬told(R,Q,¬F(c)) : M(F(c))
    --------------------------------------------------------------------
                                   F(c)
As in Reiter's discussion, "M(P)" means it is consistent to assume that P. In the associate professor example, B corresponds to the predicate "is an associate professor", F to the predicate "has tenure", and c to Sam. Using such an instantiated rule schema, R will recognize that Q is likely to conclude F(c) - "Sam has tenure" - which is false with respect to RBc (and hence, with respect to all of R's knowledge base). Thus R will modify his direct response so as to block this false conclusion.
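To make the use of the schema concrete, here is a minimal sketch in Python of how R might apply an instantiated version of it to a planned direct answer. It is only illustrative: the representation of RBc as a set of tuples, the told set, and the predicate names are assumptions of the sketch, not part of the formalism of [15].

    # Sketch only. RBc is modeled as a set of ground facts and generalizations:
    #   (B, c)            -- "c is a B", e.g. ("assoc_prof", "sam")
    #   ("not", (F, c))   -- "c is not F", e.g. ("not", ("tenured", "sam"))
    #   ("most", B, F)    -- "Most Bs F", attributed to Q as well as to R

    def q_concludes_by_default(told, rbc, c, B, F):
        """The instantiated schema: told(R,Q,B(c)) & (Most x)[B(x) > F(x)]
        & not told(R,Q,~F(c)) : M(F(c)) / F(c).
        The consistency condition M(F(c)) is approximated here by Q's not
        having been told ~F(c)."""
        return ((B, c) in told
                and ("most", B, F) in rbc
                and ("not", (F, c)) not in told)

    def respond_is_c_a_B(rbc, c, B, F):
        """Answer "Is c a B?", blocking the default conclusion F(c) when RBc
        records that F(c) is actually false."""
        if (B, c) not in rbc:
            return "No."
        told = {(B, c)}                          # the planned direct answer "Yes"
        if ("not", (F, c)) in rbc and q_concludes_by_default(told, rbc, c, B, F):
            return "Yes, but %s is not %s." % (c, F)
        return "Yes."

    RBc = {("assoc_prof", "sam"),
           ("most", "assoc_prof", "tenured"),
           ("not", ("tenured", "sam"))}
    print(respond_is_c_a_B(RBc, "sam", "assoc_prof", "tenured"))
    # prints: Yes, but sam is not tenured.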
B. Example 2
Consider a user of one of the mail systems on the DEC-20. To exit from this system, a user who has finished reading all the messages he earlier specified can just type a carriage return. To exit under other circumstances, the user must type QUIT. Consider the following interchange between a new user who has finished reading all his messages and either a mail system expert or the mail system itself.

Q: How do I get out of mail?
R: Since you have read all your specified messages, you can just type a carriage return. In all cases, you can get out by typing QUIT.

Here the problem is to account for all that part of R's response beyond the simple truthful statement "You can type a carriage return."
A general statement of this problem is as follows: Agent Q is in one situation (S1) and wants to be in another (S2). There is a general procedure P for achieving S2 from any of several situations including S1. There is a special procedure P* (i.e., shorter, faster, simpler, etc.) for achieving S2 from S1. Q doesn't know how to achieve S2, but R does (including procedures P and P*). Q asks R how to achieve S2.

If R knows that Q is in situation S1 and truthfully responds to Q's request by simply telling him P*, Q may falsely conclude that P* is a general procedure for achieving S2. That is, as in the Tweety and Sam examples, if Q has no reason to suspect anything special about S1 (such that P* only applies to it), then there is nothing special about it. Therefore P* is adequate for achieving S2, whatever situation Q is in.4 Later, when Q tries to apply P* in a different situation to achieve S2, he may find that it doesn't work. As a particular example of this, consider the mail case again. In this case,

S1 = Q has read all his messages
S2 = Q is out of the mail system
P  = typing QUIT
P* = typing a carriage return
Assume RBc contains at least the following information:

(a) S1
(b) want(Q, S2)
(c) ∀s ∈ Σ. P(s) = S2
(d) P*(S1) = S2
(e) S1 ∈ Σ
(f) simpler(P*, P)
(g) ∀s ∈ Σ. ¬(s = S1) ⇒ ¬(P*(s) = S2)

where Σ is some set of states which includes S1 and P(s) indicates action P applied to state s.
Based on RBc, R's direct response would be "You can exit the mail system by typing carriage return". (It is assumed that an expert will always respond with the "best" procedure according to some metric, unless he explicitly indicates otherwise - cf. Section III, case 2.) However, this could lead Q to conclude falsely, by default, something along the lines of ∀s. P*(s) = S2 (see footnote 5). Thus R will modify his planned response to call attention to S1 (in particular, how to recognize it) and the limited applicability of P* to S1 alone. The other modification to R's response ("In all cases, you can get out by typing QUIT"), we would ascribe simply to R's adhering to Grice's Maxim of Quantity - "Make your contribution as informative as is required for the current purposes of the exchange" - given R's assumption of what is required of him in his role as expert/teacher.
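The reasoning just described can also be sketched procedurally. The following Python fragment is only a schematic rendering of the mail example under the RBc given above; the state and procedure names, the cost field standing in for "simpler", and the response wording are assumptions of the sketch.

    # Sketch only. Each procedure records the states it is believed to work from
    # (clause (g) above is encoded by the carriage return working only from S1)
    # and a cost standing in for the "simpler" ordering in clause (f).

    SIGMA = {"S1", "S_other"}                 # states inside the mail system

    PROCEDURES = {
        "carriage return": {"works_from": {"S1"}, "cost": 1},   # P*
        "QUIT":            {"works_from": SIGMA,  "cost": 2},   # P
    }

    def how_do_i_get_out(current_state):
        applicable = [p for p, d in PROCEDURES.items()
                      if current_state in d["works_from"]]
        best = min(applicable, key=lambda p: PROCEDURES[p]["cost"])
        if PROCEDURES[best]["works_from"] == SIGMA:
            return "You can type %s." % best
        # best works only in the current state: flag the precondition and add
        # the general procedure, blocking the default "P* works everywhere".
        general = min((p for p, d in PROCEDURES.items() if d["works_from"] == SIGMA),
                      key=lambda p: PROCEDURES[p]["cost"])
        return ("Since you are in %s, you can just type a %s. "
                "In all cases, you can get out by typing %s."
                % (current_state, best, general))

    print(how_do_i_get_out("S1"))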
III Blocking False Conclusions in Expert Interactions
The situations we are concerned with here are ones in which the system is explicitly tasked with providing help and expertise to the user. In such circumstances, the user has a strong expectation that the system has both the experience and motivation to provide the most appropriate help towards achieving the user's goals. The user does not expect behavior like:

Q: How can I get to Camden?
R: You can't.
As many studies have shown [1], what an advice seeker (Q) expects is that an expert (R) will attempt to recognize what plan Q is attempting to follow in pursuit of what goal, and respond to Q's question accordingly. Further studies [11, 12, 13] show that Q may also expect that R will respond in terms of a better plan if the recognized one is either sub-optimal or unsuitable for attaining Q's perceived goal. Thus, because of this principle of "expert cooperative behavior", Q may expect a response to a more general question than the one he has actually asked. That is, in asking an expert "How do I do X?" or "Can I do X?", Q is anticipating a response to "How can I achieve my goal?"

4Moreover, if Q (falsely) believes that R doesn't know Q is in S1, Q will certainly assume that P* is a general procedure. However, this isn't necessary to the default reasoning behavior we are investigating.

5Clearly, this is only for some subset of states, ones corresponding to being in the mail system.
Consider a student (Q) asking the following question, near the end of the term.

Q: Can I drop CIS 577?

Since it is already too late to drop a course, the only direct answer the expert (R) can give is "No". Of course, part of an expert's knowledge concerns the typical states users get into and the possible actions that permit transitions between them. Moreover, it is also part of this expertise to infer such states from the current state of the interaction, Q's query, some shared knowledge of Q's goals and expectations, and the shared assumption that an expert is expected to attend to these higher goals. How the system should go about inferring these states is a difficult task that others are examining [2, 12, 13]. We assume that such an inference has been made. We also assume for simplicity that the states are uniquely determined. For example, we assume that the system has inferred that Q is in state Sb (student is doing badly in the course) and wants to be in a state Sg (student is in a position to do better in this course or another one later), and that the action α (dropping the course) will take him from Sb to Sg.
Given this, the direct response "No" may lead Q to draw some conclusions that R knows to be false. For example, R can reason that, since a principle of cooperative behavior for an expert is to tell Q the best way to go from Sb to Sg, Q is likely to conclude from R's response that there is no way to go from Sb to Sg. This conclusion, however, would be false if R knows some other ways of going from Sb to Sg. To avoid potentially misleading Q, R must provide additional information, such as

R: No, but you can take an incomplete and ask for more time to finish the work.
As we noted earlier, an important question is how much reasoning R should do to block false conclusions on Q's part. Again, we assume that R should only concern itself with those false conclusions that Q is likely to draw that involve that part of R's knowledge base currently in focus (RBc), including of course that subset it needs in order to answer the query in the first place.

We will make this a little more precise by considering several cases corresponding to the different states of R's knowledge base with respect to Sb, Sg, and transitions between them. For convenience, we will give an appropriate response in terms of Sb, Sg and the actions. Clearly, it should be given in terms of descriptions of states and actions understandable to Q. (Moreover, by making further assumptions about Q's beliefs, R may be able to validly trim some of its response.)
1. Suppose that it is possible to go from Sb to Sg by dropping the course and that this is the only action that will take one from Sb to Sg. In this case, the response is

R: Yes. α is the only action that will take you from Sb to Sg.

2. Suppose that in addition to going from Sb to Sg by dropping the course, there is a better way, say β, of doing so.6 In this case, the response is

R: Yes, but there is a better action β that will take you from Sb to Sg.

3. Suppose that dropping the course does not take you from Sb to Sg, but another action β will. This is the situation we considered in our earlier discussion. In this case, the response is

R: No, but there is an action β that will take you from Sb to Sg.

4. Suppose that there is no action that will take one from Sb to Sg. In this case, the response is

R: No. There is no action that will take you from Sb to Sg.

Of course, other situations are possible. The point, however, is that the additional information that R provides to prevent Q from drawing false conclusions is limited to just that part of R's knowledge base that R is focussed on in answering Q's query.

6"Betterness" is yet another area for future research.
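Under the assumption that RBc records which actions R believes effect which state transitions, the choice among the four response patterns above can be sketched mechanically. The Python below is illustrative only; the transition table, the action names, and the use of a single "better" alternative (sidestepping the open question of "betterness" noted in the footnote) are assumptions of the sketch.

    # Sketch only: pick one of the four response patterns of this section, given
    # the action Q asked about and the Sb-to-Sg transitions recorded in RBc.

    def expert_answer(queried_action, Sb, Sg, transitions):
        """transitions: dict mapping an action name to the (from, to) pair of
        states that R believes the action effects."""
        achieves = sorted(a for a, (frm, to) in transitions.items()
                          if frm == Sb and to == Sg)
        others = [a for a in achieves if a != queried_action]
        if queried_action in achieves and not others:
            return "Yes. %s is the only action that will take you from %s to %s." % (
                queried_action, Sb, Sg)
        if queried_action in achieves:
            # Assumes the alternative is "better"; ranking alternatives is the
            # open "betterness" question.
            return "Yes, but there is a better action %s that will take you from %s to %s." % (
                others[0], Sb, Sg)
        if others:
            return "No, but there is an action %s that will take you from %s to %s." % (
                others[0], Sb, Sg)
        return "No. There is no action that will take you from %s to %s." % (Sb, Sg)

    # Case 3: dropping the course no longer works, but taking an incomplete does.
    print(expert_answer("drop the course", "Sb", "Sg",
                        {"take an incomplete": ("Sb", "Sg")}))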
IV Constraining the Respondent's Obligations
As many people have observed - from studies across a range of linguistic phenomena, including co-referring expressions [3, 4, 16], left dislocations [14], epitomization [17], etc. - a speaker (R) normally focuses on a particular part of its knowledge base. What he focuses on depends in part on (1) context, (2) R's partial knowledge of Q's overall goals, as well as what Q knows already as a result of the interaction up to that point, and (3) Q's particular query, etc. The precise nature of how these various factors affect focusing is complex and is receiving much attention [3, 4, 16]. However, no matter how these various factors contribute to focusing, we can certainly assume that R comes to focus on a subset of its knowledge base in order to provide a direct answer to Q's query (at some level of interpretation). Let us call this subset RBc for "R's current beliefs". Our claim is that one important constraint on cooperative behavior is that it is determined by RBc only. Clearly the information needed for a direct response is contained in RBc, as is the information needed for many types of helpful responses. In other words, RBc - that part of R's knowledge base that R decides to focus on in order to give a direct response to Q's query - also has the information needed to generate several classes of helpful responses. The simplest case is presupposition failure [7], as in the following

Q: How many A's were given in CIS 500?

where Q presumes that CIS 500 was offered. In trying to formulate a direct response, R will have to ascertain that CIS 500 was offered. If it was (Q's presumption is true), then R can go ahead and give a direct response. If not, then R can indicate that CIS 500 was not offered and thereby avoid misleading Q. All of this is straightforward. The point here is that the information needed to provide this extra response is already there in that part of R's knowledge base which R had to look up anyway in order to try to give the direct response.
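As a small illustration, the look-up that produces the direct answer can double as the presupposition check. In the Python sketch below, the relation names ("offered", "grades") and the shape of RBc are invented for the example.

    # Sketch only: the course-offering look-up needed for the direct answer is
    # exactly what detects the failed presupposition, so the corrective response
    # requires no information outside RBc.

    def how_many_As(rbc, course, term):
        if (course, term) not in rbc.get("offered", set()):
            return "%s was not offered %s." % (course, term)   # correct the presumption
        n = sum(1 for (c, t, grade) in rbc.get("grades", set())
                if c == course and t == term and grade == "A")
        return "%d." % n

    RBc = {"offered": {("CIS 500", "last term")},
           "grades": {("CIS 500", "last term", "A"),
                      ("CIS 500", "last term", "B")}}
    print(how_many_As(RBc, "CIS 500", "last term"))   # prints: 1.
    print(how_many_As(RBc, "CIS 999", "last term"))   # prints: CIS 999 was not offered last term.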
In the above example, it is clear how the response can be localized to RBc. We would like to claim that this approach has a wider applicability: that RBc alone is the basis for responses that anticipate and attempt to block interactional defaults as well. Since RBc contains the information for a direct response, R can plan one (r). From r, R can reason whether it is possible for Q to infer some conclusion (g) which R knows to be false because ¬g is in RBc. If so, then R should modify r so as to eliminate this possibility. The point is that the only false inferences that R will attempt to block are those whose falsity can be checked in RBc. There may be other false inferences that Q may draw, whose falsity cannot be determined solely with respect to RBc (although it might be possible with respect to R's entire knowledge base). While intuitively this may not seem enough of a constraint on the amount of anticipatory reasoning that Joshi's revised maxim imposes on R, it does constrain things a lot by only considering a (relatively small) subset of the knowledge base. Factors such as context may further delimit R's responses, but they will all be relative to RBc.
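In outline, the constraint amounts to the following loop, sketched here in Python. How candidate conclusions are projected from r is exactly the question the default schemata of Sections II and III address; project_inferences, negate, and blocking_clause are therefore left as parameters and are assumptions of the sketch rather than parts of a specified algorithm.

    # Sketch only: block just those projected conclusions whose negation is
    # already present in RBc; everything else is left unexamined, even if R's
    # full knowledge base could refute it.

    def cooperative_response(r, rbc, project_inferences, negate, blocking_clause):
        parts = [r]                                   # r: the planned direct response
        for g in project_inferences(r, rbc):          # conclusions Q is likely to draw
            if negate(g) in rbc:                      # falsity checkable within RBc
                parts.append(blocking_clause(g))      # modify r to defuse g
        return " ".join(parts)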
V Conclusion
There are many gaps in the current work and several aspects not discussed here. In particular,

1. We are developing a formalism for accommodating the system's reasoning, based on a type of HOLDS predicate whose two arguments are a proposition and a state; see [6].

2. We are working on more examples, especially more problematic cases in which, for example, a direct answer to Q's query would be "yes" (or the requested procedure) BUT a response to Q's higher goals would be "no", or "no" plus a warning - e.g.,

Q: Can I buy a 50K savings bond?
S: Yes, but you could get the same security on other investments with higher returns.

3. We need to be more precise in specifying RBc, if we are to assume that all the information needed to account for R's cooperative behavior is contained there. This may in turn reflect on how the user's knowledge base must be structured.

4. We need to be more precise in specifying how default rules play a role in causing R to modify his direct response, in recognition of Q's likelihood of drawing what seems like a generalized "script" default - if there is no reason to assume that there is anything special about the current case, don't.
REFERENCES
[1] Allen, J. Recognizing Intentions from Natural Language Utterances. In M. Brady (editor), Computational Models of Discourse. MIT Press, Cambridge MA, 1982.

[2] Carberry, S. Tracking User Goals in an Information-Seeking Environment. In Proceedings of the National Conference on Artificial Intelligence, pages 59-63. AAAI, 1983.

[3] Grosz, B. The Representation and Use of Focus in Dialogue Understanding. Technical Report 151, SRI International, Menlo Park CA, 1977.

[4] Grosz, B., Joshi, A. K. & Weinstein, S. Providing a Unified Account of Definite Noun Phrases in Discourse. In Proc. 21st Annual Meeting, pages 44-50. Assoc. for Computational Ling., Cambridge MA, June 1983.

[5] Joshi, A. K. Mutual Beliefs in Question Answering Systems. In N. Smith (editor), Mutual Belief. Academic Press, New York, 1982.

[6] Joshi, A., Webber, B. & Weischedel, R. Living Up to Expectations: Computing Expert Responses. In Proceedings of AAAI-84. Austin TX, August 1984.

[7] Kaplan, J. Cooperative Responses from a Portable Natural Language Database Query System. In M. Brady (editor), Computational Models of Discourse. MIT Press, Cambridge MA, 1982.

[8] Mays, E. Failures in natural language systems: application to data base query systems. In Proc. First National Conference on Artificial Intelligence (AAAI). Stanford CA, August 1980.

[9] McCoy, K. Correcting Misconceptions: What to Say. In CHI '83 Conference on Human Factors in Computing Systems. Cambridge MA, December 1983.

[10] Mercer, R. & Rosenberg, R. Generating Corrective Answers by Computing Presuppositions of Answers, not of Questions. In Proceedings of the 1984 Conference, pages 16-19. Canadian Society for Computational Studies of Intelligence, University of Western Ontario, London, Ontario, May 1984.

[11] Pollack, M., Hirschberg, J. & Webber, B. User Participation in the Reasoning Processes of Expert Systems. In Proc. AAAI-82. CMU, Pittsburgh PA, August 1982. A longer version appears as Technical Report CIS-82-9, Dept. of Computer and Information Science, University of Pennsylvania, July 1982.

[12] Pollack, Martha E. Goal Inference in Expert Systems. Technical Report MS-CIS-84-07, University of Pennsylvania, 1984. Doctoral dissertation proposal.

[13] Pollack, M. Good Answers to Bad Questions. In Proc. Canadian Society for Computational Studies of Intelligence (CSCSI). Univ. of Western Ontario, Waterloo, Canada, May 1984.

[14] Prince, E. Topicalization, Focus Movement and Yiddish Movement: A pragmatic differentiation. In D. Alford et al. (editors), Proceedings of the 7th Annual Meeting, pages 249-64. Berkeley Linguistics Society, February 1981.

[15] Reiter, R. A Logic for Default Reasoning. Artificial Intelligence 13:81-132, 1980.

[16] Sidner, C. L. Focusing in the Comprehension of Definite Anaphora. In M. Brady (editor), Computational Models of Discourse. MIT Press, Cambridge MA, 1982.

[17] Ward, G. A Pragmatic Analysis of Epitomization: Topicalization it's not. In Proceedings of the Summer Meeting 1982. LSA, College Park MD, August 1982. Also in Papers in Linguistics 17.

[18] Webber, B. & Mays, E. Varieties of User Misconceptions: Detection and Correction. In Proc. IJCAI-83. Karlsruhe, Germany, August 1983.