MINDMAKERS Forum
Welcome, Guest. Please login or register.
September 08, 2010, 01:57:14 PM

Login with username, password and session length
Search:     Advanced search
NB: Spam bots are becoming smarter every day - we had to turn off regular registration. To become member, please send email to Kris Thorisson ([kris'_lastname] att ru dott is).
337 Posts in 99 Topics by 99 Members
Latest Member: peterwit
* Home Help Search Login Register
+  MINDMAKERS Forum
|-+  Projects
| |-+  SAIBA Multimodal Behavior Generation Framework
| | |-+  Digest of previous email discussions (part 2 of 7)
« previous next »
Pages: [1] Print
Author Topic: Digest of previous email discussions (part 2 of 7)  (Read 1247 times)
hannes
Administrator
Newbie
*****
Posts: 43


View Profile Email
« on: August 21, 2006, 05:22:21 PM »

============================================================================
Date: Tue, 21 Mar 2006 09:22:15 +0100
From: Stefan Kopp <skopp_(at)_techfak.uni-bielefeld.de>
To: Hannes Vilhjalmsson <hannes_(at)_ISI.EDU>
Subject: Re: IVA  paper: new version
...
> - There is therefore no need for a special FML/BML sub-section in the
> Existing Representation section of the paper.  But I can add a
> sub-section describing representations in BEAT and Spark.

Yes, I realized that when I scanned the page, too. Looks cool.  I also
think we should stick to the names FML and BML.
...
> (V) What other attributes describe behaviors, defining certain common
> terms like "amount", "angle", "target", etc. so they can be used
> consistently. (the point here is that researchers may be interested in
> a different number of attributes that describe the behavior with
> different levels of detail - we need not agree on the number of
> attributes!  *but* we need to agree on the definition of some key
> attributes so that when we choose to process an attribute, we know
> what to do with it)

Yes!
...
best,
Stefan
============================================================================
From: Hannes Vilhjalmsson <hannes_(at)_ISI.EDU>
Date: 28. mars 2006 02:12:05 GMT+02:00
To: "Kristinn R. Thorisson" <thorisson_(at)_ru.is>
Subject: Re: comments and edits in paper
...
My responses to Kris's comments in the document:

- [KRTh2] I like the expanded Ymir/YTTM context and background.   
We'll need to re-word the introduction to section 2 to explain 
we're not just presenting alternative XML languages, but also 
approaches to generation (does that mean we need to include more 
background/references to communicative behavior generation in 
general?)

- [KRTh3] Real-time considerations are important, there are still 
some outstanding issues here of course.

- [KRTh4] I absolutely agree, there was some discussion over email 
about this, and I think Stefan also agrees that we should leave the 
Gesticon ouside of the core framework (but maybe suggest it as a 
possible interface).

- [KRTh5] I like the feedback idea.  I had not included the 
feedback messages in the BML specification, but we are in fact 
getting "ENDOFBEHAVIOR ID=x" messages back from the Unreal 
Environment.

- [KRTh6] Regardless of actual terms, we need the following classes 
of participants:
 (1) The one producing the content
 (2) The one or more people the content is directed at (sub-set of (3))
 (3) The one or more people that are ratified receivers of the content
 (4) The zero or more people that are un-ratified receivers of the 
content
Based on Kris's suggestion and on discussions we've had over here 
in the past, I propose the following terms:
 (1) Producer (was "speaker")
 (2) Addressee
 (3) Receiver (was "listener" or "hearer")
 (4) Overhearer
Although I understand the more precise term "content producer", I 
much prefer a single word for the API term, there are just too many 
different ways to represent two words ("ContentProducer", 
"content_producer", "content producer","contentProducer", etc..) 
and it gets a bit too long in a compact message.

- [KRTh8] Yes, "turn" is just a super-category and the various 
things that one can do with a turn ("give", "take", "request", 
"keep") are sub-types that can be described as an attribute.  The 
argument for a super-category like this is that we may not all 
agree on how many sub-types there are - this way we can keep the 
number of attributes (and thus the types of turn management types) 
open.  The "real-time" vs. "pre-planned" use of this message is 
something we can discuss more - the  text in the definition is 
geared towards the "pre-planned" use, but maybe we can make it more 
general.

- [KRTh9] "Performative" here really describes "Content", often in 
the form of a "Speech Act" type and a "Proposition" (i.e. "INFORM" 
and "(is sky blue)").  As for whether it also describes a 
multimodal "Communicative Act", it depends on whether you define 
"Communicative Act" as simply a multimodal version of Speech Act or 
whether it's any communication action, in which case *any* FML 
could be considered a "Communicative Act" (including interaction 
management).

- [KRTh10] Let's get Stacy's and Catherine's take on this, I don't 
use it myself (but I think they do).  The definition includes my 
interpretation of it, they may have a better one.

- [KRTh11] Yes, I like "Performance", though we can't confuse it 
with "Performative" - let's ask what others think.

Cheers,
 -= hannes
============================================================================
Date: Wed, 29 Mar 2006 00:45:57 +0200
From: Stefan Kopp <skopp_(at)_techfak.uni-bielefeld.de>
To: "K. R. Thorisson" <thorisson_(at)_ru.is>, Hannes Vilhjalmsson <hannes_(at)_ISI.EDU>,
Subject: representations paper
...I think we need some discussion about the role and necessity of
the "Gesticon". It is not even clear in the paper what it really is
(it's explained as a behavior lexicon, but also referred to as a
language for describing behaviors in a player-independent way, which is
actually BML supposed to be).  From the comments of Kris and Hannes, I
took that they have some concerns there too.  I also think there is no
agreement yet about how it is currently plugged into the architecture in
figure 1.

Catherine and Brigitte, my suggestions are:
- to state that a behavior lexicon, which may be called "Gesticon", can
of course be part of the architecture, but it should be drawn upon by
the behavior realization component. That is, in Fig. 1 it should be
hooked to the BR box and not the connection between BP and BR. It would
still enable the exchange of behavior definitions as you wanted it.
- to fuse sections 5 and 6. Both address the same problem, namely to
specify overt multimodal behavior, and they are partly redundant. Hannes
and I have made some suggestions in the last long emails as to what can
be part of BML, such that the Gesticon elements would be covered too.  I
think there is a lot of good stuff in Sect. 6 which I would like to see
being described as BML elements.
- to leave out the detailed gesture specification. Besides the fact that
we are probably going to have not enough space in the paper, what is
described there is basically yet another approach in addition to MURML
or other languages, and we didn't really have a discussion about how a
unified gesture spec language could look like (besides sparse email
exchange). Let's have such a discussion and leave it open as a future
topic. If you want, we can mention that there are some markup-languages
out there for describing gesture form, though.

What do think??
...

best,
Stefan
============================================================================
From: Stacy Marsella <marsella_(at)_ISI.EDU>
Subject: Re: representations paper
Date: Wed, 29 Mar 2006 07:38:16 -0800
To: skopp_(at)_techfak.uni-bielefeld.de
...
The lead in sentence threw me a tad. "Human communicative behavior" 
suggests a large scope. Should we limit it? Any perceived action is a 
communication (e.g., Cristiano's BICs). An impatient driver can start 
the car to communicate to a spouse to hurry up so they can leave.. 
Writing a letter or putting up a billboard are also communicative 
behaviors. Perhaps something like face-to-face interaction is more 
the focus here?

The SAIBA diagram and the Gesticon. I agree with whomever made the 
comment that the link from Gesticon to the link between Beh. Planning 
and Beh. Realization seems strange. In our system (SmartBody) we use 
something akin to a Gesticon (we call it a Gestuary) but it is linked 
to both the planning and generation boxes. Also  it is currently a 
repository of BML commands and BML macros, not a distinct 
representation.

The FML section is pretty weak obviously. It reads like what it is - 
we all put our favorite bits (psycho-socio-linguistic theories) in.
The various categories come off as a weird mix. Are we not simply 
saying that an utterance can have a range of socio-emotive goals, 
there are a range of theories that characterize these goals and FML 
is currently being very open in what it includes?

Coping is one way to characterize the socio-affective goal and 
strategy of the utterance - I could clarify that in the text. In   
essence it details (or at least is one way to detail) the 
illocutionary force of the utterance in both personal and social ways.

I don't know what is meant by certainty? Is certainty an annotation 
describing the speaker's confidence in the propositional content, the 
intent to convey that confidence, the imposition on negative face, is 
it the strength of what Searle calls the sincerity conditions, or 
something else?

The Cognitive annottation is also not clear. The text states it is 
the meta-cognitive activities but the example  (problems with recall) 
is strictly speaking not meta-cognitive though of course one could 
have a meta-cognitive strategy for facilitating recall problems - by 
meta-cognitive do people mean performance issues associated with 
cognitive function (cognitive load, recall problems, etc.)
============================================================================
From: "Kristinn R. Thorisson" <thorisson_(at)_ru.is>
Subject: Re: representations paper
Date: Wed, 29 Mar 2006 17:39:00 +0000
To: Stefan Kopp <skopp_(at)_techfak.uni-bielefeld.de>,

On 29.3.2006, at 15:38, Stacy Marsella wrote:
> The lead in sentence threw me a tad. "Human communicative behavior" 
> suggests a large scope. Should we limit it? Any perceived action is 
> a communication (e.g., Cristiano's BICs). An impatient driver can 
> start the car to communicate to a spouse to hurry up so they can 
> leave.. Writing a letter or putting up a billboard are also 
> communicative behaviors. Perhaps something like face-to-face 
> interaction is more the focus here?

I think we are hoping to be more general than simply face-to-face, so 
that the same mechanisms, if not implementations, could be used to 
communicate via starting the car. Of course nobody expects us to 
cover the whole range of human communication from the get-go, but at 
least we are clear about the goals if we say human communicative 
behavior.  In other words, there is no theoretical reason to 
restrict ourselves to f-t-f, but perhaps we could say that we are 
starting a bit more modestly...

> The SAIBA diagram and the Gesticon. I agree with whomever made the 
> comment that the link from Gesticon to the link between Beh. 
> Planning and Beh. Realization seems strange. In our system 
> (SmartBody) we use something akin to a Gesticon (we call it a 
> Gestuary) but it is linked to both the planning and generation 
> boxes. Also  it is currently a repository of BML commands and BML 
> macros, not a distinct representation.

That is a problem. This whole issue of the gesticon needs to be 
cleared up. Whenever we find such problems we can either (a) solve 
them before the paper goes out or (b) leave them out of the paper. I 
suggest Stefan takes the lead on how to handle these things.

> The FML section is pretty weak obviously. It reads like what it is 
> - we all put our favorite bits (psycho-socio-linguistic theories) in.
> The various categories  come off as a weird mix. Are we not simply 
> saying that an utterance can have a range of socio-emotive goals, 
> there are a range of theories that characterize these goals and FML 
> is currently being very open  in what it includes?

I think it's a good idea to pull back a bit on detail in the places 
where there are large divergences... so sure, we can say that it can 
have "a range of socio-emotive goals".

> I don't know what is meant by certainty? Is certainty an annotation 
> describing the speaker's confidence in the propositional content, 
> the intent to convey that confidence, the imposition on negative 
> face, is it the strength of what Searle calls the sincerity 
> conditions, or something else  ?

The way I understood it: assertiveness.

> The Cognitive annottation is also not clear. The text states it is 
> the meta-cognitive activities but the example  (problems with 
> recall) is strictly speaking not meta-cognitive though of course 
> one could have a meta-cognitive strategy for facilitating recall 
> problems - by meta-cognitive do people mean performance issues 
> associated with cognitive function (cognitive load, recall 
> problems, etc.)

Good point. I'm not sure what was meant there.

=K
============================================================================
Date: Thu, 30 Mar 2006 00:52:07 +0200
From: Catherine Pelachaud <c.pelachaud_(at)_iut.univ-paris8.fr>
To: "Kristinn R. Thorisson" <thorisson_(at)_ru.is>
Subject: Re: representations paper

A quick reaction to these last exchanges of emails.
>> The lead in sentence threw me a tad. "Human communicative behavior" 
>> suggests a large scope. Should we limit it? Any perceived action is 
>> a communication (e.g., Cristiano's BICs). An impatient driver can 
>> start the car to communicate to a spouse to hurry up so they can 
>> leave.. Writing a letter or putting up a billboard are also 
>> communicative behaviors. Perhaps something like face-to-face 
>> interaction is more the focus here?
>
> I think we are hoping to be more general than simply face-to-face, so 
> that the same mechanisms, if not implementations, could be used to 
> communicate via starting the car. Of course nobody expects us to 
> cover the whole range of human communication from the get-go, but at 
> least we are clear about the goals if we say human communicative 
> behavior.  In other words, there is no theoretical reason to 
> restrictu ourselves to f-t-f, but perhaps we could say that we are 
> starting a bit more modestly...

I agree with you. Several of us will be at ZIF next week. It has been
discussed at length at a first workshop at ZIF what communicative
gesture is? when does a gesture can be considered communicative? there
are much more gestures that are communicative than the gesture we
normally used for our ECA! so we do need to state that!

>> The SAIBA diagram and the Gesticon. I agree with whomever made the 
>> comment that the link from Gesticon to the link between Beh. 
>> Planning and Beh. Realization seems strange. In our system 
>> (SmartBody) we use something akin to a Gesticon (we call it a 
>> Gestuary) but it is linked to both the planning and generation 
>> boxes. Also  it is currently a repository of BML commands and BML 
>> macros, not a distinct representation.

if necessary, an arrow can be added from planning to gesticon.

> That is a problem. This whole issue of the gesticon needs to be 
> cleared up. Whenever we find such problems we can either (a) solve 
> them before the paper goes out or (b) leave them out of the paper. I 
> suggest Stefan takes the lead on how to handle these things.

It really depends what we aim to achieve. Gesticon (or Gestuary) is
important per se as it defines behaviors independently of graphics model
(or at least independently of most graphics models (eg, it may not be
true for models done in flash animation)). If an aim of the languages we
are working on is to allow modularity, exchange of modules, and the
like; independence from graphics model seems a very important premise.
If so Gesticon has its place.
Using a fix set of token behaviors (smile, head nod,...) does not allow
one to get a large variety of subtle expressions. Personally I am not
keen in this type of representation.

>> I don't know what is meant by certainty? Is certainty an annotation 
>> describing the speaker's confidence in the propositional content, 
>> the intent to convey that confidence, the imposition on negative 
>> face, is it the strength of what Searle calls the sincerity 
>> conditions, or something else  ?
>
> The way I understood it: assertiveness.
>
It looks like several tags of FML are very similar to APML
(meta-cognitive, certainty, contrast...). If it is so, we could use
their definition from APML, that is from Isabella's theory.

best,
Catherine
« Last Edit: August 21, 2006, 05:54:05 PM by hannes » Logged
Pages: [1] Print 
« previous next »
Jump to:  

Powered by MySQL Powered by PHP Powered by SMF 1.1.4 | SMF © 2006-2007, Simple Machines LLC Valid XHTML 1.0! Valid CSS!