August 13, 2007

[PRACTICE TALK] Swapna's Sigdial paper practice talk

Detecting Arguing and Sentiment in Meetings

This paper analyzes opinion categories like Sentiment and Arguing in meetings.
We first annotate the categories manually. We then develop genre-specific lexicons using interesting function word combinations for detecting the opinions. We analyze relations between dialog structure information and opinion expression in context of multi-party discourse. Finally we show that classifiers using lexical and discourse knowledge have significant improvement over baseline.

Posted by nlplab at 08:18 AM

June 18, 2007

[Practice Talk] Convergence and Learning

In this paper we examine whether the student-to-tutor convergence of lexical and speech features is a useful predictor of learning in a corpus of spoken tutorial dialogs. This possibility is raised by the Interactive Alignment Theory, which suggests a connection between convergence of speech features and the amount of semantic alignment between partners in a dialog. A number of studies have shown that users converge their speech productions toward dialog systems. If, as we hypothesize, semantic alignment between a student and a tutor (or tutoring system) is associated with learning, then this convergence may be correlated with learning gains. We present evidence that both lexical convergence and convergence of an acoustic/prosodic feature are useful features for predicting learning in our corpora. We also find that our measure of lexical convergence provides a stronger correlation with learning in a human/computer corpus than did a previous measure of lexical cohesion.
Posted by nlplab at 08:39 AM

June 11, 2007

[PRACTICE TALK] Josh, Rebecca on MT Evaluations

Abstracts etc. TBA
Posted by hwa at 02:00 PM

May 31, 2007

[TALK] Adam Lopez (UMD)

Hierarchical Phrase-Based Translation with Suffix Arrays

A major engineering challenge in statistical machine translation systems is the efficient representation of extremely large translation rulesets. In phrase-based models, this problem can be addressed by storing the training data in memory and using a suffix array as an efficient index to quickly lookup and extract rules on the fly. Hierarchical phrase-based translation introduces the added wrinkle of source phrases with gaps. Lookup algorithms used for contiguous phrases no longer apply and the best approximate pattern matching algorithms are much too slow, taking several minutes per sentence. I describe new lookup algorithms for hierarchical phrase- based translation that reduce the empirical computation time by nearly two orders of magnitude, making on-the-fly lookup feasible for source phrases with gaps. I will also discuss some novel applications of these algorithms.

Speaker Bio

Adam Lopez is a Ph.D. candidate in computer science at the University of Maryland, expecting to graduate in August 2007. His dissertation work focuses on statistical machine translation and his interests are in large-scale natural language processing and algorithms. Prior to graduate school, he worked as a software engineer at the IBM Corporation, after receiving his bachelor's degree in computer science from Duke University.

Posted by hwa at 02:00 PM

May 21, 2007

[TALK] Dialogue Research in Toyota Central Labs

Presenter: Ryoko TOKUHISA ( Toyota Central R&D Labs )

NOTE - THIS TALK WILL BE AT 12 NOON!

I introduce the overview of the researches in Toyota Central Labs. We are developing the dialogue system for the car navigation system and the home robot. I mainly work on the affective dialogue of the home robot, so that it would be closely connected with the Emotion Detection in Tutoring task and Opinion type analysis.

Posted by nlplab at 12:00 PM

March 20, 2007

Ph.D. proposal defense - Mihai Rotaru

CANDIDATE: Mihai Rotaru
TITLE: Applications of Discourse Structure for Spoken Dialogue Systems
WHEN: Tuesday, March 20, 1 pm
WHERE: 5317 Sennott Hall (5th floor conference room)

COMMITTEE MEMBERS:
Diane J. Litman (advisor)
Rebecca Hwa
Carolyn P. Rosé
Janyce M. Wiebe

ABSTRACT:
Just as words in a utterance are organized in a structure (e.g. syntactic, semantic), utterances in a discourse (monologue or dialogue) are organized in structure called the discourse structure. Our proposed work investigates the utility of discourse structure for spoken dialogue systems (computer systems that interact with users via speech).

Two types of applications are being pursued: on the system side and on the user side. On the system side, we investigate if the discourse structure information is useful for various spoken dialogue system tasks: performance analysis, characterization of user affect and characterization of speech recognition problems. On the user side, we investigate whether the discourse structure information is useful for users through a graphical representation of the discourse structure.
Posted by nlplab at 01:00 PM

March 13, 2007

QA with Attitude: Exploiting Opinion Type Analysis for Improving Question Answering

Speaker: Swapna Somasundaran

Room : Board room ( 6th floor - room 6329) Sennot Square

Time : 9:00 am

Practice talk for ICWSM-07.

Abstract
In this work, we explore the utility of attitude types for improving question answering (QA) on both web-based discussions and news data. We present a set of attitude types developed with an eye toward QA and show that they can be reliably annotated. Using the attitude annotations, we develop automatic classifiers for recognizing two main types of attitudes: sentiment and arguing. Finally, we exploit information about the attitude types of questions and answers for improving opinion QA with promising results.

Posted by nlplab at 09:37 AM

December 08, 2006

[TALK] Ray Mooney December 8

Learning to Extract Proteins and their Interactions from Biomedical Text (2:00, Room 5313 Sensq) - NOTE UNUSUAL TIME AND ROOM Learning to Extract Proteins and their Interactions from Biomedical Text Raymond J. Mooney University of Texas at Austin Automatically extracting information from biomedical text holds the promise of easily consolidating large amounts of biological knowledge in computer-accessible form. This strategy is particularly attractive for extracting data on human genes from the 11 million abstracts in Medline. We have developed and evaluated a variety of learned information-extraction systems for identifying human proteins and their interactions in Medline abstracts. We will present our current best results on identifying names of human proteins using Conditional Random Fields and Relational Markov Networks. We will also present our current best results on identifying interactions between proteins using a Support Vector Machine with an underlying string kernel. Finally, we will summarize results from a recent large-scale application of our techniques, in which we mined 753,459 Medline abstracts to extract a database of 6,580 interactions between 3,737 human proteins. By merging this extracted data with existing databases, we have constructed (to our knowledge) the largest database of known human-protein interactions containing 31,609 interactions amongst 7,748 proteins. Bio: Raymond J. Mooney is a Professor in the Department of Computer Sciences at the University of Texas at Austin. He received his Ph.D. in 1988 from the University of Illinois at Urbana/Champaign. He is an author of over 100 published research papers, primarily in the area of machine learning. He was program co-chair of the 2006 National Conference on Artificial Intelligence, general chair of the 2005 joint Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing, co-chair of the 1990 International Conference on Machine Learning, a recipient of the Best Research Paper Award at the 2004 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, a former editor of the Machine Learning journal, and a Fellow of the American Association for Artificial Intelligence. His recent research has focused on learning for natural-language processing, text mining, statistical relational learning, transfer learning, active learning, semi-supervised learning, bioinformatics, and autonomic computing.
Posted by nlplab at 04:56 PM

November 13, 2006

How much data is enough? (Experiments with Confidence Bounds for MDP's)

Speaker: Joel Tetreault Data sparsity is one of the major issues that NLP researchers always wrestle with. That is, does one have enough data to make reliable conclusions in an experiment? Using Reinforcement Learning to improve a spoken dialogue system is no exception. Past approaches in this area have simply assumed that there was enough collected data to support a certain state and action space, or used thousands of user simulations to overcome the sparsity issue. In this talk, we present a methodology of confidence bounds on the expected reward to address the problem of data sparsity in MDP's. We show how this methodology works by apply it to a prior experiment of using MDP's to predict the best features to include in a model of the dialogue state. We also show how this approach has applications in model switching and user simulations.
Posted by nlplab at 10:24 AM

September 22, 2006

[TALK] Learning to Show You're Listening: A Trainer for Back-Channeling in Arabic

Nigel G. Ward, Yaffa Al Bayyari, Rafael Escalante, Thamar Solorio

University of Texas at El Paso

12 noon, 5317 Sennott Square (ISP Forum)

Abstract: Good listeners generally produce back-channel feedback, and do so in a
language-appropriate way. Second language learners often lack this
skill. We present a training sequence which enables learners to
acquire a basic Arabic back-channel skill, namely, that of producing
feedback immediately after the speaker produces a sharp pitch
downslope. This training sequence includes an explanation, audio
examples, the use of visual signals to highlight occurrences of the
pitch downslope, auditory and visual feedback on learners' attempts to
produce the cue themselves, and feedback on the learners' performance
as they play the role of an attentive listener in response to one side
of a pre-recorded dialog. Preliminary experiments suggest that this
allows some learners to acquire this behavior.

The talk will also touch on the role of back-channels in various types
of dialog, methods for the discovery and quantification of
dialog-relevant prosodic cues, potential cross-cultural
misunderstandings of prosodic signals, the interplay between
meta-communication and the communication of content, and ways to
quantify the value of good turn-taking relative to other dialog skills.

Posted by nlplab at 12:20 PM

September 15, 2006

[TALK] Cognitive Load and Spoken Interface Design: Comparing Natural and Standardized Approaches to the Generation of Referring Expressions

Speaker: Ellan Campana, Arizona State / University of Rochester 12 noon, 5317 Sennott Square (ISP Forum) Human language capabilities are both context-dependant and flexible. On the one hand psycholinguistic evidence suggests that listeners naturally and rapidly integrate elements of the visual, discourse-level, and social context with incoming speech, using these elements to improve the speed at which they identify the intended referents of referring expressions. On the other hand research in human-human interaction has also shown that listeners are flexible in their use of language in that they are able to adapt to speaker-dependant patterns, and that they are capable of establishing and using new referring expressions and sub-languages for specific domains. In the spoken language interface literature, these two sets of findings have been used to support two different approaches to interface design, which I call the "natural" approach and the "standardized" approach. The natural approach argues that in order to be easy to use, such interfaces should approximate human-human interaction as closely as possible, including context-dependant generation and understanding of referring expressions. The standardized approach argues that instead systems should take advantage of human abilities to learn and adapt while minimizing computational complexity. Thus, users should be exposed to and use consistent, non-context-dependant referring expressions so that the systems will be easier to learn. There is little direct empirical evidence examining which of these design approaches results in less cognitive load on the part of system users. In this talk I will describe the results from my research applying a classic tool of cognitive psychology, the dual-task paradigm, to spoken interface evaluation with the goal of comparing the two approaches directly. Specifically, I examine natural and standardized design approaches with respect to the role of discourse context in user comprehension / system generation of referring expressions. Speaker Bio Ellen Campana is a Lecturer ABD at Arizona State University, in the Arts, Media, and Engineering Program and the Psychology Department. She is also currently a candidate for a joint Ph.D. in Brain & Cognitive Sciences and Computer Science at the University of Rochester. She holds a B.S. in Computer Science and a B.S in Psychology from the University of Wisconsin-Madison, and an M. A. in Brain and Cognitive Sciences from the University of Rochester.
Posted by nlplab at 05:01 PM

September 11, 2006

Interspeech Practice Talk

Building an English-Iraqi Arabic Machine Translation System for Spoken Utterances with Limited Resources.

By: Behrang Mohit
This is a joint work with Jason Riesa, Kevin Knight and Daniel Marcu
The paper can be found here .

Posted by behrang at 01:32 PM

August 30, 2006

Benjamin Han: A Constraint-based Framework for Resolution of Time in Natural Language

Abstract: Temporal information plays a crucial role in many natural language (NL) applications, yet automatic interpretation and reasoning of the information has not seen wide use in practical systems. Solving this problem requires: (1) a suitably designed semantic representation of time that is rich enough to capture the meaning conveyed in NL yet compositional enough to facilitate the construction of a robust syntax-semantic interface; and (2) an appropriate inference mechanism that can work with the representation to reason with a captured temporal scenario. In this talk I will describe a constraint-based framework for processing temporal information in NL. The constraint nature of the approach makes it possible to deal with under-specification and mixed granularities one often encounters in NL. At the lower level our two-tiered framework models a human calendar (such as Gregorian) as a constraint system. Information coming from temporal expressions, verbal tense/aspect and other prepositional phrases is captured in a novel representation called Time Calculus for Natural Language (TCNL), and each TCNL formula is used to instantiate constraint satisfaction problems (CSP) over the calendar model. At the higher level the framework captures a temporal scenario described in NL as a temporal constraint satisfaction problem (TCSP). The solutions to these constraint satisfaction problems can then be solved using conventional (but modified) methods such as AC3 and the all-pair-shortest-path algorithm. Finally queries can be formulated by relating TCSPs via hypothetical constraints, and they can be answered by solving for the consistency of the merged TCSPs.
Posted by nlplab at 11:18 AM

August 02, 2006

Humor: Prosody Analysis and Automatic Recognition for FRIENDS


Speaker: Amruta Purandare

Purpose: Prelim Exam

Abstract: We analyze humorous spoken conversations from a classic comedy television show, FRIENDS, by examining acoustic-prosodic and linguistic features and their utility in automatic humor recognition. Using a simple annotation scheme, we automatically label speaker turns in our corpus that are followed by "laughs" as Humorous, and the rest as Non-Humorous. Our humor-prosody analysis reveals significant differences in prosodic characteristics (such as pitch, tempo, energy etc.) of humorous and non-humorous speech. Humor recognition was carried out using standard supervised learning classifiers, and shows promising results significantly above the baseline.

Posted by nlplab at 12:30 PM

July 06, 2006

ACL practice talk

Speaker: Jan Wiebe
Title: Word Sense and Subjectivity

Posted by nlplab at 03:52 PM

July 05, 2006

[TALK] ACL Practice

Mihai's practice talk
Posted by nlplab at 03:57 PM

June 28, 2006

"Reinforcement Learning Of Dialogue Strategies Using Restricted Contexts"

by Matthew Frampton (visiting PhD student from University of Edinburgh) This thesis investigates restricted contexts in reinforcement learning (RL) of effective dialogue strategies for information seeking spoken dialogue systems (e.g. COMMUNICATOR Walker et al. 2001). The contexts used are richer than in previous research e.g. Levin and Pieraccini 1997,Scheffler and Young 2001,Singh et al. 2002,Pietquin and Renals 2002 which use only slot-based information i.e. whether or not a slot (e.g. destination city) has been filled and the confidence score associated with any supplied value. The contexts remain much less complex than the full dialogue ``Information States'' explored in Henderson et al. 2005, for which tractability is an issue. Feature engineering is used in order to identify relevant context features from the COMMUNICATOR data. The reinforcement learner then uses a context which includes these relevant features plus the slot-status features. It learns dialogue strategies as it interacts with n-gram user simulations, the probabilities for which are derived from the COMMUNICATOR data. Finally, the learned strategies are evaluated using human subjects. The central hypothesis is that the additional contextual information will enable the learning of more effective dialogue strategies. Results have been obtained in experiments which use n-gram user simulations in both training and testing. The baseline strategy was learned with only the slot-status information. The best performing strategy was learned after adding both the last system and user dialogue moves. It improved over the baseline by 7.8% in average reward per dialogue (significance level p < 0.005) and over the (hand-coded) COMMUNICATOR systems, by 65.9%. The new `emergent' strategies do better in problematic situations where they employ `focus switching' and make effective use of the `give-help' action.
Posted by nlplab at 03:50 PM

June 21, 2006

Manual Annotation of Opinion Categories in Meeting

Speaker Swapna Somasundaran

Practice talk for ACL 2006 Workshop on Frontiers in Annotation

Abstract

This paper applies the categories from an opinion annotation scheme developed for monologue text to the genre of multiparty meetings. We describe modifications to the coding guidelines that were required to extend the categories to the new type of data, and present the results of an inter-
annotator agreement study. As researchers have found with other types of
annotations in speech data, interannotator agreement is higher when the
annotators both read and listen to the data than when they only read the transcripts. Previous work exploited combinations of prosodic and lexical clues to perform automatic detection of speaker emotion (Liscombe et al. 2003). Our findings suggest that doing so to recognize opinion categories would be a promising line of work.

Posted by nlplab at 10:18 PM

June 14, 2006

[TALK] Exploiting Discourse Structure for Spoken Dialogue Performance Analysis

Speaker: Mihai Rotaru

Practice talk for EMNLP 2006. Here is the paper abstract:

In this paper we study the utility of discourse structure for spoken dialogue performance modeling. We experiment with various ways of exploiting the discourse structure: in isolation, as context information for other factors (correctness and certainty) and through trajectories in the discourse structure hierarchy. Our correlation and PARADISE results show that, while the discourse structure is not useful in isolation, using the discourse structure as context information for other factors or via trajectories produces highly predictive parameters for performance analysis.

Posted by nlplab at 12:30 PM

May 31, 2006

[TALK] Keynote practice talk

Diane will be giving a practice for her HLT-NAACL keynote.
Posted by nlplab at 01:57 PM

May 17, 2006

Annotating the Contextual Polarity of Subjective Expressions

Presentation by: Paul Hoffmann
(for preliminary exam) I will be talking about an annotation scheme that adds sentiment tags to subjective expressions in the Multiperspective Question Answering Corpus.
Posted by nlplab at 12:30 PM

May 10, 2006

Automatically Adapting Lexical Resources to the Biomedical Domain

Invited Speaker: Dominic Widdows.
Wednesday, May 10, 2006.
NOTE: Room change - Talk to be held in Room 5317

About the Author
Dominic Widdows is a Senior Research Engineer at MAYA Design, Inc , and author of Geometry and Meaning, a critically
acclaimed introduction to Text Mining for the general reader.

Abstract

Automatically Adapting Lexical Resources to the Biomedical Domain
(work with Beate Dorow, Adil Toumouh and Ahmed Lehireche)

After giving a brief introduction to the combination of lexicosyntactic patterns and graph theory, as used in recent years for lexical acquisition from corpora, the talk will focus on some recent experiments on using these techniques to adapt WordNet to the medical domain. Our basic technique is to extract relationships between terms using the Ohsumed corpus, a large collection of abstracts from PubMed, and to compare the relationships extracted with those that would be expected for medical terms, given the structure of the WordNet ontology. The linguistic methods involve the use of a variety of lexicosyntactic patterns, that enable us to extract pairs of coordinate noun terms, and also related groups of adjectives and nouns, using Markov clustering. This enables us in many cases to analyse ambiguous words and select the correct meaning for the biomedical domain. While results are often encouraging, the paper also highlights evident problems and drawbacks with the method, and outlines suggestions for future work. This will be described as part of the ongoing challenge to produce lexical semantic language models to complement traditional n-gram and syntactic language models.

The recent results in this talk are drawn from our upcoming LREC paper, and from Beate Dorow's recent PhD thesis.

Posted by nlplab at 05:45 PM

March 29, 2006

Using Reinforcement Learning to Build a Better Model of Dialogue State - Joel Tetreault

(Practice talk for EACL one week later) Given the growing complexity of tasks that spoken dialogue systems are trying to handle, Reinforcement Learning (RL) has been increasingly used as a way of automatically learning the best policy for a system to make. While most work has focused on generating better policies for a dialogue manager, very little work has been done in using RL to construct a better dialogue state. This talk presents a RL approach for determining what dialogue features are important to a spoken dialogue tutoring system. Our experiments show that incorporating dialogue factors such as dialog acts, emotion, repeated concepts and performance play a significant role in tutoring and should be taken into account when designing dialogue systems.
Posted by nlplab at 12:51 PM

March 22, 2006

[TALK] The role of discourse structure in analyzing spoken tutoring dialogues

Speaker: Mihai Rotaru

Based on recent advancements in spoken dialogue technologies, researchers have begun implementing spoken dialogue systems in more complex domains. This work is part of our ongoing project that studies the challenges posed by the tutoring domain to spoken dialogue design. Our approach is to study dependencies between speech recognition problems and various dialogue factors. In our previous work, we found interesting results using this methodology: chaining effects for certain speech recognition problems (our Interspeech 2005 paper) and interactions with certainty, correctness and frustration/anger (paper submitted to ACL 2006).

In this presentation, I talk about our preliminary results that analyze the role of dialogue structure for understanding several dialogue phenomena.

Posted by nlplab at 12:30 PM

March 15, 2006

Building an English-Iraqi Arabic Machine Translation System for Spoken Utterances with Limited Resources

Presentation by: Behrang Mohit

I will talk about the challenge of Speech Translation and the ways that we used a text translation system to build a speech translator. Specifically, our efforts were aimed at leveraging from the resources in Modern Standard Arabic (MSA) to enrich the translation and language models of a speech translation system for the Iraqi Arabic and English.
This is a joint work with Jason Riesa, Kevin Knight and Daniel Marcu.

Posted by behrang at 12:10 AM

March 01, 2006

March 1, 2006: Comparing Synthesized versus Pre-recorded Tutor Speech in an Intelligent

Note: We will meet at 1 PM!! (Talk by Kate Forbes-Riley, Wed., March 1) This is a practice talk for our paper accepted at the FLAIRS 2006 conference (with: Diane Litman and Scott Silliman and Joel Tetreault). Here's the abstract for the paper: We evaluate the impact of tutor voice quality in the context of our intelligent tutoring spoken dialogue system. We first describe two versions of our system which yielded two corpora of human-computer tutoring dialogues: one using a tutor voice pre-recorded by a human, and the other using a low-cost text-to-speech tutor voice. We then discuss the results of two-tailed t-tests comparing student learning gains, system usability, and dialogue efficiency across the two corpora. Overall, our results suggest that although the quality of the tutor voice does not significantly impact student learning gains, it does impact the usability and efficiency of our tutoring system.
Posted by nlplab at 09:18 AM

February 22, 2006

[talk] No meeting Feb. 22 due to Black Wednesday

Posted by nlplab at 11:49 AM

February 08, 2006

[talk] Subjectivity Analysis and Recognizing Contextual Polarity

This will be a practice talk for my talk at Maryland.
Posted by nlplab at 10:12 AM

January 25, 2006

Syntax-Based Named Entity Extraction for English and Arabic

By Behrang Mohit

Abstract: I present a framework to train a named entity (NE) tagger from a limited amount of annotated lexical resources. My approach leverages from other available resources such as syntactic and shallow semantic analyses. These resources are helpful in locating potential named entities that can be used to train a tagger with unsupervised approaches. My final goal was the development of the system for Arabic or other languages with limited resources. I first performed a proof of concept study on English as well. I report experimental results showing that there is a steady boost of classification accuracy when we use the extracted unla-beled data together with a small set of labeled training data. I also report the results of our effort on porting the system to the Arabic language. While the accuracy of the Arabic system is lower than the English system, our findings about the effects of different syntactic features hold for both languages

Posted by behrang at 02:46 PM

January 18, 2006

Jan. 18: Modelling User Satisfaction and Student Learning in a Spoken Dialogue Tutoring System with Generic, Tutoring, and User Affect Parameters

(by Kate Forbes-Riley, Talk for Wed., Jan 18, 12:30) This talk summarizes a paper submission (with Diane Litman). Here is the abstract for that paper: We investigate the use of the PARADISE framework for developing predictive models of system performance in our spoken dialogue tutoring system. We represent system performance using two metrics: user satisfaction, and student learning. We train and test predictive models of these metrics in our tutoring system corpora. We predict user satisfaction with 2 types of parameters: 1) generic system parameters, and 2) tutoring-specific parameters. To predict student learning, we also use a third type: 3) user affect parameters. Though generic parameters are useful predictors of user satisfaction in other PARADISE applications, overall our parameters produce less useful user satisfaction models in our system. However, generic and tutoring-specific parameters do produce useful models of student learning in our system. User affect parameters can increase the usefulness of these models.
Posted by nlplab at 03:54 PM

January 13, 2006

Language Modeling and Its Applications

Eugene Charniak

Brown University

Friday, January 13, 2006
10:30am - SENSQ 5317

Refreshments at 10:00am

Hosted by Jan Wiebe
Abstract

Parsing is the problem of mapping a sentence (in, say, English) to a phrase structure. It is important because it gives us a first rough cut at meaning. During the 1990s there was a flurry of new results using statistical techniques that gave us our first robust parsers ready for every-day use. While there has been continued results since then, the practical parsers at the start of 2005 were no better than what has available in 2000. The first part of the talk will recap this ancient history.

The last 12 months, however have seen a dramatic turn-around, with error rates decreasing by 25%. The second and third parts of the talk describe the two techniques responsible for this state of affairs: discriminative reranking and self training. We also show that the latest results seem to be less corpus specific than the previous results. (That is, they carry over to text corpora reasonably different than those upon which they were trained.

Finally we discuss a new parsing paradigm, course-to-find parsing, and present some starry-eyed proposals for radically different views of parsing.

Posted by behrang at 02:26 AM

November 21, 2005

Jost's Talk *CANCELLED*

*****Jost is not able to visit us and give his talk due to Hurricane Wilma*****

Jost Schatzmann (http://mi.eng.cam.ac.uk/~js532/) is visiting on Nov 21. He
is going to give a talk on "Learning Dialogue Management Strategies with a
Simulated User" at 10:30AM in SS5317. You can sign up to talk to him by making comments to this message.


Posted by nlplab at 10:30 AM

November 09, 2005

Nov 09: Using Reinforcement Learning to Improve Human-Computer Tutoring Systems

(by Joel Tetreault, talk for Wed Nov 09)

II'll be giving a talk on the work I have been doing on using Markov Decision Processes (MDP's)'s to determine good policies for our ITSPOKE tutoring dialogues. The problem with dialogue systems, and especially tutoring ones, is that there are a lot of possible actions a tutor can take depending on the student state. For example, if a student appears frustrated and uncertain answering the last question the tutor poses, we may want to ask the student an easier question or give a hint the next turn. Or if the student has been doing really well lately and is breezing through our tutoring session, we may want to give him or her a harder question and possibly also ease back on the amount of feedback to give. Given the fact there are a wide range of features to describe the student state, making a hand-tuned policy for every possible student state is simply too laborious of a task to undertake. What we propose is to use MDP's to learn the best policies for a computer to take in a system. In this talk I will present preliminary results of our research.

Posted by nlplab at 02:53 PM

October 19, 2005

Adapting Resources for Parsing Arabic Dialects

By Rebecca Hwa and Carol Nichols
ABSTRACT:
Parsing is an important component in many NLP systems. While
recent advances in statistical methods and machine learning have made
it possible to build highly accurate parsers, the success depends on
the quantity and quality of annotated training data, which may not
always be available. Arabic is an interesting case because it is
diglossic (i.e., the language exists in two forms: a "prestigious"
variety for formal writings (Modern Standard Arabic) and colloquial
varieties that are primarily spoken and are not standardized (Arabic
dialects)). There is much on-going NLP work in building resources for
MSA, but resources and NLP research for Arabic dialect are still at an
infancy stage. Because there are no parallel written corpora between
any of the dialects and any other language, including MSA, most of the
techniques developed for parsing that exploit supervised machine
learning do not apply.

In this talk, we describe our framework for leveraging existing
resources and tools for MSA in order to parse Arabic dialects. In
particular, we focus on building a bilexicon between MSA and the
Levantine dialect and building a Levantine part-of-speech tagger by
adapting from a MSA tagger. We will also present some preliminary
findings in building a Levantine parser from these resources.

This work was conducted as a part of the Parsing Arabic Dialect team
at the 2005 JHU Summer Workshop on Language Engineering.

Posted by nlplab at 01:41 PM

September 28, 2005

Swapna

Subjectivity, Emotions and Prosody - Literature review

Posted by nlplab at 03:33 PM

[TALK] Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis

Theresa Wilson

In this talk, I will present a new approach to phrase-level sentiment analysis that first determines whether an expression is neutral or polar and then disambiguates the polarity of the polar expressions. With this approach, our system is able to automatically identify the contextual polarity for a large subset of sentiment expressions, achieving results that are significantly better than baseline.

Practice talk for EMNLP

Posted by nlplab at 11:05 AM

September 21, 2005

A Backoff Model for Bootstrapping Resources for Non-English Languages

Chenhai Xi

The lack of annotated data is an obstacle to the development of many natural language processing applications; the problem is especially severe when the data is non-English. Previous studies suggested the possibility of acquiring resources for non-English languages by bootstrapping from high quality English NLP tools and parallel corpora; however, the success of these approaches seems limited for dissimilar language pairs. In this paper, we propose a novel approach of combining bootstrapped resource with a small amount of manually
annotated data. We compare the proposed approach with other bootstrapping methods in the context of training a Chinese Part-of-Speech tagger. Experimental results show that our proposed approach achieves a significant improvement over EM and self-training and systems that are only trained on manual annotations.

This is a practice talk for EMNLP 2005.

Posted by nlplab at 02:37 PM

September 14, 2005

[talk] Noah Smith

Noah Smith
Johns Hopkins University

Title: Contrastive Estimation for Unsupervised Sequence Modeling

Abstract:

Conditional random fields (Lafferty, McCallum, and Pereira, 2001) are
quite effective at sequence labeling tasks like shallow parsing (Sha
and Pereira, 2003) and named-entity extraction (McCallum and Li,
2003). CRFs are *log-linear*, allowing the incorporation of arbitrary
features into the model. Clever new features are one way to improve
performance; clever objective functions are another (see, for
instance, recent work on max-margin parsing by Taskar, Klein, et al.,
2004).

We have developed a method to do both, in the unlabeled data
framework. That is, we use log-linear models capable of exploiting
new features, and a new class of objective functions: contrastive
estimation (CE). CE can be intuitively understood as exploiting
implicit negative evidence and is computationally efficient (unlike
log-linear EM). In fact, CE generalizes EM and a variety of other
objective functions. By engineering classes of implicit negative
evidence, CE can be adapted for specific applications.

We describe applications to two natural language learning
problems---POS tagging of unlabeled text with a dictionary (Merialdo,
1994) and dependency grammar induction (Klein and Manning, 2004)---and
show how contrastive estimation outperforms EM (with the same feature
sets), is more robust to loss of domain knowledge (dictionary
degradation or uninformative initialization), and can recover by
modeling additional, nonorthogonal features.

This is joint work with Jason Eisner and was presented at ACL 2005 and
the IJCAI 2005 Workshop on Grammatical Inference Applications.

Schedule:

10:15 -- 10:30 Rebecca SENSQ 5421
10:30 -- 11:30 Behrang, Carol, Chenhai
11:30 -- 12:30 Lunch (Rebecca, Diane, Jan, Mihai)
12:30 -- 2:00 Talk
2:00 -- 2:30 Mihai
2:30 -- 3:00 Theresa
3:00 -- 3:30 Amruta, Hua
3:30 -- 4:00 Swapna, Paul
4:00 -- 4:30 Rebecca

Posted by hwa at 12:30 PM

August 03, 2005

[TALK] MIHAI and DIANE - PRACTICE TALKS FOR INTERSPEECH/EUROSPEECH and SIGDIAL

Mihai: This will be my practice talk for the paper I will present at INTERSPEECH/EUROSPEECH.

Title:
Interactions between Speech Recognition Problems and User Emotions

Abstract:
Understanding how speech recognition problems affect the interaction with the user is a topic of great interest for the spoken dialogue community. We examine the dependencies between speech recognition problems in adjacent turns. We also examine the dependencies between speech recognition problems and student emotions within a turn and in adjacent turns. We apply Chi Square (χ2) analysis to a corpus of speech-based computer tutoring dialogues to discover these dependencies. We find that rejections are followed by more rejections than expected if there was no dependency between rejections, and that misrecognitions are followed by more misrecognitions than expected. We also find a strong dependency between recognition problems in the previous turn and user emotion in the current turn: after a system rejection there are more emotional user turns than expected. Surprisingly, in our data, we find no relationship between user emotions and recognition problems within a turn nor between previous turn user emotions and current turn recognition problems.

Diane: This will be my practice talk for Sigdial

Title:
Using Bigrams to Identify Relationships Between Student Certainness States and Tutor Responses in a Spoken Dialogue Corpus

Abstract:
We use n-gram techniques to identify dependencies between student affective states of certainty and subsequent tutor dialogue acts, in an annotated corpus of human-human spoken tutoring dialogues. We first represent our dialogues as bigrams of annotated student and tutor turns. We next use chi square analysis to identify dependent bigrams. Our results show dependencies between many student states and subsequent tutor dialogue acts. We then analyze the dependent bigrams and suggest ways that our current computer tutor can be enhanced to adapt its dialogue act generation based on these dependencies

Posted by nlplab at 12:00 PM

July 27, 2005

[TALK] Paul - Polarity Literature Review

Abstract TBA.

Posted by nlplab at 02:32 AM

July 07, 2005

[TALK] Barbara Di Eugenio

Title: Natural Language Generation for Intelligent Tutoring Systems: A Case Study

Speaker: Dr. Barbara DiEugenio, University of Illinois at Chicago

When: Thursday, July 7, 10:00am

Where: Sennott Square 5317, University of Pittsburgh

Abstract:
---------

It is still an open question whether Natural Language (NL) interaction
between students and an Intelligent Tutoring System (ITS) improves
learning, and if yes, what specific features of the NL interaction are
responsible for the improvement. To investigate this issue, we developed
two different feedback generation engines for an ITS that teaches students
to troubleshoot complex systems. We systematically evaluated the two NL
interfaces in a three way comparison that included the original ITS as
well. We found that the version of the ITS which intuitively produces the
best language does engender the most learning. Specifically, it appears
that presenting feedback at a more abstract level is responsible for the
improvement.

Posted by litman at 02:23 PM

July 06, 2005

[TALK] MIHAI - PRACTICE TALK FOR AAAI WORKSHOP

This will be my practice talk for the paper I will present at the AAAI Workshop on Question Answering in Restricted Domains.

Title:
Improving Question Answering for Reading Comprehension Tests by Combining Multiple Systems

Abstract:
Most work on reading comprehension question answering systems has focused on improving performance by adding complex natural language processing (NLP) components to such systems rather than by combining the output of multiple systems. Our paper empirically evaluates whether combining the outputs of seven such systems submitted as the final projects for a graduate level class can improve over the performance of any individual system. We present several analyses of our combination experiments, including performance bounds, impact of both tie-breaking methods and ensemble size on performance, and an error analysis. Our results, replicated using two different publicly available reading test corpora, demonstrate the utility of system combination via majority voting in our restricted domain question answering task.

Posted by nlplab at 12:00 PM

June 22, 2005

[talk] Carol and Theresa practice talks for ACL

Carol: abstract TBA


Theresa: Annotating Attributions and Private States

Posted by hwa at 12:00 PM

June 08, 2005

[Talk] Art - Landscape Model & Learning

A Landscape Model analysis, adopted from the text processing
literature, was run on transcripts of tutoring sessions, and a
technique developed to count the occurrence of key physics points in
the resulting connection matrices. This point-count measure was found
to be well correlated with learning.

Posted by nlplab at 04:24 PM

[TALK] BEATRIZ - CO-TRAINING WITH SPOKEN DIALOGUE DATA

ABSTRACT
In this paper we take advantage of the
availability of a large amount of manually
annotated data to analyze the applicability
of Co-training (Blum and Mitchell,
1998) for predicting emotions with Spoken
Dialogue Data. The manual annotations
yielded the Upper Bounds of Cotraining
that show the trade-off between
the size of the initial train set and the accuracy
obtained by the final train set generated
with this method when adding examples
based on agreements, disagreements
and confidence of the predictions. We
found that in the ideal case, disagreements
would lead to build a more robust system,
but there is a gap between the Upper
Bounds and the behavior of Co-training
that lays on the amount of incorrect examples
added by the system. Our best
results achieve a maximum accuracy improvement
of 18.51% over the majority
class baseline, and 1.49% of improvement
over the accuracy of the initial train set.

Posted by nlplab at 12:00 PM

May 18, 2005

[Talk] Diane - Correlations between Dialogue and Learning

We examine correlations between dialogue behaviors and learning in tutoring, using two corpora of spoken tutoring dialogues: a human-human corpus and a human-computer corpus. To formalize the notion of dialogue behavior, we manually annotate our data using a tagset of student and tutor dialogue acts relative to the tutoring domain. A unigram analysis of our annotated data shows that student learning is correlated both with the dialogue acts of the tutor and with the dialogue acts of the student. A bigram analysis of our data shows that learning is also correlated with joint patterns of tutor and student dialogue acts. Our results show that while the use of dialogue act n-grams is a promising method for examining correlations between dialogue behavior and learning, specific findings can differ in human versus computer tutoring, with the latter better motivating adaptive strategies for implementation. In addition, we also show that although many of our students experience problems with speech recognition, such problems do not negatively correlate with student learning.

Posted by nlplab at 12:40 PM

April 22, 2005

[TALK] Theresa - Attitude Types

Abstract forthcoming

Posted by nlplab at 12:00 PM

April 15, 2005

[TALK] WENDY CHAPMAN - Annotating clinical conditions in emergency department reports

Annotating clinical conditions in reports is necessary for compiling reference standards against which automated indexing systems are compared. However, the task is vague and produces substantial variation among annotators. For example, the sentence "Patient has severe left-sided chest pain" could result in several different annotations, including "pain," "chest pain," "left-sided chest pain," and "severe left-sided chest pain." We created guidelines detailing medical and linguistic instructions about what text to include in annotations of clinical concepts and measured agreement between two annotators using the guidelines. I will present our results and describe future plans for the guidelines.

Posted by nlplab at 03:52 PM

April 08, 2005

[TALK] Beatriz's Research

Beatriz will talk about her research and results of applying Cotraining and Self Training in Spoken Dialogue Data.

Posted by at 12:00 PM

March 18, 2005

[talk] Dan Gildea

Dr. Dan Gildea from University of Rochester will be giving a talk on his recent work on March 18th, Noon.

Syntactic Structure and Statistical Machine Translation

Given that statistical methods have revolutionized both
natural language parsing and machine translation, it may
seem surprising that most current statistically-based
translation systems make no use of syntactic structure.
I will describe work on models of translation that aim
to fill this gap, presenting results for models that
make use of syntactic information provided for one or
both languages, as well as models that infer structure
directly from parallel bilingual text. I will also
describe the use of syntactic information for the
automatic evaluation of machine-produced translations.


Please sign up for a slot to meet with Dan

9:45 -- 10:00 Rebecca (SENSQ 5421)
10:00 -- 10:30 Behrang (SENSQ 5503)
10:30 -- 11:00 Paul, Swapna, and Jason (SENSQ 5422)
11:00 -- 11:15 Rebecca Part Deux (SENSQ 5421)
11:15 -- 11:45 Daqing and Hua (Cheng) (SENSQ 5111)
11:45 -- 12:00 Talk prep
12:00 -- 1:15 Talk (SENSQ 5317)
1:15 -- 2:45 Lunch (with Rebecca, Lillian, Oren, Bo, Diane?, Mihai)
2:45 -- 3:15 Diane (SENSQ 5105)
3:15 -- 3:45 Amruta and Hua (Ai) (SENSQ 5108)
3:45 -- 4:15 Theresa (SENSQ 5422)
4:15 -- 4:45 Mihai and Beatriz (SENSQ 5420)

Dinner at 6pm (with Jan, Joel, Rebecca)

Posted by hwa at 10:30 AM

February 11, 2005

Interactive Alignment Model

One of the purposes of our NLP meeting is to have an opportunity to read
and discuss new research. Art will briefly present, then lead discussion
of the paper: "Toward a mechanistic psychology of dialogue" by Martin
Pickering and Simon Garrod. This paper proposes a mechanism of automatic
alignment, by which two dialogue partners come to use similar semantic,
syntactic, lexical and phonological representations. The resulting
alignment simplifies the production and comprehension of dialogue.

Art will forward a PDF to the NLP mailing list. If you don't get your copy, please let him know.

Posted by nlplab at 12:00 PM

January 28, 2005

[talk] chenhai

TBA

Posted by nlplab at 12:00 PM

December 13, 2004

Empirical Evaluations of Pronoun Resolution

December 13: NLP meeting.

I'll be using the NLP meeting to give a practice talk for my dissertation defense the 17th. All are welcome!

Posted by nlplab at 12:05 PM

December 06, 2004

[TALK] Tatiana Gavrilova

Tatiana Gavrilova
Visiting Fulbright Scholar

Informal Ontology Design

Posted by litman at 01:22 PM

November 29, 2004

[talk] Bo Pang

Bo Pang
Cornell University

Title: A sentimental education: Sentiment analysis using
subjectivity summarization based on minimum cuts.

Abstract

Sentiment analysis, which seeks to identify the viewpoint(s)
underlying a text span, has recently attracted a great deal
of attention. Automatic analysis of such information can be
helpful for business intelligence applications, recommender systems,
and editorial sites. One example application is to determine
a review's sentiment polarity (``thumbs up'' or ``thumbs down'').
In particular, we consider the domain of movie reviews, which was
shown to be difficult for the polarity classification task in
previous work. We propose a novel machine-learning method to
first extract the subjective portions of the documents and then
apply text-categorization techniques to the resulting extracts
rather than to the entire reviews. Discarding the objective
portions of the review helps prevent the polarity classifier
from considering irrelevant or even potentially misleading text;
in addition, subjective extracts created in this process can be
presented to users as summaries of subjective content.

Our results show that the subjective extracts we create compactly
and accurately represent sentiment information: they are as informative
as the original documents while at the same time being 40% shorter.
Depending on the choice of downstream polarity classifier, using these
extracts can even lead to highly statistically significant improvement
for the polarity classification task. Also, we explore extraction
methods based on a minimum cuts formulation, which provides an efficient
and effective means for integrating inter-sentence-level contextual
information with traditional bag-of-words features.

This is joint work with Lillian Lee.

Posted by hwa at 12:30 PM

November 22, 2004

[TALK] Tessa Warren

Tessa Warren (Psychology and LRDC), syntactic complexity and reference, details TBA

Posted by litman at 04:47 PM

November 15, 2004

[Talk] Polarity

Paul Hoffmann

Title: Polarity in Context

Abstract: This talk describes an annotation scheme for marking the polarity of ons and expressive subjective elements in context and presents results of an annotation study.

Posted by nlplab at 11:01 AM

November 08, 2004

[talk] Oren Kurland

Oren Kurland
Cornell University

Title: Corpus structure, language models, and ad hoc information retrieval

Abstract:
The fundamental principle of the language-modeling approach to ad hoc
information retrieval is that given a query, documents will be ranked
according to their estimated language models' similarity to that of the
query.

Most previous work on the language-modeling approach to ad hoc information
retrieval, however, focuses on document specific-characteristics, and
therefore doesn't take into account the structure of the surrounding corpus.
We propose a novel algorithmic framework in which information provided by
document-based language models is enhanced by the incorporation of
information drawn from clusters of similar documents.

In this talk, we will first present the framework and describe a suite of new
algorithms that are natural instantiations of it. Even the simplest typically
outperforms the standard language-modeling approach. We will then discuss
connections to other work such as latent-variable models and present
experimental results which show that our best-performing algorithms post
improvements with respect to state of the art language-modeling based
algorithms over various data corpora.

This is joint work with Lillian Lee.

Posted by hwa at 12:30 PM

November 01, 2004

[TALK] Jan Wiebe

Title: Opinions In Question Answering: Current Research Directions.

Abstract: This talk will describe current research directions in the ARDA AQUAINT project "Opinions in Question Answering". We will focus on our current research in extracting "opinion frames" to represent subjective expressions in text.

Posted by nlplab at 12:30 PM

October 25, 2004

[TALK] Ed Hovy

Ed Hovy, visiting from ISI. Starting time moved to 1 (or later).

Posted by litman at 02:53 PM

October 11, 2004

[Talk] Named Entity Extraction for Arabic

Named Entity Extraction from Arabic Text
Behrang Mohit

In this presentation, I will talk about our ongoing work on the task of Named Entity (NE) Extraction from Arabic text. NE Extraction is a challenging task for Arabic since the language does not have the important word capitalization feature (For English, using only word capitalization can identify NEs with a fairly high accuracy).

As a baseline, we consider porting a system that learns to classify English NEs (Collins and Singer, 1999) to Arabic. Under this framework, the system highlights NEs by a syntactic approach (using parse tree rules) and is followed by an unsupervised classification of names into different classes of named entities. Due to linguistic differences between English and Arabic, a direct application of this approach does not yield as good a result for Arabic as it did for English.

We are currently improving the coverage of this model by adding richer syntactic information. The early part of the talk will include an introduction to the Arabic language structure and some of the major challenges that exist in working with this language.

Posted by behrang at 11:26 PM

October 04, 2004

[Talk] Magnitude Estimation: Useful or not for NLP annotation?

Magnitude Estimation is a technique originally used in psychophysics to measure judgments of sensory stimulation, for instance, brightness and loudness. However, studies in linguistics have shown that magnitude estimation can reliably be used to make other judgments of scale, such as how gramatical is a sentence. Magnitude estimation has been used to judge document relevance, plausibility of adjective-noun pairs, and politeness of spoken Japanese. In my talk, I will be giving an introduction to magnitude estimation and discussing its possible usefulness for the task of annotating the intensity of opinions, emotions, and other private states in text.

Posted by nlplab at 11:32 AM

September 27, 2004

[TALK] WORK HARD ON HARD

Daqing He

In this talk I will talk about our participation (University of Maryland and John's Hopkins University team) to TREC High Accuracy Retrieval of Document (HARD) track in both 2003 and 2004. I will first intruduce the HARD experiment setting, then talk about interactive relevance feedback in HARD framework. I will also talk about building passage retrieval module for identifying sub-document unit that are highly relevant to user's queries. Our passage retrieval module was among the best in the track last year, but it is still far from matching to human performance. Finally, I will talk about some other interesting areas that HARD is trying to explore beyond plain batch news article document retrievals.

Posted by nlplab at 11:25 AM

September 20, 2004

[TALK] Predicting student emotions during spoken tutoring dialogues using word-level prosodic features

Mihai Rotaru

In this talk I will present our ongoing work in developing features and models for detecting student emotional states, given only information available during a spoken tutoring dialogue. Prior research has primarily focused on the use of turn-level prosodic features as predictors. We extend the turn-level prosodic feature set used in our previous studies, and additionally apply these same set of features at the word level. Even under a simplifying word-level emotion model, our preliminary results show an improvement in prediction using word level features compared to using turn level features.

Posted by nlplab at 01:09 PM

September 13, 2004

[talk] MS Thesis Defense talk by Amruta

In this meeting, I will talk about my Master's Thesis that I have
recently finished from the University of Minnesota. This talk will
essentially be same as my thesis defense. Thesis report and defense slides
are available online at - http://www.cs.pitt.edu/~amruta/pubs.html

Title: "Unsupervised Word Sense Discrimination by Clustering Similar
Contexts."

Abstract: Word sense discrimination is the problem of identifying
different contexts that refer to the same meaning of an ambiguous
word. For example, given multiple contexts that include the word
'sharp', we would hope to discriminate between those that refer to an
intellectual sharpness versus those that refer to a cutting sharpness.
Our methodology is based on the strong contextual hypothesis of Miller
and Charles (1991), which states that "two words are semantically
related to the extent that their contextual representations are
similar."

This thesis presents corpus--based unsupervised solutions that
automatically group together contextually similar instances of a word
as observed in a raw text. We do not utilize any manually created or
maintained knowledge--rich resources such as dictionaries, thesauri
or annotated corpora. As a result, our approach is well suited to the
fluid and dynamic nature of word meanings. It is also portable to
different domains and languages, and scales easily to larger samples
of text.

The overall objective of this thesis is to study the effect of various
feature types, context representations and clustering methods on the
accuracy of sense discrimination. We also apply dimensionality
reduction techniques to capture conceptual similarities among the
contexts and don't just rely on the surface forms of words in the text.

Posted by nlplab at 12:30 PM

July 14, 2004

[TALK] Predicting Student Emotions in Computer-Human Tutoring Dialogues

This is Diane's practice talk for the ACL conference.

Abstract: We examine the utility of speech and lexical features for automatically predicting student emotions in human-computer spoken tutoring dialogues. We first annotate studentturns for negative, neutral, positive and mixed emotions.
We then extract acoustic-prosodic features from the speech signal, and lexical
items from the transcribed or recognized speech. We compare the results of
machine learning experiments using these features alone or in combination to
predict various categorizations of the annotated emotions. Our best results yield a 19-36% relative improvement in error reduction over a baseline. Finally, we compare our results with predicting emotion in human-human dialogues.

Posted by litman at 12:39 PM

July 07, 2004

[JOINT TALK] Word Alignment Testbed, POS Tag Projection

Carol Nichols
Karina Ivanetich

We are sharing this date to present our DMP summer research.

Carol's project is creating a test bed for collecting word alignment data from bilingual speakers of English and Chinese for use by a machine translator. This program will also gather data on how sure the people providing the word alignments are about their alignments and how long it took them, and this information would be useful to experimenters studying machine translation and word alignments.

Karina's Abstract:
Some languages (such as English) are rich in annotated resources, while many other languages experience a shortage or absence of annotated data. In addition, human annotation, although highly accurate, is costly in terms of both time and money. Researchers have created systems that utilize well-annotated languages in order to project POS tags onto other languages. However, the result has often been less than accurate. Researchers David Yarowksy and Grace Ngai have added to traditional projection algorithms, and for English-to-French projections, have obtained much higher levels of accuracy.
In my work here this summer, I will attempt to replicate their results, this time for English-to-Chinese projections. Since translation issues differ between these two sets of languages, I am expecting that I will need to improve the model to better serve the English-to-Chinese projections. My presentation will discuss this proposal as well as current progress.

Posted by nlplab at 11:29 AM

June 30, 2004

[TALK] Just How Mad Are You? Finding strong and weak opinion clauses

Theresa Wilson, presenting joint work with Janyce Wiebe and Rebecca Hwa.

This will be a practice talk for AAAI 2004.

Abstract: There has been a recent swell of interest in the automatic identification and extraction of opinions and emotions in text.
In this paper, we present the first experimental results classifying the strength of opinions and other types of subjectivity and classifying the subjectivity of deeply nested clauses. We use a wide range of features, including new syntactic features developed for opinion recognition. In 10-fold cross-validation experiments using support vector regression, we achieve improvements in mean-squared error over baseline ranging from 57\% to 64\%.


Posted by nlplab at 03:54 PM

June 23, 2004

Measuring Agreement among Raters

Kappa is the primary statistic used in NLP research to evaluate agreement among raters. However, there are many problems with the kappa statistic. In this talk I will discuss kappa and how to account for problems not addressed by kappa with different statistics. I will also describe how to calculate a generalizability coefficient that measures the reliability or reproducability of a reference standard created from human raters. I will use data from a current study we are evaluating to help understand how all the agreement statistics can help answer the question, "How good is my reference standard?"

Posted by nlplab at 01:41 PM

June 16, 2004

[talk] Discussion on Giving Talks

Rebecca Hwa

This is not so much a talk but a round-table discussion that I'd like to host. With the conference season fast approaching, it might be good for us to get together and trade ideas on giving presentations.

Posted by hwa at 12:00 PM

June 09, 2004

[TALK] Learning in Spoken vs. Typed Dialogues

Diane Litman and Kate Forbes-Riley

We compare the learning gains from tutoring with spoken versus typed dialogue. In one experiment, the tutor was a human. In the other experiment, the tutor was a tutoring system. The main results of our study are that changing the modality from text to speech caused large differences in the learning gains, time and superficial dialogue characteristics of human tutoring, but for computer tutoring, it made less difference. (This is material that will be presented at the Intelligent Tutoring Systems Conference).

Posted by litman at 10:56 AM

June 02, 2004

Opinions in Question Answering

Jan Wiebe will describe a new project entitled Opinions in Question Answering. The project is part of the ARDA AQUAINT Question Answering program, and is joint with Claire Cardie at Cornell and Ellen Riloff at Utah. The goals of the project are to extract detailed information about opinions from text and then create summary representations of the opinions expressed about a topic in one or many documents.

Posted by nlplab at 12:00 PM

May 26, 2004

[TALK] Co-training for Predicting Emotions with Spoken Dialogue Data

Beatriz Maeireizo Tokeshi

On May 26th 2004, Beatriz will give a small talk about the poster submitted to ACL 2004, as a result of the research done IN the CS PhD course 2002 (Research Experience in CS).

ABSTRACT
Natural Language Processing applica-tions often require large amounts of an-notated training data, which are expensive to obtain. In this paper we in-vestigate the applicability of Co-training to train classifiers that predict emotions in spoken dialogues. In order to do so, we have first applied the wrapper ap-proach with Forward Selection and Naïve Bayes, to reduce the dimensionality of our feature set. Our results show that Co-training can be highly effective when a good set of features are chosen.

Posted by at 11:55 AM

May 19, 2004

meeting agenda

The NLP Group will continue its weekly meetings throughout the summer. This week, we will meet to set up the talk schedule for the rest of the term.

Posted by hwa at 02:58 PM

May 10, 2004

[TALK] The Role of Initiative in Tutorial Dialogue

Mark Core
University of Edinburgh

Monday May 10 , 10:00
731 LRDC

This work is the first systematic investigation of initiative in
human-human tutorial dialogue. We studied initiative management in two
dialogue strategies: didactic tutoring and Socratic tutoring. We
hypothesized that didactic tutoring would be mostly tutor-initiative while
Socratic tutoring would be mixed-initiative, and that more student
initiative would lead to more learning (i.e., task success for the
tutor). Surprisingly, students had initiative more of the time in the
didactic dialogues (21% of the turns) than in the Socratic dialogues (10%
of the turns), and there was no direct relationship between student
initiative and learning. However, Socratic dialogues were more interactive
than didactic dialogues as measured by percentage of tutor utterances that
were questions and percentage of words in the dialogue uttered by the
student, and interactivity had a positive correlation with learning.

(The above is his EACL 2003 talk. Since that was a short talk,
if time permits he might also present some research that he is
presenting at HLT-NAACL...

Robustness versus Fidelity in Natural Language Understanding

A number of issues arise when trying to scale-up natural language
understanding (NLU) tools designed for relatively simple domains (e.g.,
flight information) to domains such as medical advising or tutoring where
deep understanding of user utterances is necessary. Because the subject
matter is richer, the range of vocabulary and grammatical structures is
larger meaning NLU tools are more likely to encounter out-of-vocabulary
words or extra-grammatical utterances. This is especially true in medical
advising and tutoring where users may not know the correct vocabulary and
use common sense terms or descriptions instead. Techniques designed to
improve robustness (e.g., skipping unknown words, relaxing grammatical
constraints, mapping unknown words to known words) are effective at
increasing the number of utterances for which an NLU sub-system can produce
a semantic interpretation. However, such techniques introduce additional
ambiguity and can lead to a loss of fidelity (i.e., a mismatch between the
semantic interpretation and what the language producer meant). To control
this trade-off, we propose semantic interpretation confidence scores akin
to speech recognition confidence scores, and describe our initial attempt
to compute such a score in a modularized NLU sub-system.)

----

Short bio:

Mark received his Ph.D. from the University of Rochester under the supervision
of Len Schubert. The subject of his dissertation was dialog parsing; his
dialog parser identified speech repairs as well as the dialogue acts of
utterances. Starting in 2000, Mark has been a researcher at the University
of Edinburgh, working with Johanna Moore on the BEETLE tutorial dialogue
system. He built a natural language understanding module for BEETLE using
the CARMEL workbench, adding features such as unknown word handling and
semantic-confidence-score calculation. The second area of his research is
dialogue annotation and analysis, looking at phenomena such as initiative,
and dialogue acts and games.

Posted by litman at 10:53 AM

May 07, 2004

[TALK] Empirical Approaches to Pronoun Resolution

Joel Tetrault
University of Rochester

Friday May 7, 1:30
731 LRDC

In a spoken dialog system, the job of a reference resolution module is to
identify noun phrases and resolve them to entities evoked in the dialogue.
This involves finding antecedents for pronouns such as "that" or "they" and
resolving definite noun phrases such as "the two hospitals" or "the ambulance
here." Though reference is just one part of the overall interpretation of
a sentence, it is a very important piece because failure to resolve the
entities in a sentence correctly can lead to an incorrect interpretation
of a sentence and thus an erroneous response to the user.

Many approaches to reference resolution, specifically pronoun resolution,
have relied heavily on syntactic and surface features. While these
methods are able to perform very well, such as resolving as much as 80% of
the pronouns in a large corpus correctly, the "20% gap" has been hard
to overcome because these pronoun require additional information on top of
syntactic features for resolution. In this talk I present work that
incorporates discourse structure and semantic features into a pronoun
resolution algorithm to improve performance over two types of corpora: a
newspaper domain (Penn Treebank) and human-human spoken dialogue.

Short Bio:

Joel Tetreault is in his final year of his PhD in Computer Science at the
University of Rochester. He received his bachelor's degree from Harvard
University in 1998 and Master's from Rochester in 2000. His main
interest is Natural Language Processing. He has done work in reference
resolution, discourse processing, spoken dialogue systems, and information
retrieval techniques for detecting affect.

Posted by litman at 10:48 AM

April 21, 2004

[TALK]: Predicting Emotion in Spoken Dialogue from Multiple Knowledge Sources

Diane will give a practice talk (about 20 minutes) of our HLT-NAACL paper:

==================================================================
TITLE
-----
Predicting Emotion in Spoken Dialogue from Multiple Knowledge Sources
==================================================================
AUTHORS
-------
Kate Forbes-Riley and Diane Litman
==================================================================
ABSTRACT
--------
We examine the utility of multiple types of turn-level and contextual
linguistic feature s for automatically predicting student emotions in
human-human spoken tutoring dialogues. We first annotate student
turns in our corpus for negative, neutral and positive emotions. We
then automatically extract features representing acoustic-prosodic and
other linguistic information from the speech signal and associated
transcriptions. We compare the results of a variety of machine
learning experiments using different feature sets to predict the
annotated emotions. Our best performing feature set contains both
acoustic-p rosodic and other types of linguistic features, extracted
from both the current turn and a context o f previous student turns.
This feature set yields a prediction accuracy of 84.75%, which is a
44% relative improvement in error reduction over a baseline. Our
results suggest that the intelligent tutoring spoken dialogue system
we are developing can be enhanced to automatically predict and adapt
to student emotions.

Posted by litman at 12:00 PM

April 07, 2004

Spring Symposium Report

On April 7, Jan will summarize the AAAI Spring Symposium on

Exploring Attitude and Affect in Text: Theories and Applications

Posted by litman at 12:00 PM

April 01, 2004

[talk] Regina Barzilay's visit

Regina Barzilay will be a guest speaker in the Department of Computer Science colloquium series. She will be here on both 4/1 and 4/2.

NOTE: The talk is on Thurs. afternoon (4/1), not Friday morning

>
> What: Learning to Model Text Structure
> When: 4/1 at 3:30pm, refreshments at 3
> Where: SENSQ 5317/9
>
> Talk abstract:
>
> The natural language processing community has struggled for years to
> develop computational models of text structure. Such models are essential
> both for interpretation of human-written text and for evaluation of
> machine-generated text. Applications such as text summarization and
> machine translation would greatly benefit from such models.
>
> In this talk, I will present our first steps towards learning to model
> text structure. I will describe two models that are induced from a large
> collection of unannotated texts. The first model captures the notion of
> text cohesion by considering connectivity patterns characteristic of
> well-formed texts. These patterns are inferred from a matrix that
> combines distributional and syntactic information about text entities. The
> second model captures the content structure of texts within a specific
> domain, in terms of the topics the texts address and the order in which
> these topics appear. I will present an effective method for learning
> content models, utilizing a novel adaptation of algorithms for Hidden
> Markov Models. To conclude my talk, I will show how these text models can
> be effectively integrated into natural language generation and
> summarization systems.
>
> This is joint work with Mirella Lapata and Lillian Lee.
>
>

Posted by hwa at 03:30 PM

March 17, 2004

[Talk] Exploring the Topic Dependence of Features for Opinion Recognition

Theresa Wilson and Janyce Wiebe


We present work investigating the topic dependence of words and phrases that have been used in automatic opinion and sentiment recognition. This work is based on machine learning experiments in opinion recognition using topics for cross validation instead of random splits of the data. We find that the clues from previous work are very robust to changes in topic. Surprisingly, while bag-of-words features are not as robust, they do not degrade as much as expected. The best results are obtained when all clues are combined.

Posted by nlplab at 06:37 PM

March 03, 2004

[talk] Chapman

Abstract TBA

Posted by hwa at 12:00 PM

February 11, 2004

Annotating Student Emotional States in Spoken Tutoring Dialogues

Kate Forbes-Riley and Diane Litman

We present an annotation scheme for student emotions in tutoring dialogues.
Analyses of our scheme with respect to interannotator agreement and predictive accuracy
indicate that our scheme is reliable in our domain, and that our emotion
labels can be predicted with a high degree of accuracy.
We discuss issues concerning the implementation of emotion
prediction and adaptation in the computer tutoring dialogue system we are developing.

Posted by hwa at 11:43 AM

January 28, 2004

Multimodal Person Recognition using HMM Based Face Recognition and GMM Speaker Recognition

Beatriz Maeireizo-Tokeshi presents the work she did while interning in Japan.

Posted by hwa at 11:55 AM

January 21, 2004

Annotating Attitude Types

Sarah Kura, Jan Wiebe, Theresa Wilson discuss the latest development in their work on annotating attitude types.

Posted by hwa at 11:32 AM

January 14, 2004

Reading Comprehension Question-Answering Projects

Diane Litman, Mihai Rotaru, Behrang Mohit, Yanna Shen, Art Ward present the results for the reading comprehension question-answering projects for the Fall 2003 NLP Class.

Posted by hwa at 11:36 AM

November 19, 2003

Early Efforts Towards an Information Extraction System for Surgical Pathology Reports: Negation and Attribute:Values

Rebecca Crowley and Kevin Mitchell

Medical Reports are an important and fertile area for Natural Language
Processing. Information from these free-text documents would be extremely
valuable if it could be automatically extracted and combined with other
data. However, Information Extraction from medical text poses significant
challenges. We describe the early development of a system for Information
Extractipn from Surgical Pathology Reports - a document which contains
essential data related to Cancer diagnosis and prognosis. It includes a
GATE implementation of NegEx - Wendy Chapman's algorithm for negation
detection. We will spend the first half of the talk describing our system
and detailing an evaluation of the Negation tagger compared to a
human-annotated corpus of negations. In the second half of the talk -
we'll show you a set of human annotated examples of attribute:value pairs
and shamelessly solicit advice on how to best extract them.

Posted by hwa at 11:39 AM

November 05, 2003

Chinese Question-Answering

Speaker: Yanna Shen

Abstract:

Question Answering has become a growing interest in the NLP area in recent
years. But Chinese Question Answering systems still lack behind, so I am
interested in doing some work in Chinese Question Answering systems.
This work was done with other fellow students in NLP Laboratory at
Northeastern University, China. We just borrowed some ideas from several
QA papers and built a small QA demo. Then we tried to utilize these ideas
into the Chinese QA system.
In this talk, I will discuss the design of the demo, and give a few points
in building a Chinese QA system.

Posted by hwa at 10:04 PM

October 29, 2003

Semantic Extraction with Wide-Coverage Lexical Resources

Speaker: Behrang Mohit

Semantic Extraction is an NLP task that pertains to the assignment of
semantic bindings to short units of text (usually sentences). NLP problems
such as Information Extraction, Question Answering Systems and Text
Classification Systems could benefit from Semantic Extraction. We have
used two manually-built knowledge bases (WordNet and FrameNet) to automate
Semantic Extraction.

In my presentation, I will give an overview of the FrameNet project and
then talk about my work with Srini Narayanan on Semantic Extraction. I
presented this work last summer as a short paper in NAACL-HLT 2003. The
paper can be downloaded from:
http://www.cs.pitt.edu/~behrang/MohitNarayananHLT2003.pdf

Posted by hwa at 10:00 PM

October 15, 2003

Recognizing Emotions from Student Speech in Tutoring Dialogues

Speakers: Diane Litman and Kate Forbes

Abstract:

We investigate the automatic classification of student emotional
states in a corpus of human-human spoken tutoring dialogues. We
first annotated student turns in this corpus for negative, neutral and
positi ve emotions. We then automatically extracted acoustic and
prosodic features from the student speech, and compared the results of
a variety of machine learning algorithms that use 8 different feature
sets to predict the annotated emotions. Our best results have an
accuracy of 80.53% and show 26.28% relative improvement over a
baseline. These results suggest that th e intelligent tutoring spoken
dialogue system we are developing can be enhanced to automatically
predict and adapt to student emotional states.

This will be an early practice talk for a paper that
will be presented in December at ASRU.

Posted by hwa at 10:02 PM

October 01, 2003

Identifying respiratory findings from emergency department reports with

Speaker: Wendy Chapman

Abstract

Biosurveillance systems use electronic patient medical information to
monitor for possible natural or bioterristic outbreaks. Currently, the only
information used by these systems is a patient's triage chief complaint,
which is a short phrase describing the patient's reason for coming to an
emergency room. To monitor for specific diseases or syndromes like Severe
Acute Respiratory Syndrome (SARS) or pneumonia, more specific clinical
information needs to be gathered. That information is in free-text patient
reports.

I will describe a project I embarked on this summer at the National Library
of Medicine in which I applied an NLP indexing tool called MetaMap that was
created for the literature to the task of identifying respiratory findings
from emergency department reports.

Posted by hwa at 10:03 PM

August 31, 2003

Summary of 2002-2003 Meetings

* 06/19: Kate Forbes (Annotating Emotion in Spoken Tutoring Dialogues: Working Session)
* 06/18: Lin Ma (Predicting Medical Reasoning Codings in Pathology Protocols using Natural Language Features: Master's Project Presentation)
* 04/28: Mihai Rotaru (Practice CoNLL03 talk)
* 04/14: Janyce Wiebe and Theresa Wilson (Learning Extraction Patterns for Subjective Expressions)
* 03/31: Kate Forbes (Preliminary Results from the ITSPOKE Spoken Tutorial Dialogue Corpus)
* 03/17: Mihai Roturu (Comparing Command, Normal, and Hyperarticulated Speech)
* 03/10: Wendy Chapman
* 02/03: Janyce Wiebe (Improving Subjectivity Classification using Features Learned from Extraction Patterns)
* 11/14: Theresa Wilson (A First Exploration of Subjective Language in Spoken Dialogue)
* 10/31: Diane Litman and Scott Silliman (Spoken Dialogue for the Why2 Intelligent Tutoring System)
* 10/17: Mihai Rotaru (Typicality and Natural Language Learning)
* 10/03: Wendy Chapman (NLP in Medicine)
* 09/26: Theresa Wilson (Opinion Annotation in Newspaper Articles)

Posted by hwa at 10:06 PM