Natural Language Processing       

links

Natural Language Processing Publications



2016

Web Information Extraction
Laura Chiticariu, Marina Danilevsky, Howard Ho, Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu
Encyclopedia of Database Systems, pp. 1--9, Springer New York, 2016


2015



VINERy: A Visual IDE for Information Extraction
Yunyao Li, Elmer Kim, Marc A. Touchette, Ramiya Venkatachalam, Hao Wang
PVLDB 8(12), 1948--1959, 2015

Generating High Quality Proposition Banks for Multilingual Semantic Role Labeling
Alan Akbik, Laura Chiticariu, Marina Danilevsky, Yunyao Li, Shivakumar Vaithyanathan, Huaiyu Zhu
ACL, pp. 397--407, 2015



2014

SystemML’s Optimizer: Plan Generation for Large-Scale Machine Learning Programs
Matthias Boehm, Douglas R Burdick, Alexandre V Evfimievski, Berthold Reinwald, Frederick R Reiss, Prithviraj Sen, Shirish Tatikonda, Yuanyuan Tian
2014 - 131.107.65.22

Cleaning inconsistencies in information extraction via prioritized repairs
Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
Proceedings of the 33rd ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems, pp. 164--175, 2014

Compiling text analytics queries to FPGAs
Raphael Polig, Kubilay Atasu, Heiner Giefers, Laura Chiticariu
24th International Conference on Field Programmable Logic and Applications, FPL 2014, Munich, Germany, 2-4 September, 2014, pp. 1--6

Giving text analytics a boost
Raphael Polig, Kubilay Atasu, Laura Chiticariu, Christoph Hagleitner, H Peter Hofstee, Frederick R Reiss, Huaiyu Zhu, Eva Sitaridi
IEEE Micro 34(4), 6--14, IEEE, 2014


2013

Spanners: A Formal Framework for Information Extraction
Ronald Fagin, Benny Kimelfeld, Frederick Reiss, Stijn Vansummeren
PODS, 2013

Hardware-Accelerated Regular Expression Matching for High-Throughput Text Analytics
Kubilay Atasu, Raphael Polig, Christoph Hagleitner and Frederick. R. Reiss
23rd International Conference on Field Programmable Logic and Applications, pp. 1--7, IEEE, 2013

Adaptive Parser-Centric Text Normalization
Congle Zhang, Tyler Baldwin, Howard Ho, Benny Kimelfeld, Yunyao Li
Proceedings of ACL, pp. 1159--1168, 2013
slides

I can do text analytics!: designing development tools for novice developers
Huahai Yang, Daina Pupons-Wickham, Laura Chiticariu, Yunyao Li, Benjamin Nguyen, Arnaldo Carreno-Fuentes
Proceedings of the 2013 ACM annual conference on Human factors in computing systems, pp. 1599--1608
slideshare


Semantic Technologies in IBM Watson
A. Gliozzo, O. Biran, S. Patwardhan, K. McKeown
Proceedings of the Fourth Workshop on Teaching NLP and CL, pp. 85--92, 2013

Long-Distance Time-Event Relation Extraction
A. Moschitti, S. Patwardhan, C. Welty
Proceedings of the International Joint Conference on Natural Language Processing, pp. 1330--1338, 2013

Parallel and Nested Decomposition for Factoid Questions
B. Boguraev, S. Patwardhan, A. Kalyanpur, J. Chu-Carroll, A. Lally
Natural Language EngineeringFirstView, 1--28, 2013


Domain-Adaptive Translation Models Based on Bilingual Data Clustering
Fei Huang and Bing Xiang
Technical Report, 2013

Efficient Domain-adaptive Word Segmentation with Larger Context and Co-training
Fei Huang, Abraham Ittycheriah and Salim Roukos
RC25411 (WAT1309-085), 2013

Generalized Reordering Rules for Improved SMT
Fei Huang and Cezar Pendus
ACL, pp. 387-392, ACL, 2013

Automatic Term Ambiguity Detection
Tyler Baldwin, Yunyao Li, Bogdan Alexe, Ioana R Stanoi
Proceedings of ACL, 2013

Flexible and Efficient Hypergraph Interactions for Joint Hierarchical and Forest-to-String Decoding
Martin C, Haitao Mi, Bowen Zhou
EMNLP 2013

Anchor Graph: Global Reordering Contexts for Statistical Machine Translation
Hendra Setiawan, Bowen Zhou and Bing Xiang
EMNLP 2013: Conference on Empirical Methods in Natural Language Processing

What is Hidden among Translation Rules
Libin Shen and Bowen Zhou
EMNLP 2013: Conference on Empirical Methods in Natural Language Processing

A Corpus Level MIRA Tuning Strategy for Machine Translation
Ming Tan, Tian Xia, Shaojun Wang and Bowen Zhou
EMNLP 2013: Conference on Empirical Methods in Natural Language Processing

Incorporating author preference in sentiment rating prediction of reviews
Subhabrata Mukherjee, Gaurab Basu, Sachindra Joshi
Proceedings of the 22nd international conference on World Wide Web companion (WWW 2013), pp. 47--48

Intent classification of voice queries on mobile devices
Subhabrata Mukherjee, Ashish Verma, Kenneth W Church
Proceedings of the 22nd international conference on World Wide Web companion (WWW 2013), pp. 149--150

Sentiment Aggregation using ConceptNet Ontology
Subhabrata Mukherjee, Sachindra Joshi
To Appear In Proceedings of The 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), Nagoya, Japan, Oct 14-18, 2013

Offering language based services on social media by identifying user's preferred language(s) from romanized text
Mitesh M. Khapra, Salil Joshi, Ananthakrishnan Ramanathan, Karthik Visweswariah
22nd International World Wide Web Conference, WWW '13, Rio de Janeiro, Brazil, May 13-17, 2013, Companion Volume, pp. 71--72

More than meets the eye: Study of Human Cognition in Sense Annotation
Salil Joshi, Diptesh Kanojia, Pushpak Bhattacharyya
Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, Proceedings, June 9-14, 2013, Westin Peachtree Plaza Hotel, Atlanta, Georgia, USA, pp. 733--738

I can do text analytics!: designing development tools for novice developers
Huahai Yang, Daina Pupons-Wickham, Laura Chiticariu, Yunyao Li, Benjamin Nguyen, Arnaldo Carreno-Fuentes
CHI, pp. 1599-1608, 2013


Analysis of Watson's Strategies for Playing Jeopardy!
G. Tesauro, D. G. Gondek, J. Lenchner, J. Fan and J. M. Prager
JAIR47, 205-251, 2013
Abstract

Analysis of Watson's Strategies for Playing Jeopardy!
G. Tesauro, D. G. Gondek, J. Lenchner, J. Fan and J. M. Prager
Journal of Artificial Intelligence Research47, 205-251, 2013
Abstract

Tools and Methods for Building Watson
Eric Brown, Eddie Epstein, J William Murdock, Tong-Haing Fin
IBM Research Report RC25356, 2013
Abstract

Enlisting the Ghost: Modeling Empty Categories for Machine Translation
Bing Xiang, Xiaoqiang Luo, Bowen Zhou
The 51st Annual Meeting of the Association for Computational Linguistics, 2013

Two-Neighbor Orientation Model with Cross-Boundary Global Contexts Our ACL Talk Slides
Hendra Setiawan, Bowen Zhou, Bing Xiang and Libin Shen
The 51st Annual Meeting of the Association for Computational Linguistics, 2013

Statistical Machine Translation for Speech: A Perspective on Structures, Learning, and Decoding
Bowen Zhou [accepted version in PDF]
Proceedings of the IEEE (Volume:101 , Issue: 5 ) 101, 1180 - 1202 , 2013
Abstract

Multiscale Manifold Learning
Chang Wang and Sridhar Mahadevan
The 27th AAAI Conference on Artificial Intelligence (AAAI 2013)

Manifold Alignment Preserving Global Geometry
Chang Wang and Sridhar Mahadevan
The 23rd International Joint Conference on Artificial Intelligence (IJCAI 2013),

Detecting factual inconsistencies between a document and a fact-base
Indrajit Bhattacharya, Tanveer A. Faruquie, Shantanu Godbole, Mukesh K. Mohania, Ullas B. Nambiar (United States: US8370275)
US Patent 8,370,275

Distant Supervision for Relation Extraction with an Incomplete Knowledge Base
Bonan Min, Ralph Grishman, Li Wan, Chang Wang, David Gondek
The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2013)

The IBM speech-to-speech translation system for smartphone: improvements for resource-constrained tasks
Bowen Zhou, Xiaodong Cui, Songfang Huang, Martin Cmejrek, Wei Zhang, Jian Xue, Jia Cui, Bing Xiang, Gregg Daggett, Upendra V. Chaudhari, Sameer Maskey, Etienne Marcheret
Computer Speech and Language 27(2), 592-618, Elsevier, 2013

Optimizing Temporal Topic Segmentation for Intelligent Text Visualization
Shimei Pan, Michelle X. Zhou, Yangqiu Song, Weihong Qian, Fei Wang, Shixia Liu
International Conference on Intelligent User Interfaces (IUI), ACM, 2013
Abstract

Discriminative Training of 150 Million Translation Parameters and Its Application to Pruning
Hendra Setiawan, Bowen Zhou
Proceedings of NAACL-HLT, pp. 335--341, Association for Computational Linguistics, 2013

N-gram posterior probability confidence measures for statistical machine translation: an empirical study
Adria Gispert, Graeme Blackwood, Gonzalo Iglesias, William Byrne
Machine Translation27, 85-114, Springer Netherlands, 2013

Constrained Text Co-Clustering with Supervised and Unsupervised Constraints
Yangqiu Song, Shimei Pan, Shixia Liu, Furu Wei, Michelle Zhou, Weihong Qian
IEEE Transactions on Knowledge and Data Engineering (TKDE), 2013


2012

WizIE: a best practices guided development environment for information extraction
Yunyao Li, Laura Chiticariu, Huahai Yang, Frederick R Reiss, Arnaldo Carreno-Fuentes
Proceedings of the ACL 2012 System Demonstrations, pp. 109--114

WizIE: A Best Practices Guided Development Environment for Information Extraction
Yunyao Li, Laura Chiticariu, Huahai Yang, Frederick Reiss, Arnaldo Carreno-Fuentes
ACL (Demonstration), pp. 109-114, 2012

Towards Efficient Named-Entity Rule Induction for Customizability
Ajay Nagesh, Ganesh Ramakrishnan, Laura Chiticariu, Rajasekar Krishnamurthy, Ankush Dharkar, Pushpak Bhattacharyya
EMNLP-CoNLL, pp. 128-138, 2012

Refining a dictionary for information extraction
Laura Chiticariu, Vitaly Feldman, Frederick R Reiss, Sudeepa Roy, Huaiyu Zhu
US Patent App. 13/598,946

Distilling and Exploring Nuggets from a Corpus (demo paper)
Vittorio Castelli, Hema Raghavan, Radu Florian, Ding-jung B Han, Xiaoqiang Luo, Salim Roukos
The 35th International ACM SIGIR conference on research and development in Information Retrieval, 2012

Labeling by Landscaping: Classifying Tokens in Context by Pruning and Decorating Trees
S. Patwardhan, B. Boguraev, A. Agarwal, A. Moschitti, J. Chu-Carroll
Proceedings of CIKM '12: International Conference on Information and Knowledge Management, 2012

When Did that Happen? -- Linking Events and Relations to Timestamps
D. Hovy, J. Fan, A. Gliozzo, S. Patwardhan, C. Welty
Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, 2012

Finding Needles in the Haystack -- Search and Candidate Generation
J. Chu-Carrol, J. Fan, B. Boguraev, D. Carmel, D. Sheinwald, C. Welty
IBM Journal of Research and Development, 2012

When Did that Happen? -- Linking Events and Relations to Timestamps
D. Hovy, J. Fan, A. Gliozzo, S. Patwardhan and C. Welty
13th Conference of the European Chapter of the Association for Computational Linguistics (EACL), 2012

Question Analysis: How Watson Reads a Clue
A. Lally, J. Prager, M. McCord, B. Boguraev, S. Patwardhan, J. Fan, P. Fodor, J. Chu-Carroll
IBM Journal of Research and Development, 2012

Relation Extraction and Scoring in DeepQA
C. Wang, A. Kalyanpur, J. Fan, B. Boguraev, D. Gondek
IBM Journal of Research and Development, 2012

Automatic Knowledge Extraction from Documents
J. Fan, A. Kalyanpur, D. Gondek and D. Ferrucci
IBM Journal of Research and Development, 2012

Making Watson fast
E. A. Epstein, M. I. Schor, B. Iyer, A. Lally, E. W. Brown, J. Cwiklik
IBM Journal of Research and Development 56(3.4), 15--1, IBM, 2012

A framework for merging and ranking of answers in DeepQA
D. C. Gondek, A. Lally, A. Kalyanpur, J. W. Murdock, P. Duboue, L. Zhang, Y. Pan, Z. M. Qiu, C. Welty
IBM Journal of Research and Development 56(3/4), 14:1 - 14:12, 2012

Fact-based question decomposition in DeepQA
A. Kalyanpur, S. Patwardhan, B. K. Boguraev, A. Lally, J. Chu-Carroll
IBM Journal of Research and Development 56(3.4), 2012

Question analysis: How Watson reads a clue
A. Lally, J. M. Prager, M. C. McCord, B. K. Boguraev, S. Patwardhan, J. Fan, P. Fodor, J. Chu-Carroll
IBM Journal of Research and Development 56(3.4), 2--1, IBM, 2012

Typing candidate answers using type coercion
JW Murdock, A Kalyanpur, C Welty, J Fan, DA Ferrucci, DC Gondek, L Zhang, H Kanayama
IBM Journal of Research and Development 56(3/4), 7:1 - 7:13, IBM, 2012
Abstract

Deep parsing in Watson
MC McCord, JW Murdock, BK Boguraev
IBM Journal of Research and Development 56(3/4), 3:1 - 3:15, 2012

Identifying implicit relationships
J. Chu-Carroll, E. W. Brown, A. Lally, J. W. Murdock
IBM Journal of Research and Development 56(3/4), 12:1 - 12:10, 2012

Textual evidence gathering and analysis
J. W. Murdock, J. Fan, A. Lally, H. Shima, B. K. Boguraev
IBM Journal of Research and Development 56(3/4), 8:1 - 8:14, 2012
Abstract

Structured Data and Inference in DeepQA
A. Kalyanpur, B. Boguraev, S. Patwardhan, J.W. Murdock, A. Lally, C. Welty, J. Prager, B. Coppola, A. Fokoue
IBM Journal of Research and Development 56(3/4), 10:1 - 10:14, IBM, 2012


2011

Facilitating pattern discovery for relation extraction with semantic-signature-based clustering
Yunyao Li, Vivian Chu, Sebastian Blohm, Huaiyu Zhu, Howard Ho
Proceedings of the 20th ACM international conference on Information and knowledge management, pp. 1415--1424, 2011

The SystemT IDE: an integrated development environment for information extraction rules
Laura Chiticariu, Vivian Chu, Sajib Dasgupta, Thilo W. Goetz, Howard Ho, Rajasekar Krishnamurthy, Alexander Lang, Yunyao Li, Bin Liu, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu
SIGMOD (Demonstration), pp. 1291-1294, 2011

A Graph Approach to Spelling Correction in Domain-Centric Search.
Zhuowei Bao, Benny Kimelfeld, Yunyao Li
ACL, pp. 905--914, 2011

The SystemT IDE: an integrated development environment for information extraction rules
Laura Chiticariu, Vivian Chu, Sajib Dasgupta, Thilo W Goetz, Howard Ho, Rajasekar Krishnamurthy, Alexander Lang, Yunyao Li, Bin Liu, Sriram Raghavan, others
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data, pp. 1291--1294


SystemT: A Declarative Information Extraction System.
Yunyao Li, Frederick Reiss, Laura Chiticariu
ACL (System Demonstrations), pp. 109--114, 2011

Entity Detection and Tracking
Xiaoqiang Luo, Imed Zitouni
Multi-Lingual Natural Language Processing, 2011

Learning to Transform and Select Elementary Trees for Improved Syntax-based Machine Translations
Bing Zhao, Young-Suk Lee, Xiaoqiang Luo, Liu Li
Proc. Annual Meeting of Association of Computational Linguistics (ACL), pp. 846--855, 2011

A Statistical Tree Annotator and Its Applications
Xiaoqiang Luo, Bing Zhao
Proc. Annual Meeting of Association of Computational Linguistics (ACL), 2011

Mining Knowledge from Large Corpora for Type Coercion in Question Answering
James Fan, Aditya Kalyanpur, J. William Murdock, and Branimir K. Boguraev
Web Scale Knowledge Extraction (WEKEX) Workshop at International Semantic Web Conference, 2011

Selectivity estimation for extraction operators over text data
Daisy Zhe Wang, Long Wei, Yunyao Li, Frederick Reiss, Shivakumar Vaithyanathan
Data Engineering (ICDE), 2011 IEEE 27th International Conference on, pp. 685--696, Citeseer

Optimal Training Data Selection for Rule-based Data cleansing Models
S Chaturvedi, T A Faruquie, L V Subramaniam, K H Prasad, G Venkatachaliah, S Padmanabhan
SRII Global Conference (SRII), 2011 Annual, pp. 126--134


Fact-Based Question Decomposition for Candidate Answer Re-Ranking
Aditya Kalyanpur, Siddharth Patwardhan, Branimir Boguraev, Adam Lally, and Jennifer Chu-Carroll
Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), 2011, pp. 2045--2048

Citation recommendation without author supervision
Q He, D Kifer, J Pei, P Mitra, C L Giles
Proceedings of the fourth ACM international conference on Web search and data mining (WSDM), pp. 755--764, 2011

A SYSTEM AND METHOD FOR TIME-AWARE TECHNOLOGY RELATION MINING
Qi He, W Spangler, Bin He, Ying Chen, with intern Xin Jin

A Semi-supervised Data Integration Model for Named Entity Classification
Qi He, W Spangler

Systems, Method and Computer Program Products for Fast and Scalable Proximal Search for Search Queries
Bin He, W Spangler, Qi He, with intern Sumit Bhatia

L1 vs. L2 Regularization in Text Classification when Learning from Labeled Features
S Mazilu, J Iria
Proceedings of the 10th IEEE International Conference on Machine Learning and Applications, 2011

Smarter log analysis
E Aharoni, S Fine, Y Goldschmidt, O Lavi, O Margalit, M Rosen-Zvi, L Shpigelman
IBM Journal of Research and Development 55(5), 10:1 - 10:10 , IBM, 2011

Mining Knowledge from Large Corpora for Type Coercion in Question Answering
James Fan, Aditya Kalyanpur, J. William Murdock and Branimir K. Boguraev
Web Scale Knowledge Extraction (WEKEX) Workshop at International Semantic Web Conference, 2011

Leveraging Community-built Knowledge for Type Coercion in Question Answering
Aditya Kalyanpur, J William Murdock, James Fan, Chris Welty
International Semantic Web Conference (ISWC 2011). Winner of the Best Paper Award (In-Use Track), pp. 144--156, Springer

Using Syntactic and Semantic Structural Kernels for Classifying Definition Questions in Jeopardy!
A Moschitti, J Chu-Carroll, S Patwardhan, J Fan, G Riccardi
Proceedings of the Conference on Empirical Methods for Natural Language Processing, pp. 73--76, 2011

Leveraging Wikipedia Characteristics for Search and Candidate Generation in Question Answering
J Chu-Carroll, J Fan
Twenty-Fifth AAAI Conference on Artificial Intelligence, 2011

Latent Graphical Models for Quantifying and Predicting Patent Quality.
Liu Y., Hsueh, P. et al.
17th ACM Knowledge Discovery and Data Mining (KDD 2011)(TOP PIC CONFERENCE)

A Unified Alignment Algorithm for Bilingual Data
C.Tillmann and S. Hewavitharana
Natural Language Engineering Journal, (Published online), 2011

An Efficient Unified Alignment Algorithm for Bilingual Data
C.Tillmann and S. Hewavitharana
Interspeech 2011

The IBM 2009 GALE Arabic speech transcription system
Brian Kingsbury, Hagen Soltau, George Saon, Stephen Chu, Hong-Kwang Kuo, Lidia Mangu, Suman Ravuri, Nelson Morgan, Adam Janin
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 4672--4675

Relation Extraction with Relation Topics
Chang Wang, James Fan, Aditya Kalyanpur, and David Gondek
The 2011 Conference on Empirical Methods in Natural Language Processing (EMNLP 2011).

Structure Mapping for Jeopardy! Clues
J William Murdock
19th International Conference on Case Based Reasoning (ICCBR'11), pp. 6-10, Springer-Verlag, 2011

Relevance Feedback Exploiting Query-Specific Document Manifolds
Chang Wang, Emine Yilmaz, and Martin Szummer
The 20th ACM Conference on Information and Knowledge Management (CIKM2011)

Active online classification via information maximization
N Slonim, K Crammer, and E Yom-Tov
22nd International Joint Conference on Artificial Intelligence (IJCAI), 2011

Prognostic Data-Driven Clinical Decision Support - Formulation and Implications
R Rinott, B Carmeli, C Kent, D Landau, Y Maman, Y Rubin, and N Slonim
23rd International Conference of the European Federation for Medical Informatics (MIE), 2011

Heterogeneous Domain Adaptation using Manifold Alignment
Chang Wang and Sridhar Mahadevan
The 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011)

Jointly Learning Data-Dependent Label and Locality-Preserving Projections
Chang Wang and Sridhar Mahadevan
The 22nd International Joint Conference on Artificial Intelligence (IJCAI 2011)

Enterprise blogging in a global context: comparing Chinese and American practices
Qinying Liao, Shimei Pan, Jennifer C Lai, Chang Yang
Proceedings of the ACM 2011 conference on Computer supported cooperative work, pp. 35--44, ACM

Automatic Classification of Change Requests for Improved IT Service Quality
C Kadar, D Wiesmann, J Iria, D Husemann, M Lucic
Proceedings of the SRII Global Conference 2011, pp. 430--439

Domain Adaptation for Text Categorization by Feature Labeling
C Kadar, J Iria
Proceedings of the 33rd European Conference on Information Retrieval (ECIR'11) (runner up for best paper award), 2011

Handling Complexity in Decoding for SMT
C. Tillmann
Handbook of Natural Language Processing and Machine Translation: DARPA Global Autonomous Language Exploitation. Joseph Olive, Caitlin Christianson, John McCary (Editors), pp. 280-287, 2011


2010

Refining Information Extraction Rules using Data Provenance.
Bin Liu, Laura Chiticariu, Vivian Chu, HV Jagadish, Frederick Reiss
IEEE Data Eng. Bull. 33(3), 17--24, Citeseer, 2010

Automatic Rule Refinement for Information Extraction
Bin Liu, Laura Chiticariu, Vivian Chu, H. V. Jagadish, Frederick Reiss
Proceedings of the VLDB Endowment Journal 3(1), 588-597, VLDB Endowment, 2010

Refining Information Extraction Rules using Data Provenance
Bin Liu, Laura Chiticariu, Vivian Chu, H. V. Jagadish, Frederick Reiss
IEEE Data Eng. Bull. 33(3), 17-24, Citeseer, 2010

Enterprise information extraction: recent developments and open challenges
Laura Chiticariu, Yunyao Li, Sriram Raghavan, Frederick Reiss
SIGMOD (Tutorial), pp. 1257-1258, 2010

Using Bagging and Boosting Techniques for Improving Coreference Resolution
Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli, Imed Zitouni
Informatica 1(34), 111-118, 2010

Learning to Predict Readability using Diverse Linguistic Features
Rohit Kate; Xiaoqiang Luo; Siddharth Patwardhan; Martin Franz; Radu Florian; Raymond Mooney; Salim Roukos; Chris Welty
Proc. of COLING, 2010


Building Watson: An Overview of the DeepQA Project
D Ferrucci, E Brown, J Chu-Carroll, J Fan, D Gondek, AA Kalyanpur, A Lally, J. William Murdock, E Nyberg, J Prager, N Schlaefer, and C Welty
AI Magazine 31(3), 59-79, Association for the Advancement of Artificial Intelligence, 2010

Variant search and syntactic tree similarity based approach to retrieve matching questions for SMS queries
A Langer, R Banga, A Mittal, L V Subramaniam
Proceedings of the fourth workshop on Analytics for noisy unstructured text data, pp. 67--72, 2010

Estimating accuracy for text classification tasks on large unlabeled data
S Chaturvedi, T A Faruquie, L V Subramaniam, M K Mohania
Proceedings of the 19th ACM international conference on Information and knowledge management, pp. 889--898, 2010

Unsupervised cleansing of noisy text
Danish Contractor, Tanveer A Faruquie, L Venkata Subramaniam
Proceedings of the 23rd International Conference on Computational Linguistics: Posters, pp. 189--196, 2010

Improving mention detection robustness to noisy input
R Florian, J F Pitrelli, S Roukos, I Zitouni
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 335--345

What do people want in microblogs? measuring interestingness of hashtags in twitter
J Weng, E P Lim, Q He, C W K Leung
Proceedings of the 2010 IEEE International Conference on Data Mining (ICDM), pp. 1121--1126

Keep It Simple with Time: A re-examination of Probabilistic Topic Detection Models
Qi He, Kuiyu Chang, Ee-Peng Lim and Arindam Banerjee
the IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) 32(10), 1795-1808, 2010

Context-aware citation recommendation
Q He, J Pei, D Kifer, P Mitra, L Giles
Proceedings of the 19th international conference on World Wide Web (WWW), pp. 421--430, 2010

Twitterrank: finding topic-sensitive influential twitterers
J Weng, E P Lim, J Jiang, Q He
Proceedings of the third ACM international conference on Web search and data mining (WSDM), pp. 261--270, 2010


Building Analytics for Enterprise Search: IBM's Project ES2
Vuk Ercegovac, Rajasekar Krishnamurthy, Sriram Raghavan, Frederick Reiss, Eugene Shekita, Sandeep Tata, Shivakumar Vaithyanathan, and Huaiyu Zhu
Hadoop in Action, Manning Publications, 2010

Rapid and inexpensive development of speech action classifiers for natural language call routing systems
Ea-Ee Jan, Brian Kingsbury
Spoken Language Technology Workshop (SLT), 2010 IEEE, pp. 348--353

The IBM Attila speech recognition toolkit
Hagen Soltau, George Saon, Brian Kingsbury
Spoken Language Technology Workshop (SLT), 2010 IEEE, pp. 97--102

The IBM 2008 GALE Arabic speech transcription system
George Saon, Hagen Soltau, Upendra Chaudhari, Stephen Chu, Brian Kingsbury, Hong-Kwang Kuo, Lidia Mangu, Daniel Povey
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on, pp. 4378--4381

Using Machine Translation for the Localization of Electronic Support Content: Evaluating End-User Satisfaction
O Stewart, D Lubensky, S Macdonald, J Marcotte
Ninth Conference of the Association for Machine Translation in the Americas (AMTA), Denver, CO November 4, 2010
Abstract

Constrained co-clustering for textual documents
Y Song, S Pan, S Liu, F Wei, M X Zhou, W Qian
Proceedings of AAAI , 2010

Natural Language Aided Visual Query Building for Complex Data Access
S. Pan, M. Zhou, K. Houck, P. Kissa
Proceedings of IAAI, 2010

A General Purpose FrameNet-based Shallow Semantic Parser

Bonaventura Coppola, Alessandro Moschitti
Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010)


Building re-usable dictionary repositories for real-world text mining
Shantanu Godbole, Indrajit Bhattacharya, Ajay Gupta, Ashish Verma
Proceedings of the 19th ACM international conference on Information and knowledge management, pp. 1189--1198, 2010

Incorporating sparse representation phone identification features in automatic speech recognition using exponential families
V. Goel, T.N. Sainath, B. Ramabhadran, P. Olsen, D. Nahamoo, D. Kanevsky
Eleventh Annual Conference of the International Speech Communication Association, 2010

Graph-based Semantic Relatedness for Named Entity Disambiguation
A L Gentile, Z Zhang, L Xia, J Iria
Serdica Journal of Computing 2(4), 2010

Learning to Predict Readability using Diverse Linguistic Features
Rohit Kate, Xiaoqiang Luo, Siddharth Patwardhan, Martin Franz, Radu Florian, Raymond Mooney, Salim Roukos, Chris Welty
Proceedings of the 23rd International Conference on Computational Linguistics, pp. 546--554, 2010

Widening the Field of View of Information Extraction through Sentential Event Recognition
Siddharth Patwardhan
PhD Thesis, University of Utah, 2010

Acquiring Thesauri from Wikis by Exploiting Domain Models and Lexical Substitution
C Giuliano, A Gliozzo, A Gangemi, K Tymoshenko
The Semantic Web: Research and Applications, 121--135, Springer, 2010

A Geometric Framework For Transfer Learning Using Manifold Alignment
Chang Wang
Doctoral Dissertations for UMass Amherst. Paper AAI3427610, 2010

PRISMATIC: Inducing Knowledge from a Large Scale Lexicalized Relation Resource
James Fan, David Ferrucci, David Gondek and Aditya Kalyanpur
NAACL Workshop on Formalisms and Methodology for Learning by Reading , 2010

Support or Oppose? Classifying Positions in Online Debates from Reply Activities and Opinion Expressions
A. Murakami and R. Raymond
Proceedings of the 23th international conference on Computational Linguistics, 2010

Two Methods for Extending Hierarchical Rules from the Bilingual Chart Parsing
Martin Cmejrek and Bowen Zhou.
COLING, 2010

Classification and retrieval from mailing lists and forums
Preethi Raghavan, K Rose Catherine, Shajith Ikbal, Nanda Kambhatla
FIRE Working notes, 2010

DIVERSIFY AND COMBINE: IMPROVING WORD ALIGNMENT FOR MACHINE TRANSLATION ON LOW-RESOURCE LANGUAGES
Bing Xiang, Yonggang Deng and Bowen Zhou
ACL, 2010

APPLYING LOG LINEAR MODEL BASED CONTEXT DEPENDENT MACHINE TRANSLATION TECHNIQUES TO GRAPHEME-TO-PHONEME CONVERSION
Rong Zhang and Bowen Zhou
ICASSP, 2010

RAPID INTEGRATION OF PARTS OF SPEECH INFORMATION TO IMPROVE REORDERING MODEL FOR ENGLISH-FARSI SPEECH TO SPEECH TRANSLATION
Sameer Maskey and Bowen Zhou
ICASSP, 2010

Improving Domain-specific Entity Recognition with Automatic Term Recognition and Feature Extraction
Z Zhang, J Iria and F Ciravegna
Proceedings of 7th International Conference on Language Resources and Evaluation, 2010

A Random Graph Walk-based Approach to Compute Semantic Relatedness Using Knowledge from Wikipedia
Z Zhang, A L Gentile, L Xia, J Iria and S Chapman
Proceedings of 7th International Conference on Language Resources and Evaluation, 2010

Semantic Relatedness Approach for Named Entity Disambiguation
A L Gentile, Z Zhang, L Xia, J Iria
Digital Libraries: Communications in Computer and Information Science, Springer-Verlag, 2010


Super-human multi-talker speech recognition: A graphical modeling approach
J R Hershey, S J Rennie, P A Olsen, T T Kristjansson
Computer Speech & Language 24(1), 45--66, Elsevier, 2010

The monaural speech separation and recognition challenge
M Cooke, J R Hershey, S J Rennie
Computer Speech & Language 24(1), 1--15, Elsevier, 2010

Accessibility challenges and tool features: an IBM Web developer perspective
Shari Trewin, Brian Cragun, Cal Swart, Jonathan Brezin, John Richards
W4A '10: Proceedings of the 2010 International Cross Disciplinary Conference on Web Accessibility (W4A), pp. 1--10, ACM
Abstract

Syntax based reordering with automatically derived rules for improved statistical machine translation
Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen, Vijil Chenthamarakshan, Nanda Kambhatla
Proceedings of the 23rd international conference on computational linguistics, pp. 1119--1127, 2010

TIARA: A Visual Exploratory Text Analytic System
Furu Wei, Shixia Liu, Yangqiu Song, Shimie Pan, Michelle X. Zhou, Weihong Qian, Lei Shi, Li Tan, Qiang Zhang
Proceedings of KDD , pp. 153--162, ACM, 2010
Abstract

Enterprise information extraction: recent developments and open challenges
Laura Chiticariu, Yunyao Li, Sriram Raghavan, Frederick R Reiss
Proceedings of the 2010 ACM SIGMOD International Conference on Management of data, pp. 1257--1258, ACM
Abstract

Large scale relation detection
Chris Welty, James Fan, David Gondek and Andrew Schlaikjer
NAACL Workshop on Formalisms and Methodology for Learning by Reading , 2010

Toward modeling auditory information seeking strategies on the web
Shari Trewin, John Richards, Rachel Bellamy, Bonnie E John, John Thomas, Cal Swart, Jonathan Brezin
CHI EA '10: Proceedings of the 28th of the international conference extended abstracts on Human factors in computing systems, pp. 3973--3978, ACM, 2010
Abstract

Domain adaptation of rule-based annotators for named-entity recognition tasks
Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao Li, Frederick Reiss, Shivakumar Vaithyanathan
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 1002--1012

SystemT: an algebraic approach to declarative information extraction
Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick R Reiss, Shivakumar Vaithyanathan
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pp. 128--137, Association for Computational Linguistics, 2010
Abstract


2009

Towards a Scalable Enterprise Content Analytics Platform
Kevin Beyer, Vuk Ercegovac, Rajasekar Krishnamurthy, Sriram Raghavan, Jun Rao, Frederick Reiss, Eugene J. Shekita, David Simmen, Sandeep Tata, Shivakumar Vaithyanathan, and Huaiyu Zhu
IEEE Data Engineering Bulletin, 2009

Enabling enterprise mashups over unstructured text feeds with infosphere mashuphub and systemt
David E Simmen, Frederick Reiss, Yunyao Li, Suresh Thalamati
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data, pp. 1123--1126, ACM
Abstract

Enterprise Information Extraction
Frederick Reiss, Yunyao Li, Laura Chiticariu, Sriram Raghavan
2009 - Citeseer, Citeseer

A Unified Model of Phrasal and Sentential Evidence for Information Extraction
Siddharth Patwardhan, Ellen Riloff
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pp. 151--160, Association for Computational Linguistics
Abstract

Acquiring Paraphrases from Text Corpora
Rahul Bhagat, Eduard Hovy, Siddharth Patwardhan
Proceedings of the Fifth International Conference on Knowledge Capture, pp. 161--168, ACM, 2009
Abstract

Snippets: Using Heuristics to Bootstrap a Machine Learning Approach
Daniel M. Bikel, Vittorio Castelli, Radu Florian, Xiaoqiang Luo, Scott McCarley, Todd Ward
GALE Book, 2009

A Cascaded Approach to Mention Detection and Chaining in Arabic
Imed Zitouni, Xiaoqiang Luo, Radu Florian
IEEE Trans. On Audio, Speech and Language Processing pp. 17, 935-944, 2009

Classifier Combination Techniques for Coreference Resolution: Bagging and Boosting
Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli, Imed Zitouni
Proc. of HLT/NAACL (student workshop), 2009

Improving Coreference Resolution by Using Conversational Metadata (short paper)
Xiaoqiang Luo, Radu Flroian, Todd Ward
Proc. of HLT/NAACL, 2009

Classifier Combination Techniques for Coreference Resolution: Bagging and Boosting (Poster demo)
Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli, Imed Zitouni
Proc. of CICLING, 2009

Towards the Open Advancement of Question Answering Systems
D Ferrucci, E Nyberg, J Allan, K Barker, E Brown, J Chu-Carroll, A Ciccolo, P Duboue, J Fan, D Gondek, E Hovy, B Katz, A Lally, M McCord, P Morarescu, J. William Murdock, B Porter, J Prager, T Strzalkowski, C Welty, W Zadrozny
IBM Research Report RC24789, IBM Research Report. RC24789 (W0904-093), IBM Research, New York, 2009


Scalable highly expressive reasoner (sher)
J Dolby, A Fokoue, A Kalyanpur, E Schonberg, K Srinivas
Journal of Web Semantics: Science, Services and Agents on the World Wide Web 7(4), 357--361, Elsevier, 2009

Extracting enterprise vocabulary using linked open data
J Dolby, A Fokoue, A Kalyanpur, K Srinivas, E Schonberg
8th International Semantic Web Conference (ISWC) 2009

Global differences in attributes of email usage
John C Tang, Tara Matthews, Julian Cerruti, Stephen Dill, Eric Wilcox, Jerald Schoudt, Hernan Badenes
Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 185--194, ACM
Abstract

Interactive, topic-based visual text summarization and analysis
S Liu, M X Zhou, S Pan, W Qian, W Cai, X Lian
Proceeding of the 18th ACM conference on Information and knowledge management (CIKM), pp. 543--552, 2009

Effective decision support for workforce deployment service systems
Kashyap Dixit, Munish Goyal, Pranav Gupta, Nanda Kambhatla, Rohit M Lotlikar, Debapriyo Majumdar, Gyana R Parija, Sambuddha Roy, Soujanya Soni
Services Computing, 2009. SCC'09. IEEE International Conference on, pp. 104--111

Who is the expert? Analyzing gaze data to predict expertise level in collaborative applications
Yan Liu, Pei-Yun Hsueh, Jennifer Lai, Mirweis Sangin, M-A Nussli, Pierre Dillenbourg
Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on, pp. 898--901

Towards the Open Advancement of Question Answer Systems
David Ferrucci, Eric Nyberg, James Allan, Ken Barker, Eric Brown, Jennifer Chu-Carroll, Art Ciccolo, Pablo Duboue, James Fan, David Gondek, Eduard Hovy, Boris Katz, Adam Lally, Michael McCord, Paul Morarescu, Bill Murdock, Bruce Porter, John Prager, Tomek
Technical Report, 2009


Efficient reasoning on large SHIN Aboxes in relational databases
Julian Dolby, Achille Fokoue, Aditya Kalyanpur, Li Ma, Chintan Patel, Edith Schonberg, Kavitha Srinivas, Xingzhi Sun
Proceedings of the 5th International Workshop on Scalable Semantic Web knowledge Base Systems (SSWS2009), pp. 110--125

COBRA--mining web for COrporate Brand and Reputation Analysis
S Spangler, Y Chen, L Proctor, A Lelescu, A Behal, B He, T D Griffin, A Liu, B Wade, T Davis
Web Intelligence and Agent Systems 7(3), 243--254, IOS Press, 2009

Advances in Arabic speech transcription at IBM under the DARPA GALE program
Hagen Soltau, George Saon, Brian Kingsbury, H-KJ Kuo, Lidia Mangu, Daniel Povey, Ahmad Emami
Audio, Speech, and Language Processing, IEEE Transactions on 17(5), 884--894, IEEE, 2009

Data quality from crowdsourcing: a study of annotation selection criteria
P Y Hsueh, P Melville, V Sindhwani
Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing, pp. 27--35


2008

An Algebraic Approach to Rule-Based Information Extraction
Frederick Reiss, Sriram Raghavan, Rajasekar Krishnamurthy, Huaiyu Zhu, and Shivakumar Vaithyanathan
International Conference on Data Engineering, pp. 933 - 942, 2008

An algebraic approach to rule-based information extraction
Frederick Reiss, Sriram Raghavan, Rajasekar Krishnamurthy, Huaiyu Zhu, Shivakumar Vaithyanathan
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on, pp. 933--942, IEEE

SystemT: a system for declarative information extraction
Rajasekar Krishnamurthy, Yunyao Li, Sriram Raghavan, Frederick Reiss, Shivakumar Vaithyanathan, Huaiyu Zhu
ACM SIGMOD Record 37(4), 7--13, ACM, 2008
Abstract

Combining Global Relevance Information with Local Contextual Clues for Event-Oriented Information Extraction
Siddharth Patwardhan
Proceedings of the 23rd National Conference on Artificial Intelligence, pp. 1863--1864, 2008

SemantiClean: Cleaning Noisy Data Using Semantic Technology
C Welty, JW Murdock, J Fan
Language Resources and Evaluation 42(4), 395-408, Springer Netherlands, 2008

Vector based Approaches to Semantic Similarity Measures
J M Huerta
Advances in Natural Language Processing and Applications, 163, Citeseer, 2008

Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model
A Coden, G Savova, I Sominsky, M Tanenblatt, J Masanz, K Schuler, J Cooper, W Guan, P C Groen de
Journal of Biomedical Informatics, Elsevier, 2008

An Empirical Analysis of Word Error Rate and Keyword Error Rate
Youngja Park, Siddharth Patwardhan, Karthik Visweswariah, Stephen Gates
Proceedings of the International Conference on Spoken Language Processing, pp. 2070--2073, 2008

A theory-based decision heuristic for DPLL (T)
D Goldwasser, O Strichman, S Fine
Proceedings of the 2008 International Conference on Formal Methods in Computer-Aided Design

Regular expression learning for information extraction
Yunyao Li, Rajasekar Krishnamurthy, Sriram Raghavan, Shivakumar Vaithyanathan, H V Jagadish
Proceedings of the Conference on Empirical Methods in Natural Language Processing, pp. 21--30, Association for Computational Linguistics, 2008
Abstract

A Rule-Driven Dynamic Programming Decoder for Statistical MT
C Tillmann
Proc. of the Workshop SSST at ACL'08, pp. 37, 2008

Mining top issues from contact center logs for self help portals
Dinesh Garg, Nanda Kambhatla, Maja Vukovic, Gopal Pingali
Services Computing, 2008. SCC'08. IEEE International Conference on, pp. 171--178

Navigating through Dense Annotation Spaces
B K Boguraev, M S Neff
6th International Conference on Language Resources and Evaluation, 2008

Aggregating Distributed STT, MT, and Information Extraction Engines: The GALE Interoperability-Demo System
J F Pitrelli, B L Lewis, E A Epstein, M Franz, D Kiecza, J L Quinn, G Ramaswamy, A Srivastava, P Virga
Proceedings of Interspeech, pp. 23--26, 2008

Event matching using the transitive closure of dependency relations (Outstanding Short Paper Award)
D M Bikel, V Castelli
Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers, pp. 145--148, 2008

Accelerated monte carlo for kullback-leibler divergence between gaussian mixture models
J.Y. Chen, J.R. Hershey, P.A. Olsen, E. Yashchin
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4553--4556

Textual demand analysis: detection of users' wants and needs from opinions
Hiroshi Kanayama, Tetsuya Nasukawa
Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1, pp. 409--416, Association for Computational Linguistics, 2008
Abstract


The IBM rich transcription 2007 speech-to-text systems for lecture meetings
Jing Huang, Etienne Marcheret, Karthik Visweswariah, Vit Libal, Gerasimos Potamianos
Multimodal Technologies for Perception of Humans, pp. 429--441, Springer, 2008

Applications of voting theory to information mashups
A Alba, V Bhagwan, J Grace, D Gruhl, K Haas, M Nagarajan, J Pieper, C Robson, N Sahoo
Semantic Computing, 2008 IEEE International Conference on, pp. 10--17

Variational bhattacharyya divergence for hidden markov models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4557--4560

Boosted MMI for model and feature-space discriminative training
Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Karthik Visweswariah
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on, pp. 4057--4060


2007

UMND1: Unsupervised Word Sense Disambiguation using Contextual Semantic Relatedness
Siddharth Patwardhan, Satanjeev Banerjee, Ted Pedersen
SemEval-2007: Proceedings of the 4th International Workshop on Semantic Evaluations, pp. 390--393, Association for Computational Linguistics
Abstract

Measures of Semantic Similarity and Relatedness in the Biomedical Domain
Ted Pedersen, Serguei Pakhomov, Siddharth Patwardhan, Christopher Chute
Journal of Biomedical Informatics 40(3), 288--299, Elsevier, 2007

Effective Information Extraction with Semantic Affinity Patterns and Relevant Regions
Siddharth Patwardhan, Ellen Riloff
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 717--727

A Statistical Model for Arabic Mention Detection and Chaining
Imed Zitouni, Xiaoqiang Luo, Radu Florian
Arabic Computational Linguistics, The University of Chicago Press, 2007

Coreference or not: A Twin-model for Coreference Resolution
Xiaoqiang Luo
Porc. of 2007 NAACL/HLT

COALA: A Tool for Inter-document Coreference Resolution Evaluation
B Andrews, J Fan, JW Murdock, C Welty
AAAI 2007 Spring Symposium Series: Machine Reading

Enabling domain-awareness for a generic natural language interface
Y Li, I Chaudhuri, H Yang, S Singh, HV Jagadish
AAAI, pp. 48109, 2007

If We Want Your Opinion
Daniel M Bikel, J Sorensen, IBMTJWR Center, Y Heights
Semantic Computing, 2007. ICSC 2007. International …, 2007

The IBM 2006 GALE Arabic System
H Soltau, G Saon, D Povey, L Mangu, B Kingsbury, M …
submitted to: ICASSP, 2007

Variational Kullback-Leibler divergence for hidden Markov models
J.R. Hershey, P.A. Olsen, S.J. Rennie
Automatic Speech Recognition \& Understanding, 2007. ASRU. IEEE Workshop on, pp. 323--328

IBM in TREC2006 Enterprise Track
J Chu-Carroll, G Averboch, P Duboue, D Gondek, JW Murdock, J Prager, P Hoffmann, J Wiebe
The Fifteenth Text REtrieval Conference (TREC 2006) Proceedings, 2007


IBM ACE’07 System Description
R Florian, B Han, X Luo, N Kambhatla, I Zitouni
Proceedings of NIST, 2007

Scalable cleanup of information extraction data using ontologies
J Dolby, J Fan, A Fokoue, A Kalyanpur, A Kershenbaum, L Ma, JW Murdock, K Srinivas, C Welty
Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference, pp. 100--113, Springer, 2007
Abstract

Discriminative training of subspace constrained GMMs for speech recognition
S Axelrod, V Goel, R Gopinath, P Olsen, K Visweswariah
IEEE Transactions on Speech and Audio Processing 15(1), 172-189, 2007

On generating EFSM models from use cases
A. Sinha, A. Paradkar, C. Williams
Scenarios and State Machines, 2007. SCESM'07: ICSE Workshops 2007. Sixth International Workshop on, pp. 1--1

Word confusability-measuring hidden Markov model similarity
J.Y. Chen, P.A. Olsen, J.R. Hershey
Eighth Annual Conference of the International Speech Communication Association, 2007

Automatic segmentation and summarization of meeting speech
G Murray, PY Hsueh, S Tucker, J Kilgour, J …
Proceedings of Human Language Technologies: The …, 2007 - portal.acm.org

A Block Bigram Prediction Model for Statistical MT
C Tillmann, T Zhang
ACM Transactions on Speech and Language Processing (TSLP) 4(3), 6, ACM, 2007

Automated functional conformance test generation for semantic web services
A.M. Paradkar, A. Sinha, C. Williams, R.D. Johnson, S. Outterson, C. Shriver, C. Liang
Web Services, 2007. ICWS 2007. IEEE International Conference on, pp. 110--117

Matching patient records to clinical trials using ontologies
C Patel, J Cimino, J Dolby, A Fokoue, A Kalyanpur, A Kershenbaum, L Ma, E Schonberg, K Srinivas
Proc of International Semantic Web Conference (ISWC), pp. 816, Springer, 2007

What decisions have you made: Automatic decision detection in conversational speech
PY Hsueh, J Moore
Proceedings of NAACL HLT, 2007 - acl.ldc.upenn.edu

Extracting social networks and biographical facts from conversational speech transcripts
H Jing, N Kambhatla, S Roukos
ANNUAL MEETING-ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, pp. 1040, 2007

Timebank evolution as a community resource for TimeML parsing
B Boguraev, J Pustejovsky, R Ando, M Verhagen
Language Resources and Evaluation 41(1), 91--115, Springer, 2007

The IBM 2006 Gale Arabic ASR system
Hagen Soltau, George Saon, Brian Kingsbury, Jeff Kuo, Lidia Mangu, Daniel Povey, Geoffrey Zweig
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--349

Learning by reading: A prototype system, performance baseline and lessons learned
Ken Barker, Bhalchandra Agashe, Shaw Chaw, James Fan, Noah Friedland, Michael Glass, Jerry Hobbs, Ed Hovy, David Israel, Doo Soon Kim, Rutu Mulkar, Sid Patwardhan, Bruce Porter, Dan Tecuci and Peter Yeh
AAAI 2007

Scalable semantic retrieval through summarization and refinement
J Dolby, A Fokoue, A Kalyanpur, A Kershenbaum, E Schonberg, K Srinivas, L Ma
Proceedings of the National Conference on Artificial Intelligence (AAAI-07), pp. 299, 2007

Evaluation of proposed modifications to MPE for large scale discriminative training
Daniel Povey, Brian Kingsbury
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--321

Towards effective browsing of large scale social annotations
Rui Li, Shenghua Bao, Yong Yu, Ben Fei, Zhong Su
Proceedings of the 16th international conference on World Wide Web, pp. 943--952, 2007

Approximating the Kullback Leibler divergence between Gaussian mixture models
J.R. Hershey, P.A. Olsen
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--317


2006

Using WordNet-based Context Vectors to Estimate the Semantic Relatedness of Concepts
Siddharth Patwardhan, Ted Pedersen
Proceedings of the EACL 2006 Workshop on Making Sense of Sense: Bringing Psycholinguistics and Computational Linguistics Together, pp. 1--8, Association for Computational Linguistics

Feature Subsumption for Opinion Analysis
Ellen Riloff, Siddharth Patwardhan, Janyce Wiebe
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 440--448, Association for Computational Linguistics
Abstract

Learning Domain-Specific Information Extraction Patterns from the Web
Siddharth Patwardhan, Ellen Riloff
Proceedings of the Workshop on Information Extraction Beyond The Document, pp. 66--73, Association for Computational Linguistics, 2006
Abstract

A view of OWL from the field: Use cases and experiences
A Kershenbaum, A Fokoue, C Patel, C Welty, E Schonberg, J Cimino, L Ma, K Srinivas, R Schloss, J W Murdock
W3C Web Ontology Language (OWL) - Experiences and Directions Workshop, 2006

Overview of Component Services for Knowledge Integration in UIMA (a.k.a. SUKI)
D Ferrucci, JW Murdock, C Welty
IBM Research Report RC24074, IBM, 2006


Word independent model for syllable stress evaluation
A Verma, K Lal, YY Lo, J Basak
2006 IEEE International Conference on Acoustics, Speech and …, 2006 - ieeexplore.ieee.org

Transition relevance place: a proposal for adaptive user interface in natural language dialog management systems
O Stewart, J M Huerta
CHI'06 extended abstracts on Human factors in computing systems, pp. 1361--1366, 2006

Coreference resolution on RDF Graphs generated from Information Extraction: first results
M Yatskevich, C Welty, JW Murdock
ISWC'06 Workshop on Web Content Mining with Human Language Technologies, 2006

Improving QA accuracy by question inversion
J Prager, P Duboue, J Chu-Carroll
ANNUAL MEETING-ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, pp. 1073--1080, 2006

A non-linear speaker adaptation technique using kernel ridge regression
G Saon, IBMTJWR Center, Y Heights
2006 IEEE International Conference on Acoustics, Speech and …, 2006 - ieeexplore.ieee.org

Transition relevance place: a proposal for adaptive user interface in natural language dialog management systems
O Stewart, J M Huerta
CHI'06 extended abstracts on Human factors in computing systems, pp. 1361--1366, 2006

Obtaining Formal Knowledge from Informal Text Analysis
J W Murdock, C Welty
IBM Research Report RC23961, 2006

Minority vote: at-least-N voting improves recall for extracting relations
N Kambhatla
Proceedings of the COLING/ACL on Main conference poster sessions, pp. 466, 2006

English-Chinese information retrieval at IBM
M Franz, J S McCarley, W J Zhu
ifi.uzh.ch, IBM THOMAS J WATSON RESEARCH CENTER YORKTOWN HEIGHTS NY, 2006

Concept-based electronic health records: opportunities and challenges
Shahram Ebadollahi, Anni R Coden, Michael A Tanenblatt, Shih-Fu Chang, Tanveer Syeda-Mahmood, Arnon Amir
Proceedings of the 14th annual ACM international conference on Multimedia, pp. 997--1006, 2006

Answering the question you wish they had asked: The impact of paraphrasing for question answering
PA Duboue, J Chu-Carroll
ANNUAL MEETING-ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2006

The web beyond popularity: a really simple system for web scale rss
D Gruhl, D N Meredith, J H Pieper, A Cozzi, S Dill
Proceedings of the 15th international conference on World Wide Web, pp. 192, 2006

Efficient Dynamic Programming Search Algorithms for Phrase-based SMT
C Tillmann
Proc. of the Workshop CHPSLP at HLT'06, pp. 9--16, 2006

Towards an Interoperability Standard for Text and Multi-Modal Analytics
David Ferrucci, Adam Lally, Daniel Gruhl, Edward Epstein, Marshall Schor, J. William Murdock, Andy Frenkiel, Eric W. Brown, Thomas Hampp, Yurdaer Doganata, Christopher Welty, Lisa Amini, Galina Kofman, Lev Kozakov, Yosi Mass
IBM Research Report RC24122, 2006

Enabling context-sensitive information seeking
MX Zhou, K Houck, S Pan, J Shaw, V Aggarwal, Z Wen
Proceedings of IUI, pp. 116--123, ACM, 2006
Abstract

Explaining conclusions from diverse knowledge sources
J. William Murdock, Deborah L. McGuinness, Paulo Pinheiro da Silva, Chris Welty, and David Ferrucci
Proceedings of the 5th International Semantic Web Conference (ISWC'06), 2006

Open-domain Question-Answering
John Prager
Found. Trends Inf. Retr.1, 91--231, Now Publishers Inc., 2006
Abstract

Towards knowledge acquisition from information extraction
C Welty, J Murdock
The Semantic Web-ISWC 2006, 709--722, Springer

Semantic search via XML fragments: a high-precision approach to IR
J. Chu-Carroll, J. Prager, K. Czuba, D. Ferrucci, P. Duboue
Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 445--452, ACM, 2006

A Discriminative Global Training Algorithm for Statistical MT
C Tillmann, T Zhang
Proc. of COLING'06 and ACL'06, pp. 721--728, 2006

Fully automatic lexicon expansion for domain-oriented sentiment analysis
Hiroshi Kanayama, Tetsuya Nasukawa
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing, pp. 355--363

Semantic Annotation for Knowledge Management: Requirements and a Survey of the State of the Art
V Uren, P Cimiano, J Iria, S Handschuh, M Vargas-Vera, E Motta, F Ciravegna
Web Semantics: Science, Services and Agents on the World Wide Web 4(1), 14--28, Elsevier, 2006

Question answering by predictive annotation
J Prager, J Chu-Carroll, E Brown, K Czuba
Advances in Open Domain Question Answering, 307--347, Springer, 2006


2005

Identifying Sources of Opinions with Conditional Random Fields and Extraction Patterns
Yejin Choi, Claire Cardie, Ellen Riloff, Siddharth Patwardhan
Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 355--362, Association for Computational Linguistics, 2005
Abstract

SenseRelate::TargetWord - A Generalized Framework for Word Sense Disambiguation
Siddharth Patwardhan, Ted Pedersen, Satanjeev Banerjee
Proceedings of the Twentieth National Conference on Artificial Intelligence (Intelligent Systems Demonstrations), pp. 1692--1693, 2005

OpinionFinder: A System for Subjectivity Analysis
Theresa Wilson, Paul Hoffmann, Swapna Somasundaran, Jason Kessler, Janyce Wiebe, Yejin Choi, Claire Cardie, Ellen Riloff, Siddharth Patwardhan
Proceedings of HLT/EMNLP on Interactive Demonstrations, pp. 34--35, Association for Computational Linguistics, 2005
Abstract

Measures of Semantic Similarity and Relatedness in the Medical Domain
Ted Pedersen, Serguei Pakhomov, Siddharth Patwardhan
University of Minnesota Digital Technology Center Research Report, 2005

Maximizing Semantic Relatedness to Perform Word Sense Disambiguation
Ted Pedersen, Satanjeev Banerjee, Siddharth Patwardhan
University of Minnesota Supercomputing Institute Research Report, 2005

SenseRelate:: TargetWord: a generalized framework for word sense disambiguation
S. Patwardhan, S. Banerjee, T. Pedersen
Proceedings of the ACL 2005 on Interactive poster and demonstration sessions, pp. 73--76

The impact of morphological stemming on Arabic mention detection and coreference resolution
Imed Zitouni, Jeffrey Sorenson, Xiaoqiang Luo, Radu Florian
Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages, 2005

Multi-Lingual Coreference Resolution with Syntactic Features
Xiaoqiang Luo and Imed Zitouni
Proc. of EMNLP/HLT, 2005

On Coreference Resolution Performance Metrics
Xiaoqiang Luo
Proc. Of EMNLP (and HLT), 2005

Exploiting pervasive enterprise chronicles using unstructured information management
Anthony Levas, Gopal Pingali, Mark Podlaseck, and J. William Murdock
International Conference on Pervasive Services 2005 (ICPS'05), pp. 239--248

Exposing Extracted Knowledge Supporting Answers
D L McGuinness, P P Silva da, J W Murdock, D Ferrucci
Technical Report KSL-05-03, Knowledge Systems Laboratory, Stanford University, 2005


Holistic Information Management Solutions
Y Chen, S Ong
IBM Research Report, 2005 - domino.research.ibm.com

IBM's PIQUANT II in TREC2004
JC Carroll, K Czuba, J Prager, A Ittycheriah, S Blair-Goldensohn
NIST Special Publication 500-261: Proceedings of the Thirteenth Text REtrieval Conference (TREC 2004), 2005

The Semantic Analysis Workbench (SAW): Towards a framework for knowledge gathering and synthesis
A Levas, E Brown, JW Murdock, D Ferrucci
Proceedings of the International Conference on Intelligence Analysis, 2005

Sentence Similarity Computing Based on Multi-Features Fusion
Y Zhao, B Qin, T Liu, L Zhang, Z Su
the Proceedings of JSCL, 2005 - ir.hit.edu.cn

Discriminatively trained features using fMPE for multi-stream audio-visual speech recognition
J Huang, D Povey
Ninth European Conference on Speech Communication and …, 2005 - ISCA

Initializing subspace constrained Gaussian mixture models
P.A. Olsen, K. Visweswariah, R. Gopinath
Proc. of the ICASSP, pp. 661--664, 2005

Exploiting unlabeled data using multiple classifiers for improved natural language call-routing
R Sarikaya, H K J Kuo, V Goel, Y Gao
Ninth European Conference on Speech Communication and Technology, 2005

Improving end-to-end performance of call classification through data confusion reduction and model tolerance enhancement
C Wu, X Li, H K J Kuo, EE Jan, V Goel, D Lubensky
Ninth European Conference on Speech Communication and Technology, 2005

Encoding Extraction as Inferences
JW Murdock, PP Da Silva, D Ferrucci, C Welty, DL McGuinness
Proceedings of the 2005 AAAI Spring Symposium on Metacognition in Computation, pp. 92--97

Tracking information extraction from intelligence documents
Christopher Welty, J. William Murdock, Paulo Pinheiro da Silva, Deborah McGuinness, and David Ferrucci
Proceedings of the International Conference on Intelligence Analysis, 2005

Feature adaptation using projection of Gaussian posteriors
K. Visweswariah, P. Olsen
Ninth European Conference on Speech Communication and Technology, 2005


IBM's PIQUANT II in TREC 2005
J Chu-Carroll, K Czuba, P Duboue, J Prager
Proceedings of the Fourteenth Text REtrieval Conference, 2005

Indirect anaphora resolution as semantic path search
James Fan, Ken Barker, Bruce Porter
Proceedings of the 3rd international conference on Knowledge capture, pp. 153--160, ACM, 2005
Abstract

Two-way adaptation for robust input interpretation in practical multimodal conversation systems
Shimei Pan, Siwei Shen, Michelle X Zhou, Keith Houck
Proceedings of the 10th international conference on Intelligent user interfaces, pp. 35--42, ACM, 2005

Visualizing information across multidimensional post-genomic structured and textual databases
Y Tao, C Friedman, Y A Lussier
Bioinformatics 21(8), 1659, Oxford Univ Press, 2005

Tell me what you do and I'll tell you what you are: learning occupation-related activities for biographies
Elena Filatova, John Prager
Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing, pp. 113--120, Association for Computational Linguistics, 2005
Abstract

Timebank-driven TimeML analysis for temporal reasoning
B K Boguraev, R K Ando
Dagstuhl International Workshop on Annotating, Extracting and Reasoning about Time and Events, 2005

Improving Web accessibility through an enhanced open-source browser
VL Hanson, JP Brezin, S Crayne, S Keates, R Kjeldsen, JT Richards, C Swart, S Trewin
IBM Systems Journal 44(3), 573--588, IBM, 2005


A Localized Prediction Model for Statistical MT
C Tillmann, T Zhang
Proc. of ACL'05, pp. 557--564, 2005

TimeML-compliant text analysis for temporal reasoning
Branimir Boguraev, Rie Ando
Dagstuhl Conference on Annotating and Reasoning with time and events, 2005

The IBM 2004 conversational telephony system for rich transcription
Hagen Soltau, Brian Kingsbury, Lidia Mangu, Daniel Povey, George Saon, Geoffrey Zweig
ICASSP - IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 205--208, 2005


2004

Generative Models for Semantic Role Labeling
Cynthia Thompson, Siddharth Patwardhan, Carolin Arnold
Proceedings of SENSEVAL-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, pp. 235--238, 2004

WordNet::Similarity - Measuring the Relatedness of Concepts
Ted Pedersen, Siddharth Patwardhan, Jason Michelizzi
Proceedings of the Nineteenth National Conference on Artificial Intelligence (Intelligent Systems Demonstrations), pp. 1024--1025, 2004

WordNet:: Similarity: measuring the relatedness of concepts
T. Pedersen, S. Patwardhan, J. Michelizzi
Demonstration Papers at HLT-NAACL 2004, pp. 38--41

A Mention-Synchronous Coreference Resolution Algorithm Based on the Bell Tree
Xiaoqiang Luo, Abe Ittycheriah, Hongyan Jing, Nanda Kambhatla and Salim Roukos
Proc. of ACL, 2004

A Statistical Model for Multilingual Entity Detection and Tracking
Radu Florian, Hany Hassan and Abe Ittycheriah, Hongyan Jing, Nanda Kambhatla, Xiaoqiang Luo, Nicolas Nicolov, Salim Roukos
Proc. of HLT-NAACL, 2004

IBM's PIQUANT II in TREC2004
K Czuba, J Chu-Carroll, J Prager, A Ittycheriah, S. Blair-Goldensohn
Proceedings of Text Retrieval Conference (TREC2004), NIST

A Multi-Strategy, Multi-Question Approach to Question Answering
J.M. Prager, J. Chu-Carroll, and K. Czuba
New Directions in Question-Answering, Maybury, M. (Ed.), AAAI Press, 2004

Term Aggregation: Mining Synonymous Expressions using Personal Stylistic Variations
Akiko Murakami Tetsuya Nasukawa
AMT Nasukawa, 2004

Reusable Dialog Components--Mainstreaming speech-enabled Web applications
J Huerta, D Lubensky, D Nahamoo, R Pieraccini, TV Raman, C Wiecha
J Huerta , D Lubensky, D Nahamoo, R ..., 2004

Question answering using constraint satisfaction
JM Prager, J Chu-Carroll, KW Czuba
Proceedings of the 42nd Meeting of the Association for Computational Linguistics, 2004

HSpell-the free Hebrew spell checker and morphological analyzer
N Har’el, D Kenigsberg
Israeli Seminar on Computational Linguistics, 2004

Evaluating ontology cleaning
C Welty, R Mahindru, J Chu-Carroll
Proceedings of the National Conference on Artificial Intelligence, pp. 311--316, 2004

IBM site report
Y Al-Onaizan, N Ge, Y S Lee, K Papineni, F Xia, C Tillmann
NIST 2004 MT Workshop

Improving document retrieval according to prediction of query difficulty
E Yom-Tov, S Fine, D Carmel, A Darlow, E Amitay
Proceedings of the 13th Text REtrieval Conference (TREC2004)

Interpreting loosely encoded questions
James Fan and Bruce Porter
AAAI 2004

IBMs PIQUANT II in TREC 2004
J Chu-Carroll, K Czuba, J Prager, A Ittycheriah, S Blair-Goldensohn
Proceedings of the 13th TREC Conference, 2004

Deeper sentiment analysis using machine translation technology
Kanayama Hiroshi, Nasukawa Tetsuya, Watanabe Hideo
Proceedings of the 20th international conference on Computational Linguistics, Association for Computational Linguistics, 2004
Abstract

Question answering using constraint satisfaction: QA-by-Dossier-with-Constraints
J Prager, J Chu-Carroll, K Czuba
Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, pp. 574, 2004


A Unigram Orientation Model for Statistical MT
C Tillmann
Proc. of HLT-NAACL 2004, Short Papers, pp. 101--104

A text-mining system for knowledge discovery from biomedical documents
N Uramoto, H Matsuzawa, T Nagano, A Murakami, H Takeuchi, K Takeda
IBM Systems Journal 43(3), 516--533, Armonk, NY: International Business Machines Corp., 2004

Modeling inverse covariance matrices by basis expansion
P.A. Olsen, R.A. Gopinath
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. 37--46, IEEE, 2004

Combining lexical, syntactic, and semantic features with maximum entropy models for extracting relations
N Kambhatla
Proceedings of the ACL 2004 on Interactive poster and demonstration sessions, pp. 22

Intricacies of Collins’ Parsing Model
Daniel M Bikel
Computational Linguistics, 2004

UIMA: an architectural approach to unstructured information processing in the corporate research environment

Nat. Lang. Eng. 10(3-4), Cambridge University Press, 2004
Abstract

A statistical model for multilingual entity detection and tracking
R Florian, H Hassan, A Ittycheriah, H Jing, N Kambhatla, X Luo, N Nicolov, S Roukos, T Zhang
of HLT-NAACL, IBM THOMAS J WATSON RESEARCH CENTER YORKTOWN HEIGHTS NY, 2004


2003

Incorporating Dictionary and Corpus Information into a Context Vector Measure of Semantic Relatedness
Siddharth Patwardhan
Master's Thesis, University of Minnesota, 2003

Using Measures of Semantic Relatedness for Word Sense Disambiguation
Siddharth Patwardhan, Satanjeev Banerjee, Ted Pedersen
Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics, pp. 241--257, Springer, 2003

Common Data Format Archiving of Large-Scale Intelligent Transportation Systems Data for Efficient Storage, Retrieval, and Portability
Taek Kwon, Nirish Dhruv, Siddharth Patwardhan, Eil Kwon
Transportation Research Record: Journal of the Transportation Research Board1836, 111--117, Transportation Research Board of the National Academies, 2003

CDF Archival of Large-Scaled ITS Data For Efficient Archival, Retrieval, and Portability
Nirish Dhruv, Taek Kwon, Siddharth Patwardhan, Eil Kwon
Proceedings of the Transportation Research Board 82nd Annual Meeting, 2003

An Efficient Decoder for History-Based Statistical Parsers
Xiaoqiang Luo, Min Tang
IBM Technical Report, RC23122 (W0402-129), 2003

An EM Algorithm for History-Based Statistical Parsers
Xiaoqiang Luo, Min Tang, Salim Roukos, Robert Ward
IBM Technical Report, RC23121 (W0402-128), 2003

A Maximum Entropy Chinese Character-Based Parser
Xiaoqiang Luo
Proc. of EMNLP, 2003

HowtogetaChineseName (Entity): Segmentation and Combination Issues
Hongyan Jing, Radu Florian, Xiaoqiang Luo, Tong Zhang, Abe Ittycheriah
," Proceedings of EMNLP, 2003

tRuEcasing
L Vita, A Ittycheriah, S Roukos, N Kambhatla
Proceedings of the ACL, 2003

The Semantics of Multiple Annotations
C Welty, JW Murdock
Research Report RC22979, IBM, 2003

Information Access in Large Spoken Archives
M Franz, B Ramabhadran, T Ward, M Picheny
ISCA Workshop on Multilingual Spoken Document Retrieval, 2003

IBM Research and the University of Colorado TREC 2003 Genomics Track
E W Brown, A Dolbey, L Hunter
Proceedings of The 12th Text Retrieval Conference, Gaithersburg, Md, November 2003

TIPS: a translingual information processing system
Y Al-Onaizan, R Florian, M Franz, H Hassan, YS Lee, S McCarley, K Papineni, S Roukos, J Sorensen, C Tillmann, others
Linguistics 29(1), 97--133, 2003

A framework for large scalable natural language call routing systems
C Wu, D Lubensky, J Huerta, X Li, H K J Kuo
Natural Language Processing and Knowledge Engineering, 2003, pp. 65--71

Document Clustering Based on Vector Quantization and Growing-Cell Structure
Z Su, L Zhang, Y Pan
LECTURE NOTES IN COMPUTER SCIENCE, 2003 - Springer

Multi-resolution disambiguation of term occurrences
E Amitay, R Nelken, W Niblack, R Sivan, A Soffer
Proceedings of the twelfth international conference on …, 2003 - portal.acm.org

A hand-held speech-to-speech translation system
B. Zhou, Y. Gao, J. Sorensen, D. D\'echelotte, M. Picheny
Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669

A real-time prototype for small-vocabulary audio-visual ASR
JH Connell, N Haas, E Marcheret, C Neti, G Potamianos, S Velipasalar
Multimedia and Expo, 2003, pp. II--469

Towards ontologies on demand
Y Park, B K Boguraev, R J Byrd
Semantic Web Technologies for Searching and Retrieving of Scientific Data, 2003

Use of statistical N-gram models in natural language generation for machine translation
F.H. Liu, L. Gu, Y. Gao, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--636


Maximum likelihood training of subspaces for inverse covariance modeling
Karthik Visweswariah, P Olsen, Ramesh Gopinath, Scott Axelrod
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--848

Arabic information retrieval at IBM
M Franz, J S McCarley
NIST SPECIAL PUBLICATION SP, 260--262, NATIONAL INSTIUTE OF STANDARDS \& TECHNOLOGY, 2003

Identifying and tracking entity mentions in a maximum entropy framework
A Ittycheriah, L Lita, N Kambhatla, N Nicolov, S Roukos, M Stys
Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: companion volume of the Proceedings of HLT-NAACL 2003--short papers-Volume 2, pp. 42

A Phrase-based Unigram Model for Statistical MT
C Tillmann, F Xia
Proc. of HLT'03, Short Papers, pp. 108, 2003

Topic Distillation with Knowledge Agents
E Amitay, D Carmel, A Darlow, R Lempel, A Soffer, …
NIST SPECIAL PUBLICATION SP, 2003 - trec.nist.gov

tRuE-casIng
L V Lita, A Ittycheriah, S Roukos, N Kambhatla
Proc, pp. 152--159, 2003

An architecture and data model for pipelined NLP applications
Mary S Neff, Roy J Byrd, Branimir K Boguraev
Proceedings of the HLT-NAACL 2003 workshop on Software engineering and architecture of language technology systems - Volume 8, pp. 1--8, Association for Computational Linguistics
Abstract

IBM's PIQUANT in TREC2003
J Prager, J Chu-Carroll, K Czuba, C Welty, A Ittycheriah, R Mahindru
Proceedings of the 12th Text REtrieval Conference, Citeseer, 2003


Efficient query evaluation using a two-level retrieval process
AZ Broder, D Carmel, M Herscovici, A Soffer, J …
Proceedings of the twelfth international conference on …, 2003 - portal.acm.org

A case for automated large-scale semantic annotation
S Dill, N Eiron, D Gibson, D Gruhl, R Guha, A Jhingran, T Kanungo, K S McCurley, S Rajagopalan, A Tomkins, others
Web Semantics: Science, Services and Agents on the World Wide Web 1(1), 115--132, Elsevier, 2003

Searching XML documents via XML fragments
Carmel, YS Maarek, M Mandelbrod, Y Mass, A Soffer
Proceedings of the 26th annual international ACM SIGIR …, 2003 - portal.acm.org

SemTag and Seeker: Bootstrapping the semantic web via automated semantic annotation
S Dill, N Eiron, D Gibson, D Gruhl, R Guha, A Jhingran, T Kanungo, S Rajagopalan, A Tomkins, J A Tomlin, others
Proceedings of the 12th international conference on World Wide Web, pp. 186, 2003


2002

Active learning for statistical natural language parsing
Min Tang, Xiaoqiang Luo, Salim Roukos
Proceedings of the 40th Annual Meeting of the ACL, 2002

Dynamic Programming based Search Algorithm for Statistical MT
C Tillmann, W Re-ordering
ordering - Ph. D. thesis, RWTH Aachen, 2001, 2002

In Question-Answering, Hit-List Size Matters
JM Prager
IBM TJ Watson Research Center Research Report\# RC22297, 2002

Seeker: An architecture for web-scale text analytics
S Dill, N Eiron, D Gibson, D Gruhl, A Jhingran, T Kanungo, KS McCurley, S Rajagopalan, A Tomkins, JA Tomlin, others
Technical Report, 2002

Use of WordNet hypernyms for answering what-is questions
J Chu-Carroll, J Prager
Proceedings of the Tenth Text REtrieval Conference, 2002

The VIP interface design system
R E Droms, K T Huang, C B Swart
Visual Languages, 1988, pp. 102--108, 2002

How Many Bits are Needed to Store Term Frequencies?
M Franz, J S McCarley
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 378, 2002

Statistical answer-type identification in open-domain question answering
J. Prager, J. Chu-Carroll, K. Czuba
Proceedings of the second international conference on Human Language Technology Research, pp. 150--156, Morgan Kaufmann Publishers Inc., 2002

A machine learning approach to introspection in a Question Answering system
Krzysztof Czuba, John Prager, Jennifer Chu-Carroll
Proceedings of the ACL-02 conference on Empirical methods in natural language processing - Volume 10, pp. 265--272, Association for Computational Linguistics, 2002
Abstract

A hybrid approach to natural language Web search
J. Chu-Carroll, J. Prager, Y. Ravin, C. Cesar
Proceedings of the ACL-02 conference on Empirical methods in natural language processing-Volume 10, pp. 180--187, Association for Computational Linguistics, 2002

MIND: A Semantics-based Multimodal Interpretation Framework for Conversational Systems
J Chai, Shimei Pan, M Zhou
Proceedings of International CLASS Workshop on Natural, Intelligent and Effective Interaction in Multimodal Dialog Systems, 2002

Improvements to the IBM Hub-5E system
J Huang, B Kingsbury, L Mangu, G Saon, R Sarikaya, G Zweig
NIST RT-02 Workshop, 2002

Segmentation and detection at IBM: Hybrid statistical models and two-tiered clustering
S Dharanipragada, M Franz, JS McCarley, T Ward, W J Zhu
Proc, pp. 135--148, Kluwer Academic Publishers Norwell, MA, USA, 2002

Turn-based language modeling for spoken dialog systems
R. Sarikaya, Y. Gao, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--781

Capitalization recovery for text
E W Brown, A R Coden
Lecture notes in computer science, 11--22, Springer, 2002


A flexible framework for developing mixed-initiative dialog systems
Judith Hochberg, Nanda Kambhatla, Salim Roukos
Proceedings of the 3rd SIGdial workshop on Discourse and dialogue - Volume 2, pp. 60--63, Association for Computational Linguistics, 2002
Abstract

A flexible framework for developing mixed-initiative dialog systems
J Hochberg, N Kambhatla, S Roukos
Proc. of 3rd SIGDIAL, 2002

Detecting similar documents using salient terms
J W Cooper, A R Coden, E W Brown
Proceedings of the eleventh international conference on Information and knowledge management, pp. 251, 2002

Recovering Latent Information in Treebanks
D Chiang, Daniel M Bikel
Proceedings of COLING02, 2002

A multi-strategy and multi-source approach to question answering
J Chu-Carroll, J Prager, C Welty, K Czuba, D Ferrucci
Proceedings of the 11th Text REtrieval Conference, pp. 281--288, Citeseer, 2002

Automatic glossary extraction: beyond terminology identification
Youngja Park, Roy J Byrd, Branimir K Boguraev
Proceedings of the 19th international conference on Computational linguistics - Volume 1, pp. 1--7, Association for Computational Linguistics, 2002
Abstract

Automatic query wefinement using lexical affinities with maximal information gain
D Carmel, E Farchi, Y Petruschka, A Soffer
Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 283--290, 2002

Maximum entropy model for punctuation annotation from speech
J Huang, G Zweig
Seventh International Conference on Spoken Language …, 2002 - ISCA

IBM's statistical question answering system-TREC-10
A Ittycheriah, M Franz, S Roukos
NIST SPECIAL PUBLICATION SP, 258--264, NATIONAL INSTIUTE OF STANDARDS \& TECHNOLOGY, 2002

Self-similarity in the web
S Dill, R Kumar, K S McCurley, S Rajagopalan, D Sivakumar, A Tomkins
ACM Transactions on Internet Technology (TOIT) 2(3), 205--223, ACM New York, NY, USA, 2002


2001

Speech Recognition for DARPA Communicator
A Aaron, S Chen, P Cohen, S Dharanipragada, E Eide, M Franz, J-M Leroux, X Luo, B Maison, L Mangu, T Mathes, M Novak, P Olsen, M Picheny, H Printz, B Ramabhadran, A Sakrajda, G Saon, B Tydlitat, K Visweswariah, D Yuk
Proc. ICASSP, 2001



Medical non-intrusive prevention based on network of embedded systems
N Kambhatla, D Kanevsky, W W Zadrozny, A Zlatsin
US Patent ..., 2001 - Google Patents, Google Patents
US Patent 6,238,337

A conversational interface for online shopping
S Govindappa, V Horvath, N Kambhatla, N Nicolov, W …
Human Language Technology Confererence, 2001

The information of observations and application for active learning with uncertainty
S Axelrod, S Fine, R Gilad-Bachrach, R Mendelson, N Tishby
Citeseer, Citeseer, 2001

A conversational interface for online shopping
J Chai, V Horvath, N Kambhatla, N Nicolov, M Stys-Budzikowska
Proceedings of the first international conference on Human language technology research, pp. 4, 2001

Automated authoring of coherent multimedia discourse in conversation systems
MX Zhou, Shimei Pan
Proceedings of the ninth ACM international conference on Multimedia (ACM MM), 2001

Conversational sales assistant for online shopping
Margo Budzikowska, Joyce Chai, Sunil Govindappa, Veronika Horvath, Nanda Kambhatla, Nicolas Nicolov, Wlodek Zadrozny
Proceedings of the first international conference on Human language technology research, pp. 1--2, Association for Computational Linguistics, 2001
Abstract

Summarisation miniaturisation: Delivery of news to hand-helds
B K Boguraev, R Bellamy, C Swart
2nd Meeting of the North American Chapter of the Association for Computational Linguistics (NAACL), 2001

Quantifying the utility of parallel corpora
M Franz, J S McCarley, T Ward, W J Zhu
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 398--399, 2001

Summarisation miniaturisation: Delivery of news to hand-helds
B Boguraev, R Bellamy, C Swart
NAACL 2001 workshop on automatic summarization, Pittsburgh

The role of a natural language conversational interface in online sales: A case study
J Chai, J Lin, W Zadrozny, Y Ye, M Stys-Budzikowska, V Horvath, N Kambhatla, C Wolf
International Journal of Speech Technology 4(3), 285--295, Springer, 2001

Natural language sales assistant-a web-based dialog system for online sales
J Chai, V Horvath, N Nicolov, M Stys-Budzikowska, N Kambhatla, W Zadrozny
Proc, pp. 19--26, 2001

Enhancing GMM Scores using SVM" Hints"
S Fine, J Navratil, R A Gopinath
Seventh European Conference on Speech Communication and Technology, 2001

Representing roles and purpose
James Fan, Ken Barker, Bruce Porter, Peter Clark
Proceedings of the 1st international conference on Knowledge capture, pp. 38--43, ACM, 2001
Abstract

Use of WordNet hypernyms for answering what-is questions
J Prager, J Chu-Carroll, K Czuba
Proceedings of the 10th Text REtrieval Conference, 2001

Designing an E-grocery application for a palm computer: usabilityand interface issues
R Bellamy, C Swart, WA Kellogg, J Richards, J Brezin
IEEE [see also IEEE Wireless Communications] Personal Communications 8(4), 60--64, 2001

One search engine or two for question-answering
J Prager, E Brown, D R Radev, K Czuba
Proceedings of TREC9, 2001

Question answering using maximum entropy components
A Ittycheriah, M Franz, W J Zhu, A Ratnaparkhi, R J Mammone
Second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies, pp. 1--7, 2001

Unsupervised and supervised clustering for topic tracking
M Franz, J S McCarley
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 310--317, 2001


Static index pruning for information retrieval systems
D Carmel, D Cohen, R Fagin, E Farchi, M Herscovici, Y S Maarek, A Soffer
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 43--50, 2001

IBM's statistical question answering system
A Ittycheriah, M Franz, W J Zhu, A Ratnaparkhi, R J Mammone
NIST SPECIAL PUBLICATION SP, 229--234, NATIONAL INSTIUTE OF STANDARDS \& TECHNOLOGY, 2001


2000

Semantic Tokenization of Verbalized Numbers in Language Models
Xiaoqiang Luo, Martin Franz
Inter. Conf. of Spoken Language Processing, 2000

Parser Adaptation Via Householder Transform
Xiaoqiang Luo
Proc. of ICASSP, 2000

The effects of analysing cohesion on document summarisation
Branimir K Boguraev, Mary S Neff
Proceedings of the 18th conference on Computational linguistics - Volume 1, pp. 76--82, Association for Computational Linguistics, 2000
Abstract

Exploring A New Paradigm for E-Groceries
R K E Bellamy, W A Kellogg, J T Richards, C B Swart
Proceedings of IBM Ease of Use Conference, 2000

DMML: An XML Language for Interacting with Multi-Modal Dialog Systems
Nanda Kambhatla, Malgorzata Budzikowska, Sylvie Levesque, Nicolas Nicolov, Wlodek Zadrozny, Charles Wiecha, Julie MacNaught, others
AAAI/IAAI, pp. 1008--1013, 2000

Anti-serendipity: finding useless documents and similar documents
J W Cooper, J M Prager
Hawaii International Conference on System Sciences, 2000, pp. 8

Influence of speech recognition errors on topic detection
J S McCarley, M Franz
Proceedings of the 23rd ACM SIGIR Conference on Information Retrieval, pp. 342--344, 2000

Ad hoc, cross-language and spoken document information retrieval at IBM
M Franz, J S McCarley, R T Ward
NIST SPECIAL PUBLICATION SP, 391--398, Citeseer, 2000


Evaluating automatic dialogue strategy adaptation for a spoken dialogue system
Jennifer Chu-Carroll, Jill Suzanne Nickerson
Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference, pp. 202--209, Morgan Kaufmann Publishers Inc., 2000
Abstract

Statistical Methods for Machine Translation
S Vogel, F J Och, C Tillmann, S Nie{ss}en, H Sawaf, H Ney
Verbmobil: Foundations of Speech-to-Speech Translation, 377--393, 2000


MIMIC: An adaptive mixed initiative spoken dialogue system for information queries
J Chu-Carroll
Proceedings of the 6th Conference on Applied Natural Language Processing, 2000

Discourse segmentation in aid of document summarization
B K Boguraev, M S Neff
33rd International Conference on System Sciences, 2000

Multi-document summarization by visualizing topical content
R K Ando, B K Boguraev, R J Byrd, M S Neff
Proceedings of the 2000 NAACL-ANLP Workshop on Automatic Summarization, pp. 79--98, Association for Computational Linguistics

Two Statistical Parsing Models Applied to the Chinese Treebank
Daniel M Bikel, D Chiang
Proceedings of the second workshop on Chinese language …, 2000

Algorithms for statistical translation of spoken language
H Ney, S Nie{ss}en, FJ Och, H Sawaf, C Tillmann, S Vogel, L Inf fur, T H Aachen
IEEE Transactions on Speech and Audio Processing 8(1), 24--36, 2000


Efficient clustering of very large document collections
Inderjit S. Dhillon, James Fan, Yuqiang Guan
Data Mining for Scientific and Engineering Applications, Kluwer Academic Publishers, 2000

Document clustering using word clusters via the information bottleneck method
N Slonim, N Tishby
Proceedings of the 23rd conference on Research and development in information retrieval (SIGIR), pp. 208--215, 2000

Information retrieval on the web
M Kobayashi, K Takeda
ACM Computing Surveys (CSUR,null), 2000 - portal.acm.org

Statistical Methods for Machine Translation.
S. Vogel, F.J. Och, H. Sawaf, C.Tillmann and H. Ney
Verbmobil: foundations of speech-to-speech translation, pp. 377-393, Springer verlag, 2000


1999

Unsupervised Adaptation of Statistical Parsers Based on Markov Transform
Xiaoqiang Luo, Salim Roukos, Todd Ward
IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), 1999

Probabilistic Classification of HMM States
Xiaoqiang Luo, Frederick Jelinek
Inter. Conf. of Acoustics, Speech and Signal Processing (ICASSP), 1999

Cursive word recognition using a random field based hidden Markov model. Int
G Saon
Journal of Pattern Recognition and Artificial Intelligence, 1999

Acoustic-prosodic disambiguation of direct and indirect speech acts
J Nickerson, J Chu-Carroll
roceedings of the 14 th International Congress of Phonetic Sciences, 1999

Maximum likelihood estimates for exponential type density families
S. Basu, C.A. Micchelli, P.A. Olsen
Acoustics, Speech, and Signal Processing, 1999. ICASSP'99. Proceedings., 1999 IEEE International Conference on, pp. 361--364

Dynamic visual metaphors for news story abstractions
R Bellamy, B Boguraev, C Kennedy
32nd International Conference on System Sciences, pp. 2003, 1999

Tracking initiative in collaborative dialogue interactions
M K Brown, J Chu-Carroll
Madrid, Spain Patent
US Patent 5,999,904

Phrase splicing and variable substitution using the IBM trainable speech synthesis system
RE Donovan, M Franz, JS Sorensen, S Roukos
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing- Proceedings, pp. 373--376, 1999

Word informativeness and automatic pitch accent modeling
S Pan, K McKeown
Proceedings of EMNLP/VLC99, 1999 - Citeseer

Constructing and utilizing a model of user preferences in collaborative consultation dialogues
S Carberry, J Chu-Carroll, S Elzer
Computational Intelligence, 1999

Form-based reasoning for mixed-initiative dialogue management in information-query systems
J Chu-Carroll
Sixth European Conference on Speech Communication and Technology, 1999

Empirically evaluating an adaptable spoken dialogue system
DJ Litman, Shimei Pan
Proceedings of the 7th International Conference on User Modeling (UM), 1999

A Statistical Parser for Czech
M Collins, J Hajic, L Ramshaw, C Tillman
Proc. of ACL'99, pp. 505--512, 1999

Vector-based natural language call routing
J Chu-Carroll, B Carpenter
Computational Linguistics25, 361--388, MIT Press, 1999
Abstract

Improved Alignment Models for Statistical Machine Translation
F J Och, C Tillmann, H Ney, others
Proc. of EMNLP'99, pp. 20--28, 1999

An algorithm that learns what's in a name
DM Bikel, R Schwartz, RM Weischedel
Machine learning, 1999 - Springer


1998

Nonreciprocal Data Sharing in Estimating HMM parameters
Xiaoqiang Luo, Frederick Jelinek
Inter. Conf. of Spoken Language Processing (ICSLP'98), 1998

NetVista: Growing an Internet solution for schools
W A Kellogg, J T Richards, C Swart, P Malkin, M Laff, V Hanson, B Hailpern
IBM Systems Journal 37(1), 19--41, IBM, 1998

Core natural language processing technology applicable to multiple languages--Workshop’98
J Hajic, E Brill, M Collins, B Hladka, D Jones, C Kuo, L Ramshaw, O Schwartz, C Tillmann, D Zeman
Technical Report, 1998

A method for relating multiple newspaper articles by using graphs, and its application to webcasting
Naohiko Uramoto, Koichi Takeda
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics-Volume 2, pp. 1307--1313, 1998

Conversation machines for transaction processing
W Zadrozny, C Wolf, N Kambhatla, Y Ye, others
PROCEEDINGS OF THE NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, pp. 1160--1167, 1998

A study on natural language call routing
CH Lee, B Carpenter, W Chou, J Chu-Carroll, W Reichl, A Saad, Q Zhou
IEEE 4th Workshop Interactive Voice Technology for Telecommunication Applications, 1998

A statistical model for discourse act recognition in dialogue interactions
J Chu-Carroll
Applying Machine Learning to Discourse Processing, 1998

A pattern-based machine translation system extended by example-based processing
H Watanabe, K Takeda
Proceedings of the 17th international conference on …, 1998 - portal.acm.org

Dialogue management in vector-based call routing
Jennifer Chu-Carroll, Bob Carpenter
Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1, pp. 256--262, Association for Computational Linguistics, 1998
Abstract

Dynamic presentation of document content for rapid on-line browsing.
B Boguraev, YY Wong, C Kennedy, R Bellamy, S Brawer, J Swartz
AAAI Spring Symposium on Intelligent Text Summarization, pp. 109--118, Stanford, CA, 1998

Natural language call routing: A robust, self-organizing approach
B Carpenter, J Chu-Carroll
Fifth International Conference on Spoken Language Processing, 1998

An evidential model for tracking initiative in collaborative dialogue interactions
J Chu-Carroll, M Brown
User Modeling and User-Adapted Interaction 8(3), 215--254, Springer, 1998

Collaborative response generation in planning dialogues
Jennifer Chu-Carroll, Sandra Carberry
Comput. Linguist.24, 355--400, MIT Press, 1998
Abstract

The hierarchical hidden Markov model: Analysis and applications
S Fine, Y Singer, N Tishby
Machine learning 32(1), 41--62, Springer, 1998


1997

Optimal dimension reduction by local PCA
N Kambhatla, TK Leen
Neural Computation, 1997

Initiative in collaborative interactions-its cues and effects
J Chu-Carroll, M Brown
Proc. of AAAI Spring Symposium, 1997

Word triggers and the EM algorithm
C Tillmann, H Ney
Proc. of CoNLL'97 at ACL/EACL'97, pp. 117-124, 1997

Dimension reduction by local PCA
N Kambhatla, TK Leen
Neural Computation, 1997

Agnostic classification of Markovian sequences
R El-Yaniv, S Fine, N Tishby
In Advances in Neural Information Processing (NIPS-97, 1997

Accelerated DP based search for Statistical Translation
C Tillmann, S Vogel, H Ney, A Zubiaga, H Sawaf
Proc. of EUROSPEECH'97, pp. 2667--2670, 1997

A DP based Search using Monotone Alignments in Statistical Translation
C Tillmann, S Vogel, H Ney, A Zubiaga
Proc. of ACL'97, pp. 289--296, 1997

Dimension reduction by local principal component analysis
N Kambhatla, TK Leen
Neural Computation, 1997 - MIT Press

Nymble: a high-performance learning name-finder
DM Bikel, S Miller, R Schwartz, R Weischedel
Proceedings of the fifth conference on Applied natural …, 1997 - portal.acm.org


1996

An Iterative Algorithm to Build Chinese Language Models
Xiaoqiang Luo, Salim Roukos
Proc. of ACL, pp. 139-143, 1996

Modeling intention: Issues for spoken language dialogue systems
S Carberry, J Chu-Carroll, L Lambert
Proceedings of the International Symposium on Spoken Dialogue, 1996

Statistical Language Modeling and Word Triggers
C Tillmann, H Ney
Proc. of SPECOM'96, pp. 22--27, 1996

Recognition of unconstrained handwritten words using Markov random fields and HMMs
George A Saon, A Belaid
Fifth International Workshop on Frontiers in Handwriting …, 1996

Selection criteria for word trigger pairs in language modelling
C Tillmann, H Ney
Lecture Notes in Computer Science1147, 95--106, Springer, 1996

Conflict detection and resolution in collaborative planning
J Chu-Carroll, S Carberry
Lecture Notes in Computer Science,, pp. 111--126, Springer, 1996

Pattern-based machine translation
K Takeda
Proceedings of the 16th conference on Computational …, 1996 - portal.acm.org

Pattern-based context-free grammars for machine translation
K Takeda
Proceedings of the 34th annual meeting on Association for …, 1996 - portal.acm.org


HMM-based word alignment in statistical translation
S Vogel, H Ney, C Tillmann
Proc. of COLING'96, pp. 836--841, 1996


1995

Lexical semantics in context
James Pustejovsky, Branimir Boguraev
Journal of Semantics, 1995

Machine-readable dictionaries and computational linguistic research
Branimir K Boguraev
Automating the Lexicon: Research and Practice in a Multi-Lingual Environment, Oxford University Press, 1995

Communication for conflict resolution in multi-agent collaborative planning
J Chu-Carroll, S Carberry
Proceedings of the First International Conference on Multiagent Systems, 1995

Generating information-sharing subdialogues in expert-user consultation
J Chu-Carroll, S Carberry
INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, pp. 1243--1250, 1995

Response generation in collaborative negotiation
Jennifer Chu-Carroll, Sandra Carberry
Proceedings of the 33rd annual meeting on Association for Computational Linguistics, pp. 136--143, Association for Computational Linguistics, 1995
Abstract

Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task
LR Bahl, S. Balakrishnan-Aiyer, JR Bellgarda, M. Franz, PS Gopalakrishnan, D. Nahamoo, M. Novak, M. Padmanabhan, MA Picheny, S. Roukos
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on, pp. 41--44


1994

Tricolor DAGs for machine translation
K Takeda
Proceedings of the 32nd annual meeting on Association for …, 1994 - portal.acm.org

Portable knowledge sources for machine translation
K Takeda
Proceedings of the 15th conference on Computational …, 1994 - portal.acm.org


Use of stochastic models in text recognition
A Belaid, G Saon
??????? ???, 1994 - dbpia.co.kr

Fast non-linear dimension reduction
TK Leen, N Kambhatla
Advances in Neural Information Processing Systems, 1994 - cse.ogi.edu


Lexical assistance at the information-retrieval user interface
R J Byrd, Y Ravin, J Prager
1994 - IBM TJ Watson Research Center, IBM TJ Watson Research Center

Recognizing and utilizing user preferences in collaborative consultation dialogues
S Elzer, J Chu-Carroll, S Carberry
roceedings of the Fourth International Conference on User Modeling, 1994

Robust methods for using context-dependent features and models in a continuous speech recognizer
LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, pp. I--533

Fast incremental indexing for full-text information retrieval
E W Brown, J P Callan, W B Croft
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, pp. 192--192, 1994


1993

Fast nonlinear dimension reduction. ICNN'93
N Kambhatla, TK Leen
1993 International Conference on Neural Networks, 1993

Word lookahead scheme for cross-word right context models in a stack decoder
LR Bahl, P V Souza, PS Gopalakrishnan, D Nahamoo, M Picheny
Third European Conference on Speech Communication and Technology, 1993


1992

Lexical knowledge and lexical knowledge bases
Branimir Boguraev, Beth Levin
Semantics and the Lexicon, 1992

Corpora, lexical semantics and lexical evaluation
Branimir K Boguraev
Second Workshop of the Consortium for Lexical Research, 1992

From representation of texts to knowledge about words
B K Boguraev, M S Neff
Journal of Literary and Linguistic Computing, 1992

Shalt2: a symmetric machine translation system with conceptual transfer
Koichi Takeda, Tetsuya Nasukawa, Naohiko Uramoto, Taijiro Tsutsumi
Proceedings of the 14th conference on Computational linguistics-Volume 3, pp. 1034--1038, 1992


1991

An iterativeflip-flop'approximation of the most informative split in the construction of decision trees
A. N\'adas, D. Nahamoo, M.A. Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 565--568

Building a lexicon: The contribution of computers
Branimir K Boguraev
International Journal of Lexicography, 1991


1990

Local models and Gaussian mixture models for statistical data processing
K Nandakishore
sl]: Institute of Technology Benaras Hindu University, 1990

REASON: An intelligent user assistant for interactive environments
J M Prager, D M Lamberti, D L Gardner, S R Balzac
IBM Systems Journal 29(1), 141--164, IBM, 1990

Database models for computational lexicography
B Boguraev, E J Briscoe, J Carroll, A Copestake
Proceedings off EURALEX IV, 1990

Lexical ambiguity and the role of knowledge representation in lexicon design
Branimir Boguraev, James Pustejovsky
Proceedings of the 13th conference on Computational linguistics - Volume 2, pp. 36--41, Association for Computational Linguistics, 1990
Abstract

Enjoy the paper: lexical semantics via lexicology
Ted Briscoe, Ann Copestake, Bran Boguraev
Proceedings of the 13th conference on Computational linguistics - Volume 2, pp. 42--47, Association for Computational Linguistics, 1990
Abstract


1989

From machine-readable dictionaries to lexical data bases
M S Neff, B Boguraev
1989 - IBM TJ Watson Research Center, IBM TJ Watson Research Center

Dictionaries, dictionary grammars and dictionary entry parsing
Mary S Neff, Branimir K Boguraev
Proceedings of the 27th annual meeting on Association for Computational Linguistics, pp. 91--101, Association for Computational Linguistics, 1989
Abstract

Large vocabulary natural language continuous speech recognition
LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, others
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467


1988

The Grammar Development Environment User Manual
J Carroll, B Boguraev, C Grover, T Briscoe
Cambridge Computer Laboratory Technical Report127, 1988

Decoder selection based on cross-entropies
PS Gopalakrishnan, D. Kanevsky, A. Nadas, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 20--23


1987

The derivation of a grammatically-indexed lexicon from the Longman Dictionary of Contemporary English.
B Boguraev, E Briscoe, J Carroll, D Carter, C Grover
25th Annual Meeting of the Association for Computational Linguistics, Stanford, CA, 1987

The ALVEY natural language tools project grammar: A large computational grammar
T Briscoe, C Grover, B Boguraev, J Carroll
ALVEY Documents, Cambridge Univ., Computer Laboratory, UK, 1987

A formalism and environment for practical grammar development
E J Briscoe, C Grover, B K Boguraev, J Carroll
10th International Joint Conference on Artificial Intelligence, pp. 703--8, 1987

An Object-Oriented Toolkit for Visual Interface Design
K Huang, R Droms, C Swart, J Mastrodiulio, R Li
IBM ITL Meeting on Image Processing, Toronto, Canada, 1987


1983

The project automated librarian
J M Prager
IBM Systems Journal 22(3), 214--228, IBM, 1983


1981

The FINITE STRING Newsletter
R.C. Berwick, K. Church, R. Patil, H.M. Gigley, R.M. Kaplan, M. Kay, D.T. Langendoen, J.A. Moyne, R. Milne, R. Grishman, others
American Journal of Computational Linguistics 7(3), 181, 1981


Year Unknown



NetVista: Growing an Internet solution for schools-Author bios
WA Kellogg, JT Richards, C Swart, P Malkin, M Laff …
research.ibm.com, 0

Fast non-linear dimension reduction. InJ. D. Cowan, G. Tesauro, andJ. Alspector, editors
N Kambhatla, TK Leen
Advances in Neural Information Processing Systems-6, 0



The Talent 5.1 TFst system: user documentation and grammar writing manual

2003 - IBM TJ Watson Research Center, ...

An annotation-based finite state system for UIMA: User documentation and grammar writing manual

IBM TJ Watson Research Center, 2007

Sensei: Spoken language assessment for call center agents

... , 2007. ASRU. IEEE ..., 2008 - ieeexplore.ieee.org