A Suffix-Based Noun and Verb Classifier for an Inflectional Language
Found in: Asian Language Processing, International Conference on
By Navanath Saharia, Utpal Sharma, Jugal Kalita
Issue Date: December 2010
pp. 19-22
Nouns and verbs pose the major challenge in part-of-speech tagging exercises. In this paper we present a suffix-based noun and verb classifier for Assamese, an inflectional, relatively free word order Indic language. We used a tiny dictionary of frequent w...
 
A Comparison of Approaches for Geospatial Entity Extraction from Wikipedia
Found in: International Conference on Semantic Computing
By Daryl Woodward, Jeremy Witmer, Jugal Kalita
Issue Date: September 2010
pp. 402-407
In this paper we target the challenge of extracting geospatial data from the article text of the English Wikipedia. We present the results of a Hidden Markov Model (HMM) based approach to identify location-related named entities in our corpus of Wikipe...
 
Extracting Geospatial Entities from Wikipedia
Found in: International Conference on Semantic Computing
By Jeremy Witmer, Jugal Kalita
Issue Date: September 2009
pp. 450-457
This paper addresses the challenge of extracting geospatial data from the article text of the English Wikipedia. In the first phase of our work, we create a training corpus and select a set of word-based features to train a Support Vector Machine (SVM) for...
 
A Branch and Bound Algorithm to Scale Alignment of Large Ontologies
Found in: International Conference on Semantic Computing
By Suzette Stoutenburg, Kaily Ewing, Lisa Hines, Jugal Kalita
Issue Date: September 2009
pp. 349-354
Increasingly, ontologies are being developed and exposed on the Web to support a variety of applications, including biological knowledge sharing, enhanced search and discovery, and decision support. This proliferation of new Web knowledge sources is result...
 
Shifting-and-scaling Correlation based Biclustering Algorithm
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Hasin Ahmed, Priyakshi Mahanta, Dhruba Bhattacharyya, Jugal Kalita
Issue Date: May 2014
pp. 1
The existence of various types of correlations among the expressions of a group of biologically significant genes poses challenges in developing effective methods of gene expression data analysis. The initial focus of computational biologists was to work w...
 
Design and Evaluation of Soft Keyboards for Brahmic Scripts
Found in: ACM Transactions on Asian Language Information Processing (TALIP)
By Albert Brouillette, Jugal Kalita, Leigh Gathings
Issue Date: June 2013
pp. 1-37
Despite being spoken by a large percentage of the world, Indic languages in general lack user-friendly and efficient methods for text input. These languages have poor or no support for typing. Soft keyboards, because of their ease of installation and lack ...
     
Analysis and evaluation of stemming algorithms: a case study with Assamese
Found in: Proceedings of the International Conference on Advances in Computing, Communications and Informatics (ICACCI '12)
By Jugal Kalita, Navanath Saharia, Utpal Sharma
Issue Date: August 2012
pp. 842-846
Stemming is the process of automatically extracting the base form of a given word of a language. Assamese is a morphologically rich, relatively free word order, Indo-Aryan language spoken in the North-Eastern part of India that uses Assamese-Bengali script for...
     
Summarization as feature selection for text categorization
Found in: Proceedings of the tenth international conference on Information and knowledge management (CIKM'01)
By Aleksander Kolcz, Jugal Kalita, Vidya Prabakarmurthi
Issue Date: October 2001
pp. 365-370
We address the problem of evaluating the effectiveness of summarization techniques for the task of document categorization. It is argued that for a large class of automatic categorization algorithms, extraction-based document categorization can be viewed a...