Search For:

Displaying 1-28 out of 28 total
Bird Flu Outbreak Prediction via Satellite Tracking
Found in: IEEE Intelligent Systems
By Yuanchun Zhou,Mingjie Tang,Weike Pan,Jinyan Li,Weihang Wang,Jing Shao,Liang Wu,Jianhui Li,Qiang Yang,Baoping Yan
Issue Date:July 2014
pp. 10-17
Advanced satellite tracking technologies have collected huge amounts of wild bird migration data. Biologists use these data to understand dynamic migration patterns, study correlations between habitats, and predict global spreading trends of avian influenz...
 
Distance Based Subspace Clustering with Flexible Dimension Partitioning
Found in: Data Engineering, International Conference on
By Guimei Liu, Jinyan Li, Kelvin Sim, Limsoon Wong
Issue Date:April 2007
pp. 1250-1254
Traditional similarity or distance measurements usually become meaningless when the dimensions of the datasets increase, which has detrimental effects on clustering performance. In this paper, we propose a distance-based subspace clustering model, called n...
 
Guest Editors' Introduction: Data Mining in Bioinformatics
Found in: IEEE Intelligent Systems
By Jinyan Li, Limsoon Wong, Qiang Yang
Issue Date:November 2005
pp. 16-18
This special issue aims to bridge the gap between bioinformatics and data mining by presenting research integrating the two. Data mining has the potential to provide the necessary tools for better understanding of gene expression, drug design, and other em...
 
Diagnostic Rules Induced by an Ensemble Method for Childhood Leukemia
Found in: Bioinformatic and Bioengineering, IEEE International Symposium on
By Jinyan Li, Huiqing Liu, Ling Li
Issue Date:October 2005
pp. 246-249
We introduce a new ensemble method based on decision tree to discover significant and diversified rules for subtype classification of childhood acute lymphoblastic leukemia, a heterogeneous disease with individual subtypes differing in their response to ch...
 
Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment
Found in: Data Mining, IEEE International Conference on
By Kelvin Sim, Jinyan Li, Vivekanand Gopalkrishnan, Guimei Liu
Issue Date:December 2006
pp. 1059-1063
We introduce an unsupervised process to co-cluster groups of stocks and financial ratios, so that investors can gain more insight on how they are correlated. Our idea for the co-clustering is based on a graph concept called maximal quasi-bicliques, which c...
 
Selection of Patient Samples and Genes for Outcome Prediction
Found in: Computational Systems Bioinformatics Conference, International IEEE Computer Society
By Huiqing Liu, Jinyan Li, Limsoon Wong
Issue Date:August 2004
pp. 382-392
Gene expression profiles with clinical outcome data enable monitoring of disease progression and prediction of patient survival at the molecular level. We present a new computational method for outcome prediction. Our idea is to use an informative subset o...
 
Ensembles of Cascading Trees
Found in: Data Mining, IEEE International Conference on
By Jinyan Li, Huiqing Liu
Issue Date:November 2003
pp. 585
We introduce a new method, called CS4, to construct committees of decision trees for classification. The method considers different top-ranked features as the root nodes of member trees. This idea is particularly suitable for dealing with high-dimensional ...
 
Solving the Fragmentation Problem of Decision Trees by Discovering Boundary Emerging Patterns
Found in: Data Mining, IEEE International Conference on
By Jinyan Li, Limsoon Wong
Issue Date:December 2002
pp. 653
The single coverage constraint discourages a decision tree to contain many significant rules. The loss of significant rules leads to a loss in accuracy. On the other hand, the fragmentation problem causes a decision tree to contain too many minor rules. Th...
 
Coupling Graphs, Efficient Algorithmsand B-Cell Epitope Prediction
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Liang Zhao,Steven C.H. Hoi,Zhenhua Li,Limsoon Wong,Hung Nguyen,Jinyan Li
Issue Date:January 2014
pp. 7-16
Coupling graphs are newly introduced in this paper to meet many application needs particularly in the field of bioinformatics. A coupling graph is a two-layer graph complex, in which each node from one layer of the graph complex has at least one connection...
   
Maximal Biclique Subgraphs and Closed Pattern Pairs of the Adjacency Matrix: A One-to-One Correspondence and Mining Algorithms
Found in: IEEE Transactions on Knowledge and Data Engineering
By Jinyan Li, Guimei Liu, Haiquan Li, Limsoon Wong
Issue Date:December 2007
pp. 1625-1637
Enumerating maximal biclique subgraphs from a graph is a computationally challenging problem. In this paper, we efficiently enumerate them through the use of closed patterns of the adjacency matrix of the graph. For an undirected graph $G$ without self-loo...
 
Using Fixed Point Theorems to Model the Binding in Protein-Protein Interactions
Found in: IEEE Transactions on Knowledge and Data Engineering
By Jinyan Li, Haiquan Li
Issue Date:August 2005
pp. 1079-1087
The binding in protein-protein interactions exhibits a kind of biochemical stability in cells. The mathematical notion of fixed points also describes stability. A point is a fixed point if it remains unchanged after a transformation by a function. Many poi...
 
Detection of Outlier Residues for Improving Interface Prediction in Protein Heterocomplexes
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Peng Chen, Limsoon Wong, Jinyan Li
Issue Date:July 2012
pp. 1155-1165
Sequence-based understanding and identification of protein binding interfaces is a challenging research topic due to the complexity in protein systems and the imbalanced distribution between interface and noninterface residues. This paper presents an outli...
 
Antibody-Specified B-Cell Epitope Prediction in Line with the Principle of Context-Awareness
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Liang Zhao, Limsoon Wong, Jinyan Li
Issue Date:November 2011
pp. 1483-1494
Context-awareness is a characteristic in the recognition between antigens and antibodies, highlighting the reconfiguration of epitope residues when an antigen interacts with a different antibody. A coarse binary classification of antigen regions into epito...
 
Mining Iterative Generators and Representative Rules for Software Specification Discovery
Found in: IEEE Transactions on Knowledge and Data Engineering
By David Lo, Jinyan Li, Limsoon Wong, Siau-Cheng Khoo
Issue Date:February 2011
pp. 282-296
Billions of dollars are spent annually on software-related cost. It is estimated that up to 45 percent of software cost is due to the difficulty in understanding existing systems when performing maintenance tasks (i.e., adding features, removing bugs, etc....
 
Sequence-based B-cell epitope prediction by using associations in antibody-antigen structural complexes
Found in: Bioinformatics and Biomedicine Workshop, IEEE International Conference on
By Liang Zhao, Jinyan Li
Issue Date:November 2009
pp. 165-172
B-cell secreted antibodies play a critical role in fighting against the invaders and abnormal self tissues. Identifying the epitope on antigens recognized by the paratope on antibodies can enlighten the understanding of this important immune mechanism. Pre...
 
Prediction of protein long-range contacts using GaMC approach with sequence profile centers
Found in: Bioinformatics and Biomedicine Workshop, IEEE International Conference on
By Peng Chen, Jinyan Li
Issue Date:November 2009
pp. 128-135
In this paper, we apply an evolutionary optimization classifier, referred to as genetic algorithm-based multiple classifier (GaMC), to the long-range contacts prediction. As a result, about 44.1% contacts between long-range residues (with a sequence separa...
 
High Functional Coherence in k-Partite Protein Cliques of Protein Interaction Networks
Found in: Bioinformatics and Biomedicine, IEEE International Conference on
By Qian Liu, Yi-Ping Phoebe Chen, Jinyan Li
Issue Date:November 2009
pp. 111-117
We introduce a new topological concept called k-partite protein cliques to study protein interaction (PPI) networks.In particular, we examine functional coherence of proteins in k-partite protein cliques. A k-partite protein clique is a k-partite maximal c...
 
Modeling Protein Interacting Groups by Quasi-Bicliques: Complexity, Algorithm, and Application
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Xiaowen Liu, Jinyan Li, Lusheng Wang
Issue Date:April 2010
pp. 354-364
HASH(0x2973610)
 
Burial level change defines a high energetic relevance for protein binding interfaces
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics
By Zhenhua Li,Ying He,Limsoon Wong,Jinyan Li
Issue Date:February 2015
pp. 1
Protein-protein interfaces defined through atomic contact or solvent accessibility change are widely adopted in structural biology studies. But, these definitions cannot precisely capture energetically important regions at protein interfaces. The burial de...
 
Feature Selection in Life Science Classification: Metaheuristic Swarm Search
Found in: IT Professional
By Simon Fong,Suash Deb,Xin-She Yang,Jinyan Li
Issue Date:July 2014
pp. 24-29
The purpose of classification in medical informatics is to predict the presence or absence of a particular disease as well as disease types from historical data. Medical data often contain irrelevant features and noise, and an appropriate subset of the sig...
 
Relative risk and odds ratio: a data mining perspective
Found in: Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (PODS '05)
By Haiquan Li, Jinyan Li, Limsoon Wong, Mengling Feng, Yap-Peng Tan
Issue Date:June 2005
pp. 368-377
We are often interested to test whether a given cause has a given effect. If we cannot specify the nature of the factors involved, such tests are called model-free studies. There are two major strategies to demonstrate associations between risk factors (ie...
     
Model the complex dependence structures of financial variables by using canonical vine
Found in: Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM '12)
By Jinyan Li, Longbing Cao, Wei Wei, Xuhui Fan
Issue Date:October 2012
pp. 1382-1391
Financial variables such as asset returns in the massive market contain various hierarchical and horizontal relationships forming complicated dependence structures. Modeling and mining of these structures is challenging due to their own high structural com...
     
Detection of Outlier Residues for Improving Interface Prediction in Protein Heterocomplexes
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
By Jinyan Li, Limsoon Wong, Peng Chen
Issue Date:July 2012
pp. 1155-1165
Sequence-based understanding and identification of protein binding interfaces is a challenging research topic due to the complexity in protein systems and the imbalanced distribution between interface and noninterface residues. This paper presents an outli...
     
Antibody-Specified B-Cell Epitope Prediction in Line with the Principle of Context-Awareness
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
By Jinyan Li, Liang Zhao, Liang Zhao, Limsoon Wong, Limsoon Wong
Issue Date:November 2011
pp. 1483-1494
Context-awareness is a characteristic in the recognition between antigens and antibodies, highlighting the reconfiguration of epitope residues when an antigen interacts with a different antibody. A coarse binary classification of antigen regions into epito...
     
Negative correlations in collaboration: concepts and algorithms
Found in: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '10)
By Jinyan Li, Qian Liu, Tao Zeng
Issue Date:July 2010
pp. 463-472
This paper studies efficient mining of negative correlations that pace in collaboration. A collaborating negative correlation is a negative correlation between two sets of variables rather than traditionally between a pair of variables. It signifies a sync...
     
Modeling Protein Interacting Groups by Quasi-Bicliques: Complexity, Algorithm, and Application
Found in: IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
By Jinyan Li, Lusheng Wang, Lusheng Wang, Xiaowen Liu, Xiaowen Liu
Issue Date:April 2010
pp. 354-364
HASH(0x2973610)
     
Mining statistically important equivalence classes and delta-discriminative emerging patterns
Found in: Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '07)
By Guimei Liu, Jinyan Li, Limsoon Wong
Issue Date:August 2007
pp. 430-439
The support-confidence framework is the most common measure used in itemset mining algorithms, for its antimonotonicity that effectively simplifies the search lattice. This computational convenience brings both quality and statistical flaws to the results ...
     
Efficient mining of emerging patterns: discovering trends and differences
Found in: Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining (KDD '99)
By Guozhu Dong, Jinyan Li
Issue Date:August 1999
pp. 43-52
This talk is an interim report on the 5 year plan launched in 1996 to provide a theoretical and computational foundation of Statistics for massive data sets. The plan coincided with the formation of AT&T Labs and the proposed research agenda of the In...
     
 1