Displaying 1-50 out of 125 total
A Novel Contrast Co-learning Framework for Generating High Quality Training Data
Found in: Data Mining, IEEE International Conference on
By Zeyu Zheng, Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen, Ming Zhang
Issue Date:December 2010
pp. 649-658
The good performances of most classical learning algorithms are generally founded on high quality training data, which are clean and unbiased. The availability of such data is however becoming much harder than ever in many real world problems due to the di...
 
Application of Strength Reduction FEM in Anti-sliding Stability at Dam Foundation
Found in: Computer Science and Information Engineering, World Congress on
By Si Jianhui, Jian Zheng, Chen Xi
Issue Date:April 2009
pp. 751-754
An analysis on safety factor of anti-sliding stability at dam foundation through c-φ reduction algorithm by finite elements is presented. When the system reaches instability, the numerical non-convergence occurs simultaneously. The safety factor is then ob...
 
Cross Domain Random Walk for Query Intent Pattern Mining from Search Engine Log
Found in: Data Mining, IEEE International Conference on
By Siyu Gu, Jun Yan, Lei Ji, Shuicheng Yan, Junshi Huang, Ning Liu, Ying Chen, Zheng Chen
Issue Date:December 2011
pp. 221-230
Understanding search intents of users through their condensed short queries has attracted much attention both in academia and industry. The search intents of users are generally assumed to be associated with various query patterns, such as
 
P-packSVM: Parallel Primal grAdient desCent Kernel SVM
Found in: Data Mining, IEEE International Conference on
By Zeyuan Allen Zhu, Weizhu Chen, Gang Wang, Chenguang Zhu, Zheng Chen
Issue Date:December 2009
pp. 677-686
It is an extreme challenge to produce a nonlinear SVM classifier on very large scale data. In this paper we describe a novel P-packSVM algorithm that can solve the Support Vector Machine (SVM) optimization problem with an arbitrary kernel. This algorithm e...
 
Inverse Time Dependency in Convex Regularized Learning
Found in: Data Mining, IEEE International Conference on
By Zeyuan Allen Zhu, Weizhu Chen, Chenguang Zhu, Gang Wang, Haixun Wang, Zheng Chen
Issue Date:December 2009
pp. 667-676
In the conventional regularized learning, training time increases as the training set expands. Recent work on L2 linear SVM challenges this common sense by proposing the inverse time dependency on the training set size. In this paper, we first put forward ...
 
TransRank: A Novel Algorithm for Transfer of Rank Learning
Found in: Data Mining Workshops, International Conference on
By Depin Chen, Jun Yan, Gang Wang, Yan Xiong, Weiguo Fan, Zheng Chen
Issue Date:December 2008
pp. 106-115
Recently, learning to rank technique has attracted much attention. However, the lack of labeled training data seriously limits its application in real-world tasks. In this paper, we propose to break this bottleneck by considering the cross-domain “transfer...
 
Document Transformation for Multi-label Feature Selection in Text Categorization
Found in: Data Mining, IEEE International Conference on
By Weizhu Chen, Jun Yan, Benyu Zhang, Zheng Chen, Qiang Yang
Issue Date:October 2007
pp. 451-456
Feature selection on multi-label documents for automatic text categorization is an under-explored research area. This paper presents a systematic document transformation framework, whereby the multi-label documents are transformed into single-label documen...
 
Improving Text Classification by Using Encyclopedia Knowledge
Found in: Data Mining, IEEE International Conference on
By Pu Wang, Jian Hu, Hua-Jun Zeng, Lijun Chen, Zheng Chen
Issue Date:October 2007
pp. 332-341
The exponential growth of text documents available on the Internet has created an urgent need for accurate, fast, and general purpose text classification algorithms. However, the
 
Diverse Topic Phrase Extraction through Latent Semantic Analysis
Found in: Data Mining, IEEE International Conference on
By Jilin Chen, Jun Yan, Benyu Zhang, Qiang Yang, Zheng Chen
Issue Date:December 2006
pp. 834-838
We propose a novel algorithm for extracting diverse topic phrases in order to provide summary for large corpora. Previous works often ignore the importance of diversity and thus extract phrases crowded on some hot topics while failing to cover other less o...
 
Efficient Text Classification by Weighted Proximal SVM
Found in: Data Mining, IEEE International Conference on
By Dong Zhuang, Benyu Zhang, Qiang Yang, Jun Yan, Zheng Chen, Ying Chen
Issue Date:November 2005
pp. 538-545
In this paper, we present an algorithm that can classify large-scale text data with high classification quality and fast training speed. Our method is based on a novel extension of the proximal SVM model [3]. Previous studies on proximal SVM have focused on...
 
Web Information at Your Fingertips: Paper as an Interaction Metaphor
Found in: Computer
By Zheng Chen, Jian-Tao Sun, Xuedong Huang
Issue Date:March 2014
pp. 62-66
The prototype O system seamlessly integrates Web searching and browsing on touch-enabled tablets using two-handed gesture interaction, much like handling and marking content on a piece of paper. The system also leverages contextual information to infer use...
 
A Power Efficient and Compact Optical Interconnect for Network-on-Chip
Found in: IEEE Computer Architecture Letters
By Zheng Chen, Huaxi Gu, Yintang Yang, Luying Bai, Hui Li
Issue Date:January 2014
pp. 1-1
Optical interconnect is a promising alternative to substitute the electrical interconnect for intra-chip communications. The topology of optical Network-on-Chip (ONoC) has a great impact on the network performance. However, the size of ONoC is limited by t...
 
Using Flash to Tolerate Track Failures in RAID
Found in: 2014 IEEE 15th International Symposium on High-Assurance Systems Engineering (HASE)
By Zheng Chen, Allen McBride
Issue Date:January 2014
pp. 234-235
RAID systems are designed to tolerate disk failures, but latent track failures can prevent recovery. Erasure codes can tolerate latent track failures through schemes such as RAID 6. With traditional erasure codes, this resilience often requires high space ...
 
Study of Stock Prediction Based on Social Network
Found in: 2013 International Conference on Social Computing (SocialCom)
By Zheng Chen, Xiaoqing Du
Issue Date:September 2013
pp. 913-916
The study on the interactions between social media and financial markets is an interesting topic. This paper is to investigate this issue for stocks from the Shanghai/Shenzhen stock exchange, based on a popular online Chinese stock forum Guba.com.cn. Other...
 
Quantum Path Integral Inspired Query Sequence Suggestion for User Search Task Simplification
Found in: Data Mining Workshops, International Conference on
By Baojun Yue, Jun Yan, Heng Liang, Ning Liu, Lei Ji, Fengshan Bai, Zheng Chen
Issue Date:December 2010
pp. 647-654
Query suggestion algorithms, which aim to suggest a set of similar but independent queries to users, have been widely studied to simplify user searches. However, in many cases, the users will accomplish their search tasks through a sequence of search behav...
 
Realization of Parallel Ant Colony Algorithm Based on TBB Multi-core Platform
Found in: Information Technology and Applications, International Forum on
By Ni Li, Dongdong Gao, Guanghong Gong, Zheng Chen
Issue Date:July 2010
pp. 177-180
TBB (Threading Building Blocks) is currently a representative parallel computing platform for multi-core processors. The ant colony algorithm is used to solve combinatorial optimization problems of discrete-time systems. With the expansion of the problem scal...
 
Trace-Oriented Feature Analysis for Large-Scale Text Data Dimension Reduction
Found in: IEEE Transactions on Knowledge and Data Engineering
By Jun Yan, Ning Liu, Shuicheng Yan, Qiang Yang, Weiguo (Patrick) Fan, Wei Wei, Zheng Chen
Issue Date:July 2011
pp. 1103-1117
Dimension reduction for large-scale text data is attracting much attention nowadays due to the rapid growth of the World Wide Web. We can categorize those popular dimension reduction algorithms into two groups: feature extraction and feature selection algo...
 
Synthesizing Novel Dimension Reduction Algorithms in Matrix Trace Oriented Optimization Framework
Found in: Data Mining, IEEE International Conference on
By Jun Yan, Ning Liu, Shuicheng Yan, Qiang Yang, Zheng Chen
Issue Date:December 2009
pp. 598-606
Dimension Reduction (DR) algorithms are generally categorized into feature extraction and feature selection algorithms. In the past, few works have been done to contrast and unify the two algorithm categories. In this work, we introduce a matrix trace orie...
 
A Thread Partitioning Method for Speculative Multithreading
Found in: Scalable Computing and Communications; International Conference on Embedded Computing, International Conference on
By Xiaoyu Pan, Yinliang Zhao, Zheng Chen, Xuhao Wang, Yuanke Wei, Yanning Du
Issue Date:September 2009
pp. 285-290
Speculative Multithreading (SpMT) is an effective mechanism for parallelizing irregular programs that are hard to parallelize by conventional approaches. SpMT technology can be applied to exploit Thread-Level Parallelism effectively through allowing multiple threads exe...
 
Design of Embedded System API Function for PMAC
Found in: Information Engineering, International Conference on
By Miao Xingang, Wang Su, Cai Lingling, Yan Zheng, Chen Jiang
Issue Date:July 2009
pp. 3-6
Programmable Multi-Axis Controller (PMAC) is widely used in robot control systems. Most of them use an industrial PC (IPC) as the host computer, but this cannot meet the requirements of some welding robots, which need economical, miniaturization and infinite variety....
 
An Integrated Learning Resource Management System with Web Services
Found in: New Trends in Information and Service Science, International Conference on
By Yushun Li, Zheng Chen, Ronghuai Huang, Xiaochun Cheng
Issue Date:July 2009
pp. 863-868
In recent years, there are some new directions in learning resource management and sharing research field, which mainly focus on innovative methods to promote sharing and reusing learning resource in modular manner, to create system with capabilities suita...
 
Research on New Generation e-Learning System for Ubiquitous Learning
Found in: Information Technology and Applications, International Forum on
By Yushun Li, Ge Gao, Zheng Chen, Ronghuai Huang
Issue Date:May 2009
pp. 275-279
...trend of incorporating ubiquitous learning into mainstream of education. This demands new generation e-Learning system for learning anywhere, at any time, with any device. The paper introduces our on-going research efforts in the field. In the work, the con...
 
TOFA: Trace Oriented Feature Analysis in Text Categorization
Found in: Data Mining, IEEE International Conference on
By Jun Yan, Ning Liu, Qiang Yang, Weiguo Fan, Zheng Chen
Issue Date:December 2008
pp. 668-677
Dimension reduction for large-scale text data is attracting much attention lately due to the rapid growth of World Wide Web. We can consider dimension reduction algorithms in two categories: feature extraction and feature selection. An important problem re...
 
Learning the Latent Semantic Space for Ranking in Text Retrieval
Found in: Data Mining, IEEE International Conference on
By Jun Yan, Shuicheng Yan, Ning Liu, Zheng Chen
Issue Date:December 2008
pp. 1115-1120
Subspace learning techniques for text analysis, such as Latent Semantic Indexing (LSI), have been widely studied in the past decade. However, to our best knowledge, no previous study has leveraged the rank information for subspace learning in ranking tasks...
 
Web Query Prediction by Unifying Model
Found in: Data Mining Workshops, International Conference on
By Ning Liu, Jun Yan, Shuicheng Yan, Weiguo Fan, Zheng Chen
Issue Date:December 2008
pp. 436-441
Recently, many commercial products, such as Google Trends and Yahoo! Buzz, have been released to monitor the past search engine query frequency trend. However, little research has been devoted to predicting the upcoming query trend, which is of great importance...
 
Research on Learning Resource Sharing System with Ontology-Based Hierarchy Semantic Model
Found in: Computer Science and Software Engineering, International Conference on
By Yushun Li, Zheng Chen, Shenggang Yang, Jiangjian Ma, Ronghuai Huang
Issue Date:December 2008
pp. 772-776
Sharing, re-purposing of learning resources are directions of e-learning system, and the developing semantic technology has potential to provide more advanced solutions for these requirements. There have been many research efforts in this direction. Howeve...
 
Using Gene Ontology to Enhance Effectiveness of Similarity Measures for Microarray Data
Found in: Bioinformatics and Biomedicine, IEEE International Conference on
By Zheng Chen, Jian Tang
Issue Date:November 2008
pp. 66-71
Feature selection is a necessary processing step for class prediction using microarray expression data. Traditional methods select top-ranked genes in terms of their discriminative powers. This strategy unavoidably results in redundancy, whereby correlated ...
 
A Rapid Secret Sharing Scheme for Resource Constrained Environments
Found in: Embedded Computing, IEEE International Symposium on
By Zheng Chen, Xiao-Jing Wang, Sheng Cao, Dan Tang
Issue Date:October 2008
pp. 55-60
A method of rapid secret sharing scheme based on STAR codes is proposed. Due to the characteristic of array codes such as STAR codes, the computation in secret sharing scheme need only XOR operation on GF(2), which is distinguished with the modulation and ...
 
A New Approach to MANet Routing Based on Erasure Codes
Found in: Power Electronics and Intelligent Transportation System, Workshop on
By Zheng Chen, Xiaojing Wang, Sheng Cao, Dan Tang
Issue Date:August 2008
pp. 188-191
In this paper, we present a novel approach for mobile ad-hoc routing, which is called AOMDV-CB. The brand-new protocol is an extension of the well-known AOMDV (Ad Hoc on Demand Multi-Path Distance Vector) protocol, holds a potential to drastically increase...
 
A New Class of Highly Fault Tolerant Erasure Code for the Disk Array
Found in: Power Electronics and Intelligent Transportation System, Workshop on
By Dan Tang, Xiaojing Wang, Sheng Cao, Zheng Chen
Issue Date:August 2008
pp. 578-581
We present a new class of erasure codes of size n×n (n is a prime number) called T-code, a new family of simple, highly fault tolerant XOR-based erasure codes for storage systems (with fault tolerance up to 15). T-code is not maximum distance separable (MD...
 
Exploring Fault-tolerant Distributed Storage System using GE code
Found in: Embedded Software and Systems, Second International Conference on
By Zheng Chen, Xiaojing Wang, Yili Jin, Honglei Zhou
Issue Date:July 2008
pp. 142-148
Notice of Violation of IEEE Publication Principles
 
Local Word Bag Model for Text Categorization
Found in: Data Mining, IEEE International Conference on
By Wen Pu, Ning Liu, Shuicheng Yan, Jun Yan, Kunqing Xie, Zheng Chen
Issue Date:October 2007
pp. 625-630
Many text processing applications adopted the Bag of Words (BOW) model representation of documents, in which each document is represented as a vector of weighted terms or n-grams, and then cosine distance between two vectors is used as the similarity measu...
 
Similarity of Temporal Query Logs Based on ARIMA Model
Found in: Data Mining, IEEE International Conference on
By Ning Liu, Shuzhen Nong, Jun Yan, Benyu Zhang, Zheng Chen, Ying Li
Issue Date:December 2006
pp. 975-979
A challenging issue faced by modern information retrieval is that of determining and satisfying users' requirements relying only on very short text queries. In this paper, we propose an algorithm to find out related queries based on Auto-Regressive Integra...
 
A Novel Scalable Algorithm for Supervised Subspace Learning
Found in: Data Mining, IEEE International Conference on
By Jun Yan, Ning Liu, Benyu Zhang, Qiang Yang, Shuicheng Yan, Zheng Chen
Issue Date:December 2006
pp. 721-730
Subspace learning approaches aim to discover important statistical distribution on lower dimensions for high dimensional data. Methods such as Principal Component Analysis (PCA) do not make use of the class information, and Linear Discriminant Analysis (LD...
 
Subjectivity Categorization of Weblog with Part-of-Speech Based Smoothing
Found in: Data Mining, IEEE International Conference on
By Shen Huang, Jian-Tao Sun, Xuanhui Wang, Hua-Jun Zeng, Zheng Chen
Issue Date:December 2006
pp. 285-294
Experts from different domains try to mine users' comments on weblogs for different reasons such as politics or commerce. All these needs necessitate automatically distinguishing subjective weblog contents from objective ones, namely subjectivity categoriz...
 
Adding Semantics to Email Clustering
Found in: Data Mining, IEEE International Conference on
By Hua Li, Dou Shen, Benyu Zhang, Zheng Chen, Qiang Yang
Issue Date:December 2006
pp. 938-942
This paper presents a novel algorithm to cluster emails according to their contents and the sentence styles of their subject lines. In our algorithm, natural language processing techniques and frequent itemset mining techniques are utilized to automaticall...
 
Similarity of Temporal Query Logs Based on ARIMA Model
Found in: Data Mining Workshops, International Conference on
By Ning Liu, Shuzhen Nong, Jun Yan, Benyu Zhang, Zheng Chen, Ying Li
Issue Date:December 2006
pp. 366-370
A challenging issue faced by modern information retrieval is that of determining and satisfying users' requirements relying only on very short text queries. In this paper, we propose an algorithm to find out related queries based on Auto-Regressive Integra...
 
Multitype Features Coselection for Web Document Clustering
Found in: IEEE Transactions on Knowledge and Data Engineering
By Shen Huang, Zheng Chen, Yong Yu, Wei-Ying Ma
Issue Date:April 2006
pp. 448-459
Feature selection has been widely applied in text categorization and clustering. Compared to unsupervised selection, supervised feature selection is more successful in filtering out noise in most cases. However, due to a lack of label information, clusteri...
 
Text Classification Improved through Automatically Extracted Sequences
Found in: Data Engineering, International Conference on
By Dou Shen, Jian-Tao Sun, Qiang Yang, Hui Zhao, Zheng Chen
Issue Date:April 2006
pp. 121
We propose to use the n-multigram model to help the automatic text classification task. This model could automatically discover the latent semantic sequences contained in the document set of each category. Based on the n-multigram model and the n-gram lang...
 
Effective and Efficient Dimensionality Reduction for Large-Scale and Streaming Data Preprocessing
Found in: IEEE Transactions on Knowledge and Data Engineering
By Jun Yan, Benyu Zhang, Ning Liu, Shuicheng Yan, Qiansheng Cheng, Weiguo Fan, Qiang Yang, Wensi Xi, Zheng Chen
Issue Date:March 2006
pp. 320-333
Dimensionality reduction is an essential data preprocessing technique for large-scale and streaming data classification tasks. It can be used to improve both the efficiency and the effectiveness of classifiers. Traditional dimensionality reduction approach...
 
Adaptive Fuzzy Logic Controller with Rule-based Changeable Universe Of Discourse for a Nonlinear MIMO System
Found in: Intelligent Systems Design and Applications, International Conference on
By Yi Wang, Huiwen Deng, Zheng Chen
Issue Date:September 2005
pp. 8-13
The accurate input-output universe of discourse (UOD) on which membership functions are defined is hard to acquire, especially for nonlinear multi-input and multi-output (MIMO) systems, and control accuracy will reduce greatly in the steady state due to li...
 
Supervised semi-definite embedding for image manifolds
Found in: Multimedia and Expo, IEEE International Conference on
By Benyu Zhang, Jun Yan, Ning Liu, Qiansheng Cheng, Zheng Chen, Wei-Ying Ma
Issue Date:July 2005
pp. 4 pp.
Semi-definite embedding (SDE) has recently been proposed to maximize the sum of pairwise squared distances between outputs while the input data and outputs are locally isometric, i.e. it pulls the outputs as far apart as possible, subject to unfolding a...
 
Mining Ratio Rules Via Principal Sparse Non-Negative Matrix Factorization
Found in: Data Mining, IEEE International Conference on
By Chenyong Hu, Benyu Zhang, Shuicheng Yan, Qiang Yang, Jun Yan, Zheng Chen, Wei-Ying Ma
Issue Date:November 2004
pp. 407-410
Association rules are traditionally designed to capture statistical relationships among itemsets in a given database. To additionally capture the quantitative association knowledge, F. Korn et al. recently proposed a paradigm named Ratio Rules for quantifiabl...
 
Supervised Latent Semantic Indexing for Document Categorization
Found in: Data Mining, IEEE International Conference on
By Jian-Tao Sun, Zheng Chen, Hua-Jun Zeng, Yu-Chang Lu, Chun-Yi Shi, Wei-Ying Ma
Issue Date:November 2004
pp. 535-538
Latent Semantic Indexing (LSI) is a successful technology in information retrieval (IR) which attempts to explore the latent semantics implied by a query or a document through representing them in a dimension-reduced space. However, LSI is not optimal for ...
 
IRC: An Iterative Reinforcement Categorization Algorithm for Interrelated Web Objects
Found in: Data Mining, IEEE International Conference on
By Gui-Rong Xue, Dou Shen, Qiang Yang, Hua-Jun Zeng, Zheng Chen, Yong Yu, WenSi Xi, Wei-Ying Ma
Issue Date:November 2004
pp. 273-280
Most existing categorization algorithms deal with homogeneous Web data objects, and consider interrelated objects as additional features when taking the interrelationships with other types of objects into account. However, focusing on any single aspects of...
 
Improving Text Classification using Local Latent Semantic Indexing
Found in: Data Mining, IEEE International Conference on
By Tao Liu, Zheng Chen, Benyu Zhang, Wei-ying Ma, Gongyi Wu
Issue Date:November 2004
pp. 162-169
Latent Semantic Indexing (LSI) has been shown to be extremely useful in information retrieval, but it is not an optimal representation for text classification. It always drops the text classification performance when being applied to the whole training set...
 
TSSP: A Reinforcement Algorithm to Find Related Papers
Found in: Web Intelligence, IEEE / WIC / ACM International Conference on
By Shen Huang, Gui-Rong Xue, Ben-Yu Zhang, Zheng Chen, Yong Yu, Wei-Ying Ma
Issue Date:September 2004
pp. 117-123
Content analysis and citation analysis are two common methods in recommender systems. Compared with content analysis, citation analysis can discover more implicitly related papers. However, the citation-based methods may introduce more noise in citation gr...
 
GE-CKO: A Method to Optimize Composite Kernels for Web Page Classification
Found in: Web Intelligence, IEEE / WIC / ACM International Conference on
By Jian-Tao Sun, Ben-Yu Zhang, Zheng Chen, Yu-Chang Lu, Chun-Yi Shi, Wei-Ying Ma
Issue Date:September 2004
pp. 299-305
Most current research on Web page classification focuses on leveraging heterogeneous features such as plain text, hyperlinks and anchor texts in an effective and efficient way. The composite kernel method is one topic of interest among them. It first select...
 
CBC: Clustering Based Text Classification Requiring Minimal Labeled Data
Found in: Data Mining, IEEE International Conference on
By Hua-Jun Zeng, Xuan-Hui Wang, Zheng Chen, Hongjun Lu, Wei-Ying Ma
Issue Date:November 2003
pp. 443
Semi-supervised learning methods construct classifiers using both labeled and unlabeled training data samples. While unlabeled data samples can help to improve the accuracy of trained models to certain extent, existing methods still face difficulties when ...
 
A Unified Framework for Clustering Heterogeneous Web Objects
Found in: Web Information Systems Engineering, International Conference on
By Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma
Issue Date:December 2002
pp. 161
In this paper, we introduce a novel framework for clustering web data which is often heterogeneous in nature. As most existing methods often integrate heterogeneous data into a unified feature space, their flexibilities to explore and adjust contributing e...
 