|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue
Oct. 2012 (vol. 24 no. 10)
pp. 1789-1802
| ASCII Text | x | ||
| Brian Quanz, Jun (Luke) Huan, Meenakshi Mishra, "Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 10, pp. 1789-1802, Oct., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TKDE.2012.75, author = {Brian Quanz and Jun (Luke) Huan and Meenakshi Mishra}, title = {Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue}, journal ={IEEE Transactions on Knowledge and Data Engineering}, volume = {24}, number = {10}, issn = {1041-4347}, year = {2012}, pages = {1789-1802}, doi = {http://doi.ieeecomputersociety.org/10.1109/TKDE.2012.75}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE Transactions on Knowledge and Data Engineering TI - Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue IS - 10 SN - 1041-4347 SP1789 EP1802 EPD - 1789-1802 A1 - Brian Quanz, A1 - Jun (Luke) Huan, A1 - Meenakshi Mishra, PY - 2012 KW - Encoding KW - Vectors KW - Knowledge transfer KW - Kernel KW - Feature extraction KW - Estimation KW - Equations KW - low-quality data. KW - Knowledge transfer KW - transfer learning KW - feature extraction KW - sparse coding VL - 24 JA - IEEE Transactions on Knowledge and Data Engineering ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TKDE.2012.75
Effectively utilizing readily available auxiliary data to improve predictive performance on new modeling tasks is a key problem in data mining. In this research, the goal is to transfer knowledge between sources of data, particularly when ground-truth information for the new modeling task is scarce or is expensive to collect where leveraging any auxiliary sources of data becomes a necessity. Toward seamless knowledge transfer among tasks, effective representation of the data is a critical but yet not fully explored research area for the data engineer and data miner. Here, we present a technique based on the idea of sparse coding, which essentially attempts to find an embedding for the data by assigning feature values based on subspace cluster membership. We modify the idea of sparse coding by focusing the identification of shared clusters between data when source and target data may have different distributions. In our paper, we point out cases where a direct application of sparse coding will lead to a failure of knowledge transfer. We then present the details of our extension to sparse coding, by incorporating distribution distance estimates for the embedded data, and show that the proposed algorithm can overcome the shortcomings of the sparse coding algorithm on synthetic data and achieve improved predictive performance on a real world chemical toxicity transfer learning task.
Index Terms:
Encoding,Vectors,Knowledge transfer,Kernel,Feature extraction,Estimation,Equations,low-quality data.,Knowledge transfer,transfer learning,feature extraction,sparse coding
Citation:
Brian Quanz, Jun (Luke) Huan, Meenakshi Mishra, "Knowledge Transfer with Low-Quality Data: A Feature Extraction Issue," IEEE Transactions on Knowledge and Data Engineering, vol. 24, no. 10, pp. 1789-1802, Oct. 2012, doi:10.1109/TKDE.2012.75
Usage of this product signifies your acceptance of the Terms of Use.

