|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Distinguishing Endogenous Retroviral LTRs from SINE Elements Using Features Extracted from Evolved Side Effect Machines
Nov.-Dec. 2012 (vol. 9 no. 6)
pp. 1676-1689
| ASCII Text | x | ||
| W. Ashlock, S. Datta, "Distinguishing Endogenous Retroviral LTRs from SINE Elements Using Features Extracted from Evolved Side Effect Machines," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 6, pp. 1676-1689, Nov.-Dec., 2012. | |||
| BibTex | x | ||
| @article{ 10.1109/TCBB.2012.116, author = {W. Ashlock and S. Datta}, title = {Distinguishing Endogenous Retroviral LTRs from SINE Elements Using Features Extracted from Evolved Side Effect Machines}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {9}, number = {6}, issn = {1545-5963}, year = {2012}, pages = {1676-1689}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.116}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics TI - Distinguishing Endogenous Retroviral LTRs from SINE Elements Using Features Extracted from Evolved Side Effect Machines IS - 6 SN - 1545-5963 SP1676 EP1689 EPD - 1676-1689 A1 - W. Ashlock, A1 - S. Datta, PY - 2012 KW - proteins KW - biology computing KW - feature extraction KW - genetic algorithms KW - genetics KW - genomics KW - molecular biophysics KW - molecular configurations KW - biologists KW - endogenous retroviral LTRs KW - SINE elements KW - feature extraction KW - evolved side effect machines KW - DNA sequences KW - biological features KW - genomic sequences KW - classifier training KW - transposable elements KW - gene expression KW - genome structure KW - genetic diversity KW - genetic markers KW - protein codes KW - structural characteristics KW - Bioinformatics KW - Genomics KW - Genetic algorithms KW - DNA KW - Feature extraction KW - Machine learning KW - side effect machines KW - Endogenous retroviruses KW - LTR retrotransposons KW - SINE elements KW - feature evaluation and selection KW - machine learning KW - evolutionary computing and genetic algorithms VL - 9 JA - IEEE/ACM Transactions on Computational Biology and Bioinformatics ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.116
Side effect machines produce features for classifiers that distinguish different types of DNA sequences. They have the, as yet unexploited, potential to give insight into biological features of the sequences. We introduce several innovations to the production and use of side effect machine sequence features. We compare the results of using consensus sequences and genomic sequences for training classifiers and find that more accurate results can be obtained using genomic sequences. Surprisingly, we were even able to build a classifier that distinguished consensus sequences from genomic sequences with high accuracy, suggesting that consensus sequences are not always representative of their genomic counterparts. We apply our techniques to the problem of distinguishing two types of transposable elements, solo LTRs and SINEs. Identifying these sequences is important because they affect gene expression, genome structure, and genetic diversity, and they serve as genetic markers. They are of similar length, neither codes for protein, and both have many nearly identical copies throughout the genome. Being able to efficiently and automatically distinguish them will aid efforts to improve annotations of genomes. Our approach reveals structural characteristics of the sequences of potential interest to biologists.
Index Terms:
proteins,biology computing,feature extraction,genetic algorithms,genetics,genomics,molecular biophysics,molecular configurations,biologists,endogenous retroviral LTRs,SINE elements,feature extraction,evolved side effect machines,DNA sequences,biological features,genomic sequences,classifier training,transposable elements,gene expression,genome structure,genetic diversity,genetic markers,protein codes,structural characteristics,Bioinformatics,Genomics,Genetic algorithms,DNA,Feature extraction,Machine learning,side effect machines,Endogenous retroviruses,LTR retrotransposons,SINE elements,feature evaluation and selection,machine learning,evolutionary computing and genetic algorithms
Citation:
W. Ashlock, S. Datta, "Distinguishing Endogenous Retroviral LTRs from SINE Elements Using Features Extracted from Evolved Side Effect Machines," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 9, no. 6, pp. 1676-1689, Nov.-Dec. 2012, doi:10.1109/TCBB.2012.116
Usage of this product signifies your acceptance of the Terms of Use.

