|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Genomic Region Operation Kit for Flexible Processing of Deep Sequencing Data
PrePrint
ISSN: 1545-5963
| ASCII Text | x | ||
| Kristian Ovaska, Lauri Lyly, Biswajyoti Sahu, Olli A. Janne, Sampsa Hautaniemi, "Genomic Region Operation Kit for Flexible Processing of Deep Sequencing Data," IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 99, no. 1, pp. 1, , 5555. | |||
| BibTex | x | ||
| @article{ 10.1109/TCBB.2012.170, author = {Kristian Ovaska and Lauri Lyly and Biswajyoti Sahu and Olli A. Janne and Sampsa Hautaniemi}, title = {Genomic Region Operation Kit for Flexible Processing of Deep Sequencing Data}, journal ={IEEE/ACM Transactions on Computational Biology and Bioinformatics}, volume = {99}, number = {1}, issn = {1545-5963}, year = {5555}, pages = {1}, doi = {http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.170}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - JOUR JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics TI - Genomic Region Operation Kit for Flexible Processing of Deep Sequencing Data IS - 1 SN - 1545-5963 SP EP EPD - 1 A1 - Kristian Ovaska, A1 - Lauri Lyly, A1 - Biswajyoti Sahu, A1 - Olli A. Janne, A1 - Sampsa Hautaniemi, PY - 5555 KW - Bioinformatics KW - Genomics KW - Databases KW - Benchmark testing KW - Algebra KW - Software KW - Complexity theory KW - Software/Software Engineering KW - Computer Applications KW - Life and Medical Sciences VL - 99 JA - IEEE/ACM Transactions on Computational Biology and Bioinformatics ER - | |||
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.170
Web Extra: View Supplemental Material(PDF)
Computational analysis of data produced in deep sequencing experiments is challenging due to large data volumes and requirements for flexible analysis approaches. Here we present a mathematical formalism based on set algebra for frequently performed operations in deep sequencing data analysis to facilitate translation of biomedical research questions to language amenable for computational analysis. With the help of this formalism we implemented Genomic Region Operation Kit (GROK), which supports various deep sequencing related operations such as preprocessing, filtering, file conversion and sample comparison. GROK provides high level interfaces for R, Python, Lua and command line, as well as an extension C++ API. It supports major genomic file formats and allows storing custom genomic regions in efficient data structures such as red-black trees and SQL databases. To demonstrate the utility of GROK we have characterized the roles of two major transcription factors in prostate cancer using data from ten deep sequencing experiments. GROK is freely available with a user guide from http://csbi.ltdk.helsinki.fi/grok/.
Index Terms:
Bioinformatics,Genomics,Databases,Benchmark testing,Algebra,Software,Complexity theory,Software/Software Engineering,Computer Applications,Life and Medical Sciences
Citation:
Kristian Ovaska, Lauri Lyly, Biswajyoti Sahu, Olli A. Janne, Sampsa Hautaniemi, "Genomic Region Operation Kit for Flexible Processing of Deep Sequencing Data," IEEE/ACM Transactions on Computational Biology and Bioinformatics, 09 Jan. 2013. IEEE computer Society Digital Library. IEEE Computer Society, <http://doi.ieeecomputersociety.org/10.1109/TCBB.2012.170>
Usage of this product signifies your acceptance of the Terms of Use.

