CSDL Home IEEE/ACM Transactions on Computational Biology and Bioinformatics 2005 vol.2 Issue No.03 - July-September
Issue No.03 - July-September (2005 vol.2)
DOI Bookmark: http://doi.ieeecomputersociety.org/10.1109/TCBB.2005.44
Selection of machine learning techniques requires a certain sensitivity to the requirements of the problem. In particular, the problem can be made more tractable by deliberately using algorithms that are biased toward solutions of the requisite kind. In this paper, we argue that recurrent neural networks have a natural bias toward a problem domain of which biological sequence analysis tasks are a subset. We use experiments with synthetic data to illustrate this bias. We then demonstrate that this bias can be exploitable using a data set of protein sequences containing several classes of subcellular localization targeting peptides. The results show that, compared with feed forward, recurrent neural networks will generally perform better on sequence analysis tasks. Furthermore, as the patterns within the sequence become more ambiguous, the choice of specific recurrent architecture becomes more critical.
Index Terms- Machine learning, neural network architecture, recurrent neural network, bias, biological sequence analysis, motif, subcellular localization, pattern recognition, classifier design.
John Hawkins, "The Applicability of Recurrent Neural Networks for Biological Sequence Analysis", IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol.2, no. 3, pp. 243-253, July-September 2005, doi:10.1109/TCBB.2005.44