The Community for Technology Leaders
Green Image
All published approaches to DNA sequencing by hybridization (SBH) consist of the biochemical acquisition of the spectrum of a target sequence (the set of its subsequences conforming to a given probing pattern) followed by the algorithmic reconstruction of the sequence from its spectrum. In the "standard” or "uniform” approach, the probing pattern is a string of length L and the length of reliably reconstructible sequences is known to be m_{len}=O(2^L). For a fixed microarray area, higher sequencing performance can be achieved by inserting nonprobing gaps ("wild-cards”) in the probing pattern. The reconstruction, however, must cope with the emergence of fooling probes due to the gaps and algorithmic failure occurs when the spectrum becomes too densely populated, although we can achieve m_{comp}=O(4^L). Despite the combinatorial success of gapped probing, all current approaches are based on a biochemically unrealistic spectrum-acquisition model (digital-spectrum). The reality of hybridization is much more complex. Departing from the conventional model, in this paper, we propose an alternative, called the analog-spectrum model, which more closely reflects the biochemical process. This novel modeling reestablishes probe length as the performance-governing factor, adopting "semidegenerate bases” as suitable emulators of currently inadequate universal bases. One important conclusion is that accurate biochemical measurements are pivotal to the success of SBH. The theoretical proposal presented in this paper should be a convincing stimulus for the needed biotechnological work.
DNA sequencing, sequencing-by-hybridization, microarrays, gapped probes, thermodynamics of hybridization, analog spectrum, semidegenerate bases.

F. P. Preparata, "Sequencing-by-Hybridization Revisited: The Analog-Spectrum Proposal," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 1, no. , pp. 46-52, 2004.
88 ms
(Ver 3.3 (11022016))