|
| This Article | ||
| ||
| Share | ||
| Bibliographic References | ||
| Add to: | ||
| | ||
| Search | ||
| ||
Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007)
An Efficient Technique for Mining Approximately Frequent Substring Patterns
Omaha, Nebraska, USA
October 28-October 31
ISBN: 0-7695-3033-8
| ASCII Text | x | ||
| Xiaonan Ji, James Bailey, "An Efficient Technique for Mining Approximately Frequent Substring Patterns," 2012 IEEE 12th International Conference on Data Mining Workshops, pp. 325-330, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007), 2007. | |||
| BibTex | x | ||
| @article{ 10.1109/ICDMW.2007.121, author = {Xiaonan Ji and James Bailey}, title = {An Efficient Technique for Mining Approximately Frequent Substring Patterns}, journal ={2012 IEEE 12th International Conference on Data Mining Workshops}, volume = {0}, year = {2007}, isbn = {0-7695-3033-8}, pages = {325-330}, doi = {http://doi.ieeecomputersociety.org/10.1109/ICDMW.2007.121}, publisher = {IEEE Computer Society}, address = {Los Alamitos, CA, USA}, } | |||
| RefWorks Procite/RefMan/Endnote | x | ||
| TY - CONF JO - 2012 IEEE 12th International Conference on Data Mining Workshops TI - An Efficient Technique for Mining Approximately Frequent Substring Patterns SN - 0-7695-3033-8 SP325 EP330 A1 - Xiaonan Ji, A1 - James Bailey, PY - 2007 VL - 0 JA - 2012 IEEE 12th International Conference on Data Mining Workshops ER - | |||
Sequential patterns are used to discover knowledge in a wide range of applications. However, in many scenar- ios pattern quality can be low, due to short lengths or low supports. Furthermore, for dense datasets such as proteins, most of the sequential pattern mining algorithms return a tremendously large number of patterns, which are difficult to process and analyze. However, by relaxing the defini- tion of frequency and allowing some mismatches, it is pos- sible to discover higher quality patterns. We call these pat- terns Frequent Approximate Substrings or FAS-patterns and we introduce an algorithm called FAS-Miner, to handle the mining task efficiently. The experiments on real-world pro- tein and DNA datasets show that FAS-Miner can discover patterns of much longer lengths and higher supports than standard sequential mining approaches.
Citation:
Xiaonan Ji, James Bailey, "An Efficient Technique for Mining Approximately Frequent Substring Patterns," icdmw, pp.325-330, Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007), 2007
Usage of this product signifies your acceptance of the Terms of Use.
