• IEEE.org
  • IEEE CS Standards
  • Career Center
  • About Us
  • Subscribe to Newsletter

0

IEEE
CS Logo
  • MEMBERSHIP
  • CONFERENCES
  • PUBLICATIONS
  • EDUCATION & CAREER
  • VOLUNTEER
  • ABOUT
  • Join Us
CS Logo

0

IEEE Computer Society Logo
Sign up for our newsletter
FacebookTwitterLinkedInInstagramYoutube
IEEE COMPUTER SOCIETY
About UsBoard of GovernorsNewslettersPress RoomIEEE Support CenterContact Us
COMPUTING RESOURCES
Career CenterCourses & CertificationsWebinarsPodcastsTech NewsMembership
BUSINESS SOLUTIONS
Corporate PartnershipsConference Sponsorships & ExhibitsAdvertisingRecruitingDigital Library Institutional Subscriptions
DIGITAL LIBRARY
MagazinesJournalsConference ProceedingsVideo LibraryLibrarian Resources
COMMUNITY RESOURCES
GovernanceConference OrganizersAuthorsChaptersCommunities
POLICIES
PrivacyAccessibility StatementIEEE Nondiscrimination PolicyIEEE Ethics ReportingXML Sitemap

Copyright 2025 IEEE - All rights reserved. A public charity, IEEE is the world’s largest technical professional organization dedicated to advancing technology for the benefit of humanity.

  • Home
  • /Publications
  • /Tech News
  • /Research
  • Home
  • / ...
  • /Tech News
  • /Research

Leveraging Intel’s Coprocessors for Large-Scale SNP Detection

By IEEE Computer Society Team on
March 1, 2023

Using Xeon Phi Coprocessor for Large Scale SNP DetectionUsing Xeon Phi Coprocessor for Large Scale SNP DetectionDetecting Single Nucleotide Polymorphisms (SNPs) in genome analysis is a fundamental procedure. SNPs act as biological markers and are used by geneticists to identify genes associated with a particular disease. However, with up to five million SNPs in a single person’s genome, detecting and cataloging them is a prodigious task.

The most common tool for SNP detection, SOAPsnp, would require upward of a week to perform an analysis on a single typical human genome. This severely affects any downstream analyses, which reduces efficiency.

Recently, IEEE posted a paper entitled mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection. It introduces mSNP as an improved version of SOAPsnp that is able to handle large-scale SNP detection using Intel Xeon Phi coprocessors.

Redesign of the Essential Data Structures


Developing mSNP began with the redesign of SOAPsnp’s essential data structures. The development of a space-efficient data structure increases computing efficiency. Unnecessary access to memory is reduced, which significantly reduces the overall memory footprint. This improvement is especially important when you consider that memory would be accessed almost 420 trillion times during a whole human genome scan using SOAPsnp.


Want More Tech News? Subscribe to ComputingEdge Newsletter Today!


Leveraging a Parallel Framework


The inefficiencies of SOAPsnp were countered by a coordinated parallel framework to optimize the use of hardware. Normally, the CPU would need to wait for Xeon Phi to return results on each job, meaning it would lay idle, wasting computing power. However, by making data transfer asynchronous, the CPU can continue with the next job without waiting for results. This asynchronous configuration is possible because rather than utilizing Xeon Phi’s native mode, offload mode was adopted by mSNP. Additionally, Xeon Phi has a 512-bit wide VPU, so mSNP’s tailored operations and data structures can make optimal use of the wide VPU to improve data throughput. This is something SOAPsnp could not do without optimization.

Using a Window Division Strategy


In order to address load imbalance and achieve higher performance, mSNP uses a strategy of read-based window division. This improves throughput since different processes are able to load reads from various lines of input simultaneously. Better load balance and increased efficiency are achieved because the number of reads per process is more or less equal, and all processes can communicate at the same time.

mSNP has the boasting rights of being the first SNP detection tool empowered by Intel’s Xeon Phi. During testing, a 38x single thread speedup on CPU was achieved. Given the speed benefits researchers can reap from this advancement, this improvement is significant enough on its own. However, this was also done without experiencing any loss in precision. Thus the team was able to achieve an often elusive “twofer”: more speed without sacrificing quality. Additionally, mSNP was able to successfully scale to 4,096 nodes on Tianhe-2. These results demonstrate the efficiency and scalability mSNP offers for large-scale human genome SNP detection.

The whole article, mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection, can be downloaded and read from IEEE’s digital library.

LATEST NEWS
From Isolation to Innovation: Establishing a Computer Training Center to Empower Hinterland Communities
From Isolation to Innovation: Establishing a Computer Training Center to Empower Hinterland Communities
IEEE Uganda Section: Tackling Climate Change and Food Security Through AI and IoT
IEEE Uganda Section: Tackling Climate Change and Food Security Through AI and IoT
Blockchain Service Capability Evaluation (IEEE Std 3230.03-2025)
Blockchain Service Capability Evaluation (IEEE Std 3230.03-2025)
Autonomous Observability: AI Agents That Debug AI
Autonomous Observability: AI Agents That Debug AI
Disaggregating LLM Infrastructure: Solving the Hidden Bottleneck in AI Inference
Disaggregating LLM Infrastructure: Solving the Hidden Bottleneck in AI Inference
Get the latest news and technology trends for computing professionals with ComputingEdge
Sign up for our newsletter
Read Next

From Isolation to Innovation: Establishing a Computer Training Center to Empower Hinterland Communities

IEEE Uganda Section: Tackling Climate Change and Food Security Through AI and IoT

Blockchain Service Capability Evaluation (IEEE Std 3230.03-2025)

Autonomous Observability: AI Agents That Debug AI

Disaggregating LLM Infrastructure: Solving the Hidden Bottleneck in AI Inference

Copilot Ergonomics: UI Patterns that Reduce Cognitive Load

The Myth of AI Neutrality in Search Algorithms

Gen AI and LLMs: Rebuilding Trust in a Synthetic Information Age