Search For:

Displaying 1-25 out of 25 total
Continuous Dataflow Update Strategies for Mission-Critical Applications
Found in: 2013 IEEE 9th International Conference on eScience (eScience)
By Charith Wickramaarachchi,Yogesh Simmhan
Issue Date:October 2013
pp. 155-163
Continuous data flows complement scientific work-flows by allowing composition of real time data ingest and analytics pipelines to process data streams from pervasive sensors and
Cloud-Based Software Platform for Big Data Analytics in Smart Grids
Found in: Computing in Science & Engineering
By Yogesh Simmhan,Saima Aman,Alok Kumbhare,Rongyang Liu,Sam Stevens,Qunzhi Zhou,Viktor Prasanna
Issue Date:July 2013
pp. 38-47
This article focuses on a scalable software platform for the Smart Grid cyber-physical system using cloud technologies. Dynamic Demand Response (D²R) is a challenge-application to perform intelligent demand-side management and relieve peak load ...
Optimizations and Analysis of BSP Graph Processing Models on Public Clouds
Found in: 2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)
By Mark Redekopp,Yogesh Simmhan,Viktor K. Prasanna
Issue Date:May 2013
pp. 203-214
Large-scale graph analytics is a central tool in many fields, and exemplifies the size and complexity of Big Data applications. Recent distributed graph processing frameworks utilize the venerable Bulk Synchronous Parallel (BSP) model and promise scalabili...
Cryptonite: A Secure and Performant Data Repository on Public Clouds
Found in: 2012 IEEE 5th International Conference on Cloud Computing (CLOUD)
By Alok Kumbhare,Yogesh Simmhan,Viktor Prasanna
Issue Date:June 2012
pp. 510-517
Cloud storage has become immensely popular for maintaining synchronized copies of files and for sharing documents with collaborators. However, there is heightened concern about the security and privacy of Cloud-hosted data due to the shared infrastructure ...
Semantic Information Modeling for Emerging Applications in Smart Grid
Found in: Information Technology: New Generations, Third International Conference on
By Qunzhi Zhou,Sreedhar Natarajan,Yogesh Simmhan,Viktor Prasanna
Issue Date:April 2012
pp. 775-782
Smart Grid modernizes power grid by integrating digital and information technologies. Millions of smart meters, intelligent appliances and communication infrastructures are under deployment allowing advanced IT applications to be developed to secure and ma...
Improving Energy Use Forecast for Campus Micro-grids Using Indirect Indicators
Found in: Data Mining Workshops, International Conference on
By Saima Aman,Yogesh Simmhan,Viktor K. Prasanna
Issue Date:December 2011
pp. 389-397
The rising global demand for energy is best addressed by adopting and promoting sustainable methods of power consumption. We employ an informatics approach towards forecasting the energy consumption patterns in a university campus micro-grid which can be u...
An Analysis of Security and Privacy Issues in Smart Grid Software Architectures on Clouds
Found in: Cloud Computing, IEEE International Conference on
By Yogesh Simmhan,Alok Gautam Kumbhare,Baohua Cao,Viktor Prasanna
Issue Date:July 2011
pp. 582-589
Power utilities globally are increasingly upgrading to Smart Grids that use bi-directional communication with the consumer to enable an information-driven approach to distributed energy management. Clouds offer features well suited for Smart Grid software ...
Towards Reliable, Performant Workflows for Streaming-Applications on Cloud Platforms
Found in: Cluster Computing and the Grid, IEEE International Symposium on
By Daniel Zinn, Quinn Hart, Timothy McPhillips, Bertram Ludäscher, Yogesh Simmhan, Michail Giakkoupis, Viktor K. Prasanna
Issue Date:May 2011
pp. 235-244
Scientific workflows are commonplace in eScience applications. Yet, the lack of integrated support for data models, including streaming data, structured collections and files, is limiting the ability of workflows to support emerging applications in energy ...
Bridging the Gap between Desktop and the Cloud for eScience Applications
Found in: 2010 IEEE 3rd International Conference on Cloud Computing (CLOUD 2010)
By Yogesh Simmhan,Catharine van Ingen,Girish Subramanian, Jie Li
Issue Date:July 2010
pp. 474-481
The widely discussed scientific data deluge creates a need to computationally scale out eScience applications beyond the local desktop and cope with variable loads over time. Cloud computing offers a scalable, economic, on-demand model well matched to thes...
Building Reliable Data Pipelines for Managing Community Data Using Scientific Workflows
Found in: e-Science and Grid Computing, International Conference on
By Yogesh Simmhan, Catharine van Ingen, Alex Szalay, Roger Barga, Jim Heasley
Issue Date:December 2009
pp. 321-328
The growing amount of scientific data from sensors and field observations is posing a challenge to “data valets” responsible for managing them in data repositories. These repositories built on commodity clusters need to reliably ingest data continuously an...
The Trident Scientific Workflow Workbench
Found in: eScience, IEEE International Conference on
By Roger Barga, Jared Jackson, Nelson Araujo, Dean Guo, Nitin Gautam, Yogesh Simmhan
Issue Date:December 2008
pp. 317-318
In our demonstration we present Trident, a scientific workflow workbench built on top of a commercial workflow system to leverage existing functionality to the extent possible. Trident is being developed in collaboration with the scientific computing commu...
On Building Scientific Workflow Systems for Data Management in the Cloud
Found in: eScience, IEEE International Conference on
By Yogesh Simmhan, Roger Barga, Catharine van Ingen, Ed Lazowska, Alex Szalay
Issue Date:December 2008
pp. 434-435
Scientific workflows have become an archetype to model in silico experiments in the Cloud by scientists. There is a class of workflows that are used to by
End-to-End Scientific Data Management Using Workflows
Found in: Services, IEEE Congress on
By Yogesh Simmhan
Issue Date:July 2008
pp. 472-473
Workflows have evolved as the natural tool for scientists to model their eScience experiments. With the scientific world producing data at an explosive rate, workflows have an important part to play in the end to end management of scientific data. To illus...
On Building Parallel & Grid Applications: Component Technology and Distributed Services
Found in: Challenges of Large Applications in Distributed Environments, International Workshop on
By Dennis Gannon, Sriram Krishnan, Liang Fang, Gopi Kandaswamy, Yogesh Simmhan, Aleksander Slominski
Issue Date:June 2004
pp. 44
Software Component Frameworks are well known in the commercial business application world and now this technology is being explored with great interest as a way to build large-scale scientific application on parallel computers. In the case of Grid systems,...
GrayWulf: Scalable Software Architecture for Data Intensive Computing
Found in: Hawaii International Conference on System Sciences
By Yogesh Simmhan, Roger Barga, Catharine van Ingen, Maria Nieto-Santisteban, Lazslo Dobos, Nolan Li, Michael Shipway, Alexander S. Szalay, Sue Werner, Jim Heasley
Issue Date:January 2009
pp. 1-10
Big data presents new challenges to both cluster infrastructure software and parallel application design. We present a set of software services and design principles for data intensive computing with petabyte data sets, named GrayWulf. These services are i...
PLAStiCC: Predictive Look-Ahead Scheduling for Continuous Dataflows on Clouds
Found in: 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)
By Alok Gautam Kumbhare,Yogesh Simmhan,Viktor K. Prasanna
Issue Date:May 2014
pp. 344-353
Scalable stream processing and continuous dataflow systems are gaining traction with the rise of big data due to the need for processing high velocity data in near real time. Unlike batch processing systems such as MapReduce and workflows, static schedulin...
Provenance Information Model of Karma Version 3
Found in: Services, IEEE Congress on
By Bin Cao, Beth Plale, Girish Subramanian, Ed Robertson, Yogesh Simmhan
Issue Date:July 2009
pp. 348-351
Provenance that captures e-Science activity has long term value only if the right amount and kind of information is collected. In this paper, we propose a two-layer model for representing provenance information capable of representing both execution inform...
Exploiting application dynamism and cloud elasticity for continuous dataflows
Found in: Proceedings of SC13: International Conference for High Performance Computing, Networking, Storage and Analysis (SC '13)
By Alok Kumbhare, Yogesh Simmhan, Viktor K. Prasanna
Issue Date:November 2013
pp. 1-12
Contemporary continuous dataflow systems use elastic scaling on distributed cloud resources to handle variable data rates and to meet applications' needs while attempting to maximize resource utilization. However, virtualized clouds present an added challe...
Toward data-driven demand-response optimization in a campus microgrid
Found in: Proceedings of the Third ACM Workshop on Embedded Sensing Systems for Energy-Efficiency in Buildings (BuildSys '11)
By Qunzhi Zhou, Saima Aman, Sreedhar Natarajan, Viktor Prasanna, Wei Yin, Yogesh Simmhan
Issue Date:November 2011
pp. 41-42
We describe and demonstrate a prototype software architecture to support data-driven demand response optimization (DR) in the USC campus microgrid, as part of the Los Angeles Smart Grid Demonstration Project. The architecture includes a semantic informatio...
Stargazing through a digital veil: managing a large scale sky survey using distributed databases on HPC clusters
Found in: Proceedings of the first annual workshop on High performance computing meets databases (HPCDB '11)
By Alex Szalay, Catharine van Ingen, Jim Heasley, Yogesh Simmhan
Issue Date:November 2011
pp. 33-36
The Sloan Digital Sky Survey established the use of relational databases for the scans and cone searches common to astronomy analyses. The Pan-STARRS project scales up SDSS by melding HPC clusters with hierarchical and spatially partitioned distributed dat...
Designing a secure storage repository for sharing scientific datasets using public clouds
Found in: Proceedings of the second international workshop on Data intensive computing in the clouds (DataCloud-SC '11)
By Alok Gautam Kumbhare, Viktor Prasanna, Yogesh Simmhan
Issue Date:November 2011
pp. 31-40
As Cloud platforms gain increasing traction among scientific and business communities for outsourcing storage, computing and content delivery, there is also growing concern about the associated loss of control over private data hosted in the Cloud. In this...
Towards an inexact semantic complex event processing framework
Found in: Proceedings of the 5th ACM international conference on Distributed event-based system (DEBS '11)
By Qunzhi Zhou, Viktor Prasanna, Yogesh Simmhan
Issue Date:July 2011
pp. 401-402
Complex event processing (CEP) deals with detecting real-time situations, represented as event patterns, from among an event cloud. The state-of-the-art CEP systems process events as plain data tuples and are limited to detect precisely defined patterns. E...
Adaptive rate stream processing for smart grid applications on clouds
Found in: Proceedings of the 2nd international workshop on Scientific cloud computing (ScienceCloud '11)
By Baohua Cao, Michail Giakkoupis, Viktor K. Prasanna, Yogesh Simmhan
Issue Date:June 2011
pp. 33-38
Pervasive smart meters that continuously measure power usage by consumers within a smart (power) grid are providing utilities and power systems researchers with unprecedented volumes of information through streams that need to be processed and analyzed in ...
Comparison of resource platform selection approaches for scientific workflows
Found in: Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing (HPDC '10)
By Lavanya Ramakrishnan, Yogesh Simmhan
Issue Date:June 2010
pp. 445-450
Cloud computing is increasingly considered as an additional computational resource platform for scientific workflows. The cloud offers opportunity to scale-out applications from desktops and local cluster resources. Each platform has different properties (...
Efficient scheduling of scientific workflows in a high performance computing cluster
Found in: Proceedings of the 6th international workshop on Challenges of large applications in distributed environments (CLADE '08)
By Alex Szalay, Dan Fay, Dean Guo, Roger S. Barga, Steven Newhouse, Yogesh Simmhan
Issue Date:June 2008
pp. 1-6
The scientific computing community, especially academia is clearly in need of technology to handle and organize the 1-100+ Terabyte datasets coming from computer simulations and scientific instrumentation. In this paper we briefly describe GrayWulf, an exe...