Approaches to distributed execution of scientific workflows in Kepler

Marcin Płóciennik, Tomasz Zok, Ilkay Altintas, Jianwu Wang, Daniel Crawl, David Abramson, Frederic Imbeaux, Bernard Guillerminet, Marcos Lopez-Caniego, Isabel Campos Plasencia, Wojciech Pych, Pawel Cieciela̧g, Bartek Palak, Michał Owsiak, Yann Frauel

    Research output: Contribution to journalArticleResearchpeer-review

    13 Citations (Scopus)

    Abstract

    The Kepler scientific workflow system enables creation, execution and sharing of workflows across a broad range of scientific and engineering disciplines while also facilitating remote and distributed execution of workflows. In this paper, we present and compare different approaches to distributed execution of workflows using the Kepler environment, including a distributed data-parallel framework using Hadoop and Stratosphere, and Cloud and Grid execution using Serpens, Nimrod/K and Globus actors. We also present real-life applications in computational chemistry, bioinformatics and computational physics to demonstrate the usage of different distributed computing capabilities of Kepler in executable workflows. We further analyze the differences of each approach and provide a guidance for their applications.

    Original languageEnglish
    Pages (from-to)281-302
    Number of pages22
    JournalFundamenta Informaticae
    Volume128
    Issue number3
    DOIs
    Publication statusPublished - 2013

    Keywords

    • distributed execution
    • Kepler
    • scientific workflow

    Cite this