Discovering DataOps: A Comprehensive Review of Definitions, Use Cases, and Tools

Kiran Mainali, Lisa Ehrlinger, Johannes Himmelbauer, Mihhail Matskin

Research output: Chapter in Book/Report/Conference proceeding; Conference proceedings; peer-review

Abstract

Data management approaches have changed drastically in the past few years due to improved data availability and increasing interest in data analysis (e.g., artificial intelligence). The volume, velocity, and variety of data require novel and automated ways to "operate" this data. By analogy with software development, where DevOps is the de facto standard to operate code, DataOps is an emerging approach advocated by practitioners to tackle data management challenges for analytics. In this paper, we examine DataOps from the scientific perspective with a rigorous review of research and tools. As a result, we make the following three-fold contribution: we (1) outline definitions of DataOps and their ambiguities, (2) identify the extent to which DataOps covers different stages of the data lifecycle, and (3) provide a comprehensive overview of tools and their suitability for different stages of DataOps.
Original language: English
Title of host publication: DATA ANALYTICS 2021: The Tenth International Conference on Data Analytics
Editors: Sandjai Bhulai, Ivana Semanjski, Les Sztandera
Publisher: International Academy, Research, and Industry Association
Pages: 61-69
Number of pages: 9
ISBN (Print): 978-1-61208-891-4
Publication status: Published - Oct 2021

Fields of science

  • 102001 Artificial intelligence
  • 102010 Database systems
  • 102014 Information design
  • 102015 Information systems
  • 102019 Machine learning
  • 102022 Software development
  • 102025 Distributed systems
  • 102028 Knowledge engineering
  • 102033 Data mining
  • 102035 Data science
  • 509018 Knowledge management

JKU Focus areas

  • Digital Transformation
