• Products

    Overview

    • Features
    • Pricing

    Featured Products

    • Red Hat OpenShift Container Platform

      Build, deploy and manage your applications across cloud- and on-premise infrastructure

    • Red Hat OpenShift Dedicated

      Single-tenant, high-availability Kubernetes clusters in the public cloud

    • Red Hat OpenShift Online

      The fastest way for developers to build, host and scale applications in the public cloud

    • All products
  • Learn

    Learn

    • What is OpenShift
    • Get started
    • Partners
    • Customer success stories
    • Blog
    • Resources

    Technology Topics

    • Knative
    • Security
    • Kubernetes
    • Service Brokers
  • Community
    • OpenShift Commons
    • Open Source (OKD)
    • Startups
    • Grants
  • Support
    • Help Center
    • OpenShift Docs
  • Free Trial
  • Log In

  1. Docs »
  2. AI/ML »
  3. Data Engineering with Open Data Hub Workshop
    • Home
  • AI/ML
    • AI/ML Workflows on OpenShift
    • Data Engineering with Open Data Hub Workshop
  • AppDev
    • Couchbase Cluster with OpenShift
    • DevOps with OpenShift
    • Getting Started with OpenShift for Developers
    • Helm 3 in Action
    • odo Developer CLI
    • OpenShift Cloud Native Development Workshop
    • OpenShift Pipelines
    • Red Hat OpenShift Service Mesh in Action Workshop
  • GitOps
    • Getting Started with ArgoCD
    • Using Tekton and ArgoCD
  • Install/Multi-Cloud
    • Azure IPI
    • Bare Metal UPI
    • Disconnected Install
    • Google Cloud IPI
    • Installing a Windows Node
    • RHV IPI
    • vSphere IPI
    • vSphere UPI
  • Management/Ops
    • Cluster Application Migration
    • Kubernetes Operators
    • OpenShift and Container Storage for Admins
    • OpenShift Metering
    • OpenShift Virtualization
  • Security
    • Synopsys Black Duck for OpenShift Workshop
    • Cyberark Secrets Management for OpenShift Workshop
    • Snyk for OpenShift Workshop
    • Prisma Cloud for OpenShift Workshop
    • Hashicorp Vault for OpenShift Workshop

This workshop is designed to show data engineers and system administrators how to architect a data infrastructure service using the Open Data Hub. Our goal is for engineers who are interested in developing platforms for the ML and Analytics needs to walk away having learned data wrangling in a Hybrid Cloud environment with Spark and JupyterHub.

Unlike some other workshops, the entirety of the activity is conducted within JupyterHub.

Workshop

For conducting this workshop using the Red Hat Product Demo System (RHPDS), you will find it in the workshops section of the catalog entitled OCP4 Workshop Data Engineering.

Source

The original source content is available here. If you wish to install or consume this workshop in your own cluster, the source repository has those instructions.

Red Hat

Copyright © 2019 Red Hat, Inc.

Privacy statement Terms of use All policies and guidelines