DS 7200

Computation III - Distributed Computing

Course Description

Learning tools and concepts for computing on big data. Learn how to use Spark for large-scale analytics and machine learning. Spark is an open-source, general-purpose computing framework that is scalable and blazingly fast. Fundamental data types and concepts will be covered (e.g., resilient distributed datasets, DataFrames) along with Tools for data processing, storage, and retrieval, including Amazon Web Services (AWS).


  • Adam Tashman

     Rating

     Difficulty

     GPA

     Sections

    1

    Last Taught

    Fall 2024