Learn the tools and concepts for computing on big data, including how to use Spark for large-scale analytics and machine learning. Spark is an open-source, general-purpose computing framework built for scalable, fast, in-memory processing across clusters. The course covers fundamental data types and concepts (e.g., resilient distributed datasets, DataFrames), along with tools for data processing, storage, and retrieval, including Amazon Web Services (AWS).