This course helps you prepare for the AWS Certified Big Data - Specialty exam by taking a deep dive into several data-driven use cases. It is intended for individuals with a Cloud Practitioner or Associate-level AWS certification and two or more years of experience performing complex big data analysis.
In this course, you will learn how to:
- Fit AWS solutions inside a big data ecosystem
- Leverage Apache Hadoop in the context of Amazon EMR
- Identify the components of an Amazon EMR cluster, then launch and configure an Amazon EMR cluster
- Use common programming frameworks available for Amazon EMR, including Hive, Pig, and streaming
- Improve the ease of use of Amazon EMR by using Hadoop User Experience (Hue)
- Use in-memory analytics with Apache Spark on Amazon EMR
- Choose appropriate AWS data storage options
- Identify the benefits of using Amazon Kinesis for near real-time big data processing
- Leverage Amazon Redshift to efficiently store and analyze data
- Understand and manage costs and security for a big data solution
- Identify options for ingesting, transferring, and compressing data
- Leverage Amazon Athena for ad hoc query analytics
- Use AWS Glue to automate extract, transform, and load (ETL) workloads
- Use Amazon QuickSight to visualize data and query results