Course Details
Course Outline
1 - Day 1
Overview of Big Data, Apache Hadoop, and the Benefits of Amazon EMR
Amazon EMR Architecture
Using Amazon EMR
Launching and Using an Amazon EMR Cluster
Hadoop Programming Frameworks
2 - Day 2
Using Hive for Advertising Analytics
Using Streaming for Life Sciences Analytics
Overview: Spark and Shark for In-Memory Analytics
Using Spark and Shark for In-Memory Analytics
Managing Amazon EMR Costs
Overview of Amazon EMR Security
Data Ingestion, Transfer, and Compression
Using Amazon Kinesis for Real-Time Big Data Processing
3 - Day 3
Using Amazon Kinesis for Real-Time Big Data Processing
AWS Data Storage Options
Using DynamoDB with Amazon EMR
Overview: Amazon Redshift and Big Data
Using Amazon Redshift for Big Data
Visualizing and Orchestrating Big Data
Using Tableau Desktop or Jaspersoft BI to Visualize Big Data
Actual course outline may vary depending on offering center. Contact your sales representative for more information.
Who is it For?
Target Audience
This course is intended for:
Individuals responsible for designing and implementing big data solutions, namely Solutions Architects and SysOps Administrators.
Data Scientists and Data Analysts interested in learning about big data solutions on AWS.
Other Prerequisites
We recommend that attendees of this course meet the following prerequisites:
Basic familiarity with big data technologies, including Apache Hadoop, HDFS, and SQL/NoSQL querying.
Students should complete the Big Data Technology Fundamentals web-based training or have equivalent experience.
Working knowledge of core AWS services and public cloud implementation.
Students should complete the AWS Essentials course or have equivalent experience.
Basic understanding of data warehousing, relational database systems, and database design.