Setup Menus in Admin Panel

Welcome to Introduction to SAS and Hadoop. I’m David Ghan. I’ve been a technical training consultant with SAS since 1999, and I teach a wide variety of courses covering SAS programming and SAS applications for data management. In this course, you’ll learn how to use SAS programming methods to read, write, and manipulate Hadoop data.

In Chapter 1, I’ll introduce you to key concepts on Hadoop, and give you an overview of the various SAS technologies available for managing and analyzing data in Hadoop.

In Chapter 2, we’ll look at Base SAS methods for interacting with Hadoop data. These include reading and writing raw data with the DATA step, copying files between the SAS server and Hadoop, managing the Hadoop file system, and executing MapReduce and Pig programs from SAS by using the HADOOP procedure.

Then, in Chapter 3, I’ll show you how to use the SQL pass-through facility to interact with the Hive tables in Hadoop. You’ll learn how to submit native Hive SQL query code to Hadoop from a SAS program, and manage the results of those queries when they return to SAS. You’ll also learn how you can submit all other types of HiveQL statements from SAS programs to manage this Hive system in any way that you need to.

In Chapter 4, you’ll learn how to use the LIBNAME method to connect to a Hive database and treat Hive tables as if they are SAS data sets in all your SAS programs. So that you can maximize performance, I’ll describe which SAS program elements convert into processes that execute in Hadoop, and I’ll show you techniques to evaluate how much processing is done by Hadoop.

In Chapter 5, you’ll learn ways you can structure Hive tables to enable you to execute certain queries with better performance.

Finally, in Chapter 6, I’ll give you a brief overview of additional SAS and Hadoop technologies, including the SAS DS2 procedure, SAS High-Performance Analytics, SAS LASR Analytic Server, and SAS In-Memory Statistics.

Let’s get started.

Course Curriculum

Introduction to SAS and Hadoop Course Overview
Introduction to SAS and Hadoop Course Overview 00:00:00
1.1a Introduction and Creating Course Data Chapter Overview
1.1a Introduction and Creating Course Data Chapter Overview 00:00:00
1.2a- What Is Hadoop
1.2a- What Is Hadoop 00:00:00
1.2b- Hadoop Distributed File System
1.2b- Hadoop Distributed File System 00:00:00
1.2c-MapReduce Distributed Processing
1.2c-MapReduce Distributed Processing 00:00:00
1.2d- Executing Programs That Use MapReduce
1.2d- Executing Programs That Use MapReduce 00:00:00
1.2e-The Hadoop Ecosystem
1.2e-The Hadoop Ecosystem 00:00:00
1.2f- Characteristics of Hadoop
1.2f- Characteristics of Hadoop 00:00:00
1.2g- Hadoop Distributions
1.2g- Hadoop Distributions 00:00:00
1.2h- SAS Foundation Interfaces and Additional SAS Technologies for Hadoop
1.2h- SAS Foundation Interfaces and Additional SAS Technologies for Hadoop 00:00:00
1.2i- Course Image for Demos and Exercises
1.2i- Course Image for Demos and Exercises 00:00:00
1.3a-Course Logistics
1.3a-Course Logistics 00:00:00
1.3b- Demo Creating the Course Data
1.3b- Demo Creating the Course Data 00:00:00
2.1a- Chapter Overview
2.1a- Chapter Overview 00:00:00
2.2a- Base SAS Tools for Hadoop
2.2a- Base SAS Tools for Hadoop 00:00:00
2.2b- Configuration Files Required by SAS
2.2b- Configuration Files Required by SAS 00:00:00
2.3a- Business Scenario
2.3a- Business Scenario 00:00:00
2.3b-HADOOP Procedure Syntax
2.3b-HADOOP Procedure Syntax 00:00:00
2.3c- HDFS Statements in the HADOOP Procedure
2.3c- HDFS Statements in the HADOOP Procedure 00:00:00
2.3d- Moving a File from SAS to Hadoop
2.3d- Moving a File from SAS to Hadoop 00:00:00
2.3f- Executing MapReduce Code
2.3f- Executing MapReduce Code 00:00:00
2.3g- FILENAME Statement for Hadoop
2.3g- FILENAME Statement for Hadoop 00:00:00
2.3i- Using Hue to Browse the Hadoop File System
2.3i- Using Hue to Browse the Hadoop File System 00:00:00
2.3j- Reading a Hadoop File with a DATA Step
2.3j- Reading a Hadoop File with a DATA Step 00:00:00
2.3k- Demo Base SAS Methods to Read and Write Hadoop Files and Execute MapReduce Code, Part 1
2.3k- Demo Base SAS Methods to Read and Write Hadoop Files and Execute MapReduce Code, Part 1 00:00:00
2.3l- Demo Base SAS Methods to Read and Write Hadoop Files and Execute MapReduce Code, Part 2
2.3l- Demo Base SAS Methods to Read and Write Hadoop Files and Execute MapReduce Code, Part 2 00:00:00
2.4a- Executing Pig Code with the HADOOP Procedure
2.4a- Executing Pig Code with the HADOOP Procedure 00:00:00
2.4b Demo Executing Pig Code with PROC HADOOP
2.4b Demo Executing Pig Code with PROC HADOOP 00:00:00
3.1a- Chapter Overview
3.1a- Using the SQL Pass-Through Facility Chapter Overview 00:00:00
3.2a- SQL Pass-Through Methods and Syntax
3.2a- SQL Pass-Through Methods and Syntax 00:00:00
3.2c- Review of the Activity
3.2c- Review of the Activity 00:00:00
3.3a Querying Hadoop Hive Metadata
3.3a Querying Hadoop Hive Metadata 00:00:00
3.4a- Querying Hive Table Data
3.4a- Querying Hive Table Data 00:00:00
3.4b- Joining Multiple Tables
3.4b- Joining Multiple Tables 00:00:00
3.4d- Review of the Activity
3.4d- Review of the Activity 00:00:00
3.4e- Using SAS Features with Hive Results
3.4e- Using SAS Features with Hive Results 00:00:00
3.4f- Creating SAS Tables and Views
3.4f- Creating SAS Tables and Views 00:00:00
3.4g- A Limitation of Views
3.4g- A Limitation of Views 00:00:00
3.4h- Advantages and Disadvantages of SQL Pass-Through
3.4h- Advantages and Disadvantages of SQL Pass-Through 00:00:00
3.5a- What are Hive Tables
3.5a- What are Hive Tables 00:00:00
3.5b- Executing HiveQL Non-Query Statements
3.5b- Executing HiveQL Non-Query Statements 00:00:00
3.5c- Creating Hive Tables
3.5c- Creating Hive Tables 00:00:00
3.5f- Loading Hive Tables
3.5f- Loading Hive Tables 00:00:00
3.5g- Internal and External Hive Tables
3.5g- Internal and External Hive Tables 00:00:00
3.5h- Demo Creating Hive Tables, Part 1
3.5h- Demo Creating Hive Tables, Part 1 00:00:00
3.5i- Demo- Creating Hive Tables, Part 2 TS
3.5i- Demo- Creating Hive Tables, Part 2 00:00:00
3.5k- File Types for Hive
3.5k- File Types for Hive 00:00:00
3.5l- Hive SerDes
3.5l- Hive SerDes 00:00:00
3.5n- Review of the Activity
3.5n- Review of the Activity 00:00:00
3.5o- Using SerDes
3.5o- Using SerDes 00:00:00
3.5p- Demo Applying RegexSerDe to Semi-Structured Data
3.5p- Demo Applying RegexSerDe to Semi-Structured Data 00:00:00
3.6a- Hive Character Data Types
3.6a- Hive Character Data Types 00:00:00
3.6b- SAS Methods to Manage Hive Character Data Types
3.6b- SAS Methods to Manage Hive Character Data Types 00:00:00
3.6c- Demo Using Conversion Functions and TBLPROPERTIES to Control How SAS Interprets Hive Columns
3.6c- Demo Using Conversion Functions and TBLPROPERTIES to Control How SAS Interprets Hive Columns 00:00:00
4.1a Using the SAS/ACCESS LIBNAME Engine- Chapter Overview
4.1a Using the SAS/ACCESS LIBNAME Engine- Chapter Overview 00:00:00
4.2a SASACCESS LIBNAME Engine
4.2a SASACCESS LIBNAME Engine 00:00:00
4.2b- Accessing Hive Metadata and Data
4.2b- Accessing Hive Metadata and Data 00:00:00
4.2c Implicit Pass-Through with the LIBNAME Engine
4.2c Implicit Pass-Through with the LIBNAME Engine 00:00:00
4.2e- Demo Determining If a WHERE Clause Is Passed to Hive
4.2e- Demo Determining If a WHERE Clause Is Passed to Hive 00:00:00
4.2g-Using the SASTRACE System Option to Determine What Processing Is Passed to Hive
4.2g-Using the SASTRACE System Option to Determine What Processing Is Passed to Hive 00:00:00
4.2h-Demo Using the SASTRACE System Option to Determine Where Processing Occurs
4.2h-Demo Using the SASTRACE System Option to Determine Where Processing Occurs 00:00:00
4.2i- Passing SAS Functions to Hadoop
4.2i- Passing SAS Functions to Hadoop 00:00:00
4.3a- LIBNAME Engine Translation to HiveQL
4.3a- LIBNAME Engine Translation to HiveQL 00:00:00
4.4a-Data Set Options Passed to Hive
4.4a-Data Set Options Passed to Hive 00:00:00
4.4c- Review of the Activity
4.4c- Review of the Activity 00:00:00
4.5a- Creating and Using PROC SQL Views of Hive Tables
4.5a- Creating and Using PROC SQL Views of Hive Tables 00:00:00
4.5c-Review of the Activity
4.5c-Review of the Activity 00:00:00
4.5d- Creating Views with an Embedded LIBNAME Statement
4.5d- Creating Views with an Embedded LIBNAME Statement 00:00:00
4.6a- Ways to Combine Tables and Passing Joins to Hive
4.6a- Ways to Combine Tables and Passing Joins to Hive 00:00:00
4.6c-Review of the Activity
4.6c-Review of the Activity 00:00:00
4.6d- SQL Set Operators
4.6d- SQL Set Operators 00:00:00
4.6f- Review of the Activity
4.6f- Review of the Activity 00:00:00
4.6g- Combining Hive Tables with Other Source
4.6g- Combining Hive Tables with Other Source 00:00:00
4.6i- Review of the Activity
4.6i- Review of the Activity 00:00:00
4.6j- The MULTI_DATASRC_OPT= Option
4.6j- The MULTI_DATASRC_OPT= Option 00:00:00
4.6l- Review of the Activity
4.6l- Review of the Activity 00:00:00
4.6n- Review of the Exercise
4.6n- Review of the Exercise 00:00:00
4.7a- Methods to Create Hive Tables
4.7a- Methods to Create Hive Tables 00:00:00
4.7b- Demo Creating Output Data Sets as Hive Tables
4.7b- Demo Creating Output Data Sets as Hive Tables 00:00:00
4.8a- Ordering Data
4.8a- Ordering Data 00:00:00
4.8c- Ordering Data with a BY Statement
4.8c- Ordering Data with a BY Statement 00:00:00
4.8e- Using the TABULATE and REPORT Procedures
4.8e- Using the TABULATE and REPORT Procedures 00:00:00
4.8f- Demo Sorting and Reporting with Hive Data
4.8f- Demo Sorting and Reporting with Hive Data 00:00:00
4.8h- Review of the Exercise
4.8h- Review of the Exercise 00:00:00
4.8j- Review of the Exercise
4.8j- Review of the Exercise 00:00:00
5.1a Chapter Overview and Objectives
5.1a Chapter Overview and Objectives 00:00:00
5.2a Partitions, Clusters, and Indexes for Hive
5.2a Partitions, Clusters, and Indexes for Hive 00:00:00
5.2b- Partitioning Hive Tables
5.2b- Partitioning Hive Tables 00:00:00
5.2c- Demo Creating and Viewing Partitions in a Hive Table
5.2c- Demo Creating and Viewing Partitions in a Hive Table 00:00:00
5.2d-Considerations for Creating Partitions
5.2d-Considerations for Creating Partitions 00:00:00
5.2e- Clustering Hive Tables
5.2e- Clustering Hive Tables 00:00:00
5.2h- Indexing Hive Tables
5.2h- Indexing Hive Tables 00:00:00
6.1a Chapter Overview
6.1a Chapter Overview 00:00:00
6.2a- Review of SAS Technologies for Hadoop
6.2a- Review of SAS Technologies for Hadoop 00:00:00
6.2b- Review of the Software Architecture for SAS and Hadoop
6.2b- Review of the Software Architecture for SAS and Hadoop 00:00:00
6.2c- Types of In-Memory Analytics Grids
6.2c- Types of In-Memory Analytics Grids 00:00:00
6.2d- Engine Types and Products for In-Memory Analytics
6.2d- Engine Types and Products for In-Memory Analytics 00:00:00
6.2f- SAS High-Performance Procedure Example
6.2f- SAS High-Performance Procedure Example 00:00:00
6.2h-SAS Visual Analytics and SAS Visual Statistics
6.2h-SAS Visual Analytics and SAS Visual Statistics 00:00:00
6.2j- SAS In-Memory Statistics Example
6.2j- SAS In-Memory Statistics Example 00:00:00
6.2k- Process Details for the HPA Grid
6.2k- Process Details for the HPA Grid 00:00:00
6.2l- Process Details for the LASR Grid
6.2l- Process Details for the LASR Grid 00:00:00
6.3a- A SAS High-Performance Analytics Programming Example
6.3a- A SAS High-Performance Analytics Programming Example 00:00:00
6.3b- Demo Using High-Performance Procedures and the SASHDAT Library Engine
6.3b- Demo Using High-Performance Procedures and the SASHDAT Library Engine 00:00:00
6.4a- An In-Memory Statistics Programming Example
6.4a- An In-Memory Statistics Programming Example 00:00:00
6.4b- Demo Creating a LASR Analytic Server Session and Using the SASIOLA Engine
6.4b- Demo Creating a LASR Analytic Server Session and Using the SASIOLA Engine 00:00:00
6.5a- What Is DS2 and Why Use It
6.5a- What Is DS2 and Why Use It 00:00:00
6.5b- A Basic DS2 Program Explained
6.5b- A Basic DS2 Program Explained 00:00:00
6.5c- A DS2 Program to Summarize Data
6.5c- A DS2 Program to Summarize Data 00:00:00
6.5d- Running a DS2 Program in Threads
6.5d- Running a DS2 Program in Threads 00:00:00
6.5f- Executing DS2 Threads in Hadoop
6.5f- Executing DS2 Threads in Hadoop 00:00:00
6.5g- Demo Executing DS2 Threads in the Hadoop Cluster to Summarize Data
6.5g- Demo Executing DS2 Threads in the Hadoop Cluster to Summarize Data 00:00:00
6.6a- Using PROC HDMD to Define Table Metadata for Data in HDFS
6.6a- Using PROC HDMD to Define Table Metadata for Data in HDFS 00:00:00
6.6b- Demo Using PROC HDMD to Access HDFS Files
6.6b- Demo Using PROC HDMD to Access HDFS Files 00:00:00
1.1b- Objectives
1.1b- Objectives 00:00:00
1.3c- Creating the course Data
1.3c- Creating the course Data 00:00:00
1.3d- Troubleshooting
1.3d- Troubleshooting 00:00:00
2.1b- Objectives
2.1b- Objectives 00:00:00
2.3m- Ex- Directing Data stp otpt to HDFS with a Hadoop file name
2.3m- Ex- Directing Data stp otpt to HDFS with a Hadoop file name 00:00:00
2.3m- Ex Soln- Directing Data stp otpt to HDFS with a Hadoop file name
2.3m- Ex Soln- Directing Data stp otpt to HDFS with a Hadoop file name 00:00:00
3.1b Chapter 3: Using the SQL Pass-Through Facility- Objectives
3.1b Chapter 3: Using the SQL Pass-Through Facility- Objectives 00:00:00
3.3b Exercise- Using Hue to Investigate the Hive Database
3.3b Exercise- Using Hue to Investigate the Hive Database 00:00:00
3.3b Exercise Solution- Using Hue to Investigate the Hive Database
3.3b Exercise Solution- Using Hue to Investigate the Hive Database 00:00:00
3.4c- Activity
3.4c- Activity 00:00:00
3.2b- Actvity
3.2b- Actvity 00:00:00
3.4i Exercise- Querying a Singe Table using SAS SQL Features
3.4i Exercise- Querying a Singe Table using SAS SQL Features 00:00:00
3.4i Exercise Solution- Querying a Singe Table using SAS SQL Features
3.4i Exercise Solution- Querying a Singe Table using SAS SQL Features 00:00:00
3.4j- Exercise- Joining Tables in Hadoop to create a SAS Table
3.4j- Exercise- Joining Tables in Hadoop to create a SAS Table 00:00:00
3.4j- Exercise Solution- Joining Tables in Hadoop to create a SAS Table
3.4j- Exercise Solution- Joining Tables in Hadoop to create a SAS Table 00:00:00
3.4k-Exercise- Creating and Using a SAS PROC SQL View(optional)
3.4k-Exercise- Creating and Using a SAS PROC SQL View(optional) 00:00:00
3.4k-Exercise Solution- Creating and Using a SAS PROC SQL View(optional)
3.4k-Exercise Solution- Creating and Using a SAS PROC SQL View(optional) 00:00:00
3.5j- Exercise- Creating a Hive Table as an External Table
3.5j- Exercise- Creating a Hive Table as an External Table 00:00:00
3.5j- Exercise Solution- Creating a Hive Table as an External Table
3.5j- Exercise Solution- Creating a Hive Table as an External Table 00:00:00
3.6d- Exercise- Applying CAST Functions to Create Desired Data Types when Computing Columns
3.6d- Exercise- Applying CAST Functions to Create Desired Data Types when Computing Columns 00:00:00
3.6d- Exercise Solution - Applying CAST Functions to Create Desired Data Types when Computing Columns
3.6d- Exercise Solution – Applying CAST Functions to Create Desired Data Types when Computing Columns 00:00:00
5.1b- Objectives
5.1b- Objectives 00:00:00
5.2f- Exercise- Creating and Loading a Clustered Table in Hive Using SQL Pass- Through
5.2f- Exercise- Creating and Loading a Clustered Table in Hive Using SQL Pass- Through 00:00:00
5.2f- Exercise Solution- Creating and Loading a Clustered Table in Hive Using SQL Pass- Through
5.2f- Exercise Solution- Creating and Loading a Clustered Table in Hive Using SQL Pass- Through 00:00:00
5.2g- Exercise- Creating and Loading a Table that is Partitioned and Clustered
5.2g- Exercise- Creating and Loading a Table that is Partitioned and Clustered 00:00:00
5.2g- Exercise Solution- Creating and Loading a Table that is Partitioned and Clustered
5.2g- Exercise Solution- Creating and Loading a Table that is Partitioned and Clustered 00:00:00
4.1b Using the SAS/ACCESS LIBNAME Engine- Objectives
4.1b Using the SAS/ACCESS LIBNAME Engine- Objectives 00:00:00
4.2j- Exercise- Accessing Hive Tables using the LIBNAME Statement
4.2j- Exercise- Accessing Hive Tables using the LIBNAME Statement 00:00:00
4.2j- Exercise Solution- Accessing Hive Tables using the LIBNAME Statement
4.2j- Exercise Solution- Accessing Hive Tables using the LIBNAME Statement 00:00:00
4.4b- Activity
4.4b- Activity 00:00:00
4.4d- Exercise- using SAS Data Set Options with PROC SQL
4.4d- Exercise- using SAS Data Set Options with PROC SQL 00:00:00
4.5b- Activity answer
4.5b- Activity answer 00:00:00
4.5e- Activity
4.5e- Activity 00:00:00
4.6b- ACtivity
4.6b- ACtivity 00:00:00
4.6e- Activity
4.6e- Activity 00:00:00
4.6h- Activity
4.6h- Activity 00:00:00
4.6k- Activity
4.6k- Activity 00:00:00
4.6m- Exercise- Testing Implicit Joins on small Amounts of Data with the OBS= Option
4.6m- Exercise- Testing Implicit Joins on small Amounts of Data with the OBS= Option 00:00:00
4.6m- Exercise Solution- Testing Implicit Joins on small Amounts of Data with the OBS= Option
4.6m- Exercise Solution- Testing Implicit Joins on small Amounts of Data with the OBS= Option 00:00:00
4.7c- Exercise- Using an Alternative Test Method
4.7c- Exercise- Using an Alternative Test Method 00:00:00
4.7c- Exercise Solution- Using an Alternative Test Method
4.7c- Exercise Solution- Using an Alternative Test Method 00:00:00
4.8b- ACtivity
4.8b- ACtivity 00:00:00
4.8d- ACtivity
4.8d- ACtivity 00:00:00
4.8g- Exercise- Using PROC TABULATE as an IN- Database Procedure
4.8g- Exercise- Using PROC TABULATE as an IN- Database Procedure 00:00:00
4.8g- Exercise Solution- Using PROC TABULATE as an IN- Database Procedure
4.8g- Exercise Solution- Using PROC TABULATE as an IN- Database Procedure 00:00:00
4.8i- Exercise- Using PROC REPORT as an IN- Database Procedure
4.8i- Exercise- Using PROC REPORT as an IN- Database Procedure 00:00:00
4.8i- Exercise Solution- Using PROC REPORT as an IN- Database Procedure
4.8i- Exercise Solution- Using PROC REPORT as an IN- Database Procedure 00:00:00
6.1b- Objectives
6.1b- Objectives 00:00:00
Appendix A- WordCount MapReduce Program File
Appendix A- WordCount MapReduce Program File 00:00:00
Appendix B- Additional SAS Technologies for Hadoop File
Appendix B- Additional SAS Technologies for Hadoop File 00:00:00
Course Notes- Introduction to SAS and Hadoop Essentials
Course Notes- Introduction to SAS and Hadoop Essentials 00:00:00
2.3e-Question
2.3e-Question 00:00:00
2.3h- Activity
2.3h- Activity 00:00:00
3.5m- Activity Answer
3.5m- Activity Answer 00:00:00
4.2f- Activity
4.2f- Activity 00:00:00
4.3b- Activity
4.3b- Activity 00:00:00
4.4d- Exercise Solution- using SAS Data Set Options with PROC SQL
4.4d- Exercise Solution- using SAS Data Set Options with PROC SQL 00:00:00

Course Reviews

N.A

ratings
  • 5 stars0
  • 4 stars0
  • 3 stars0
  • 2 stars0
  • 1 stars0

No Reviews found for this course.

PRIVATE COURSE
  • PRIVATE
  • 1 week, 3 days
1 STUDENTS ENROLLED
Copyright @2019