{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Ingest data with Athena\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "---\n", "\n", "This notebook's CI test result for us-west-2 is as follows. CI test results in other regions can be found at the end of the notebook. \n", "\n", "\n", "\n", "---" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "This notebook demonstrates how to set up a database with Athena and query data with it.\n", "\n", "Amazon Athena is a serverless interactive query service that makes it easy to analyze your S3 data with standard SQL. It uses S3 as its underlying data store, and uses Presto with ANSI SQL support, and works with a variety of standard data formats, including CSV, JSON, ORC, Avro, and Parquet. Athena is ideal for quick, ad-hoc querying but it can also handle complex analysis, including large joins, window functions, and arrays. \n", "\n", "To get started, you can point to your data in Amazon S3, define the schema, and start querying using the built-in query editor. Amazon Athena allows you to tap into all your data in S3 without the need to set up complex processes to extract, transform, and load the data (ETL).\n" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Set up Athena\n", "First, we are going to make sure we have the necessary policies attached to the role that we used to create this notebook to access Athena. You can do this through an IAM client as shown below, or through the AWS console. \n", "\n", "**Note: You would need IAMFullAccess to attach policies to the role.**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "#### Attach IAMFullAccess Policy from Console\n", "\n", "**1.** Go to **SageMaker Console**, choose **Notebook instances** in the navigation panel, then select your notebook instance to view the details. Then under **Permissions and Encryption**, click on the **IAM role ARN** link and it will take you to your role summary in the **IAM Console**. " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "