This part is OPTIONAL for our workshop. You don’t need it if you already have a local Docker setup and have your data files ready on your local machine.
Development endpoints incur costs whether or not you are using them. Please delete the endpoints AND notebooks after usage.
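If you prefer to clean up from a script rather than the console, the following is a minimal sketch. It assumes you pass in a boto3 Glue client (for example `boto3.client("glue")`) and that your endpoint is named `byod-endpoint`, which is a hypothetical name used only for illustration. It covers the endpoint only, so notebooks still need to be deleted separately.

```python
def delete_dev_endpoint(glue_client, endpoint_name):
    """Delete a Glue development endpoint.

    Returns True if the delete call succeeded, False if no endpoint
    with that name exists.
    """
    try:
        glue_client.delete_dev_endpoint(EndpointName=endpoint_name)
        return True
    except glue_client.exceptions.EntityNotFoundException:
        # Nothing to delete, so nothing is left running under this name.
        return False


# Example usage (needs AWS credentials; "byod-endpoint" is a hypothetical name):
#   import boto3
#   delete_dev_endpoint(boto3.client("glue"), "byod-endpoint")
```

Passing the client in as an argument keeps the helper independent of how your credentials and region are configured.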
In AWS Glue, you can create an environment, known as a development endpoint, that you can use to iteratively develop and test your extract, transform, and load (ETL) scripts. For more information, refer to Developing Scripts Using Development Endpoints.
The advantages of having a development endpoint compared to the local Docker method are:
You can then create a notebook that connects to the endpoint, and use your notebook to author and test your ETL script. When you’re satisfied with the results of your development process, you can create an ETL job that runs your script. With this process, you can add functions and debug your scripts interactively.
You can also connect your local IDE to this endpoint, as explained in Tutorial: Set Up PyCharm Professional with a Development Endpoint.
How to create an endpoint and use it from a notebook:
Go to AWS Glue in the console: https://console.aws.amazon.com/glue/
It will take a while to create the endpoint.
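Because creation takes a while, you may want to poll the endpoint status from a script instead of refreshing the console. The sketch below assumes a boto3 Glue client is passed in and that the endpoint reports the status strings `READY` and `FAILED`. The function name, polling interval, and timeout are illustrative choices, not values from the workshop.

```python
import time


def wait_for_dev_endpoint(glue_client, endpoint_name,
                          poll_seconds=30, timeout_seconds=1200,
                          sleep=time.sleep):
    """Poll a Glue development endpoint until it is ready.

    Returns the endpoint description once its status is READY.
    Raises RuntimeError if provisioning fails and TimeoutError if the
    endpoint is still not ready after roughly `timeout_seconds`.
    """
    waited = 0
    while True:
        response = glue_client.get_dev_endpoint(EndpointName=endpoint_name)
        endpoint = response["DevEndpoint"]
        status = endpoint.get("Status")
        if status == "READY":
            return endpoint
        if status == "FAILED":
            reason = endpoint.get("FailureReason", "no reason reported")
            raise RuntimeError(f"Endpoint {endpoint_name} failed: {reason}")
        if waited >= timeout_seconds:
            raise TimeoutError(
                f"Endpoint {endpoint_name} still {status} after {waited}s")
        # Any other status is treated as "still provisioning".
        sleep(poll_seconds)
        waited += poll_seconds


# Example usage (needs AWS credentials; "byod-endpoint" is a hypothetical name):
#   import boto3
#   wait_for_dev_endpoint(boto3.client("glue"), "byod-endpoint")
```

The `sleep` parameter exists only so the loop can be exercised without real waiting. In normal use you can leave it at its default.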