Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
SPDX-License-Identifier: CC-BY-SA-4.0
Download the MNIST dataset to your notebook instance, review the data, transform it, and upload it to your S3 bucket.
You transform the data by changing its format from numpy.array
to comma-separated values (CSV). The XGBoost Algorithm expects input in either the LIBSVM or CSV format. LIBSVM is an open source machine learning library. In this exercise , you use CSV format because it’s simpler.
Topics + Step 4.1: Download the MNIST Dataset + Step 4.2: Explore the Training Dataset + Step 4.3: Transform the Training Dataset and Upload It to Amazon S3