{
 "cells": [
  {
   "cell_type": "markdown",
   "id": "5a70731c",
   "metadata": {},
   "source": [
    "# SageMaker JumpStart ã‚’ç”¨ã„ãŸ LightGBM (åˆ†é¡ž)ã®ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã¨æŽ¨è«–\n",
    "* JumpStart ã§ã¯ç‹¬è‡ªã®ãƒ‡ãƒ¼ã‚¿ã‚’ç”¨æ„ã™ã‚‹ã ã‘ã§ã€æ§˜ã€…ãªãƒ¢ãƒ‡ãƒ«ã®å¦ç¿’ã¨å‡ºæ¥ãŸãƒ¢ãƒ‡ãƒ«ã®æŽ¨è«–ãŒã§ãã‚‹\n",
    "* ã“ã®ãƒŽãƒ¼ãƒˆãƒ–ãƒƒã‚¯ã§ã¯ LightGBM ã®åˆ†é¡žãƒ¢ãƒ‡ãƒ«ã‚’ç”¨ã„ãŸãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã®å‹•ã‹ã—æ–¹ã‚’è¨˜è¿°ã™ã‚‹\n",
    "* ãƒ‡ãƒ¼ã‚¿ã«ã¤ã„ã¦ã¯ã€AWS ãŒå…¬é–‹ã—ã¦ã„ã‚‹ãƒ‡ãƒ¼ã‚¿ã‚’åˆ©ç”¨ã™ã‚‹\n",
    "* SageMaker SDK ã‚’ä½¿ã£ãŸãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã¨æŽ¨è«–ã‚’è¨˜è¼‰ã—ã€æœ€å¾Œã« boto3 ã‚’ä½¿ã£ãŸæŽ¨è«–ã‚’è¨˜è¼‰ã—ã¦ã„ã‚‹\n",
    "* ã“ã®ãƒŽãƒ¼ãƒˆãƒ–ãƒƒã‚¯ã¯ `Data Science 2.0` ã‚¤ãƒ¡ãƒ¼ã‚¸ã€`Python 3` ã‚«ãƒ¼ãƒãƒ«ã§é–‹ã„ã¦ãã ã•ã„\n",
    "\n",
    "## Tabel of Contents\n",
    "* [äº‹å‰æº–å‚™](#äº‹å‰æº–å‚™)\n",
    "  * [ãƒ¢ã‚¸ãƒ¥ãƒ¼ãƒ«ã®ã‚¤ãƒ³ãƒãƒ¼ãƒˆ](#ãƒ¢ã‚¸ãƒ¥ãƒ¼ãƒ«ã®ã‚¤ãƒ³ãƒãƒ¼ãƒˆ)\n",
    "  * [ãƒ‡ãƒ¼ã‚¿å–å¾—](#ãƒ‡ãƒ¼ã‚¿å–å¾—)\n",
    "* [SageMaker JumpStart ã‚’ä½¿ã£ã¦ CUI(SageMaker SDK) ã§ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã¨æŽ¨è«–](#SageMaker-JumpStart-ã‚’ä½¿ã£ã¦-CUI(SageMaker-SDK)-ã§ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã¨æŽ¨è«–)\n",
    "  * [ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°](#ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°)\n",
    "    * [ãƒ‡ãƒ¼ã‚¿ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰](#ãƒ‡ãƒ¼ã‚¿ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰)\n",
    "    * [ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—](#ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—)\n",
    "    * [ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–å®Ÿè¡Œ](#ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–å®Ÿè¡Œ)\n",
    "  * [æŽ¨è«–](#æŽ¨è«–)\n",
    "    * [æŽ¨è«–ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—](#ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—)\n",
    "    * [æŽ¨è«–ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä½œæˆ](#æŽ¨è«–ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä½œæˆ)\n",
    "* [boto3 ã§æŽ¨è«–](#boto3-ã§æŽ¨è«–)\n",
    "  * [å®šæ•°ã‚„ã‚¯ãƒ©ã‚¤ã‚¢ãƒ³ãƒˆã®è¨å®š](#å®šæ•°ã‚„ã‚¯ãƒ©ã‚¤ã‚¢ãƒ³ãƒˆã®è¨å®š)\n",
    "  * [ãƒ¢ãƒ‡ãƒ«ã¨æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ tar.gz ã«å›ºã‚ã‚‹](#ãƒ¢ãƒ‡ãƒ«ã¨æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’-tar.gz-ã«å›ºã‚ã‚‹)\n",
    "  * [boto3 ã§SageMaker ã§ãƒ¢ãƒ‡ãƒ«ã®ä½œæˆ](#boto3-ã§SageMaker-ã§ãƒ¢ãƒ‡ãƒ«ã®ä½œæˆ)\n",
    "  * [boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã®è¨å®šã‚’ä½œæˆ](#boto3-ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã®è¨å®šã‚’ä½œæˆ)\n",
    "  * [boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã‚’ä½œæˆã™ã‚‹](#boto3-ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã‚’ä½œæˆã™ã‚‹)\n",
    "  * [boto3 ã§æŽ¨è«–ã™ã‚‹](#boto3-ã§æŽ¨è«–ã™ã‚‹)\n",
    "  * [boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä»–ã‚’å‰Šé™¤](#boto3-ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä»–ã‚’å‰Šé™¤)\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "bdfbce04",
   "metadata": {},
   "source": [
    "## äº‹å‰æº–å‚™\n",
    "### ãƒ¢ã‚¸ãƒ¥ãƒ¼ãƒ«ã®ã‚¤ãƒ³ãƒãƒ¼ãƒˆ"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ffce17a9",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "import sagemaker\n",
    "from sagemaker import image_uris, model_uris, script_uris\n",
    "from sagemaker.estimator import Estimator\n",
    "from sagemaker.session import Session\n",
    "from sagemaker import hyperparameters\n",
    "import json\n",
    "import pandas as pd\n",
    "from typing import Final\n",
    "import numpy as np"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "97b63a14",
   "metadata": {},
   "source": [
    "### ãƒ‡ãƒ¼ã‚¿å–å¾—\n",
    "å…¬é–‹ã•ã‚Œã¦ã„ã‚‹åˆ†é¡žç”¨ãƒ‡ãƒ¼ã‚¿ã‚’ä½¿ã†ã€‚  \n",
    "mnist ã®ç”»åƒã‚’ã‚«ãƒ©ãƒ å±•é–‹ã•ã‚ŒãŸã‚‚ã®ã§ã‚ã‚Šã€æœ€åˆã®åˆ—ã«æ•™å¸«ãƒ©ãƒ™ãƒ«ãŒæ ¼ç´ã•ã‚Œã¦ã„ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "bc9a07ff",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "data_dir: Final[str] = 'classification_data'\n",
    "!if [ -d ./{data_dir} ]; then rm -rf ./{data_dir}/;fi\n",
    "!mkdir ./{data_dir}/\n",
    "!aws s3 sync  s3://jumpstart-cache-prod-us-east-1/training-datasets/tabular_multiclass/ ./{data_dir}/"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "1c8436ab",
   "metadata": {},
   "source": [
    "## SageMaker JumpStart ã‚’ä½¿ã£ã¦ CUI(SageMaker SDK) ã§ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã¨æŽ¨è«–\n",
    "### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "96ee7936",
   "metadata": {},
   "source": [
    "#### ãƒ‡ãƒ¼ã‚¿ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰\n",
    "\n",
    "* ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‡ãƒ¼ã‚¿ã«ã¤ã„ã¦\n",
    "    * JumpStart ã§è‡ªåˆ†ã®ãƒ‡ãƒ¼ã‚¿ã§ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã™ã‚‹ã«ã¯äºˆã‚ S3 ã«é…ç½®ã™ã‚‹(ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°å®Ÿè¡Œæ™‚ã« S3 ã® URI ã‚’æŒ‡å®šã™ã‚‹)\n",
    "* ãƒ‡ãƒ¼ã‚¿ã®æŒã¡æ–¹ã«ã¤ã„ã¦\n",
    "    * csv å½¢å¼ã§ãƒ•ã‚¡ã‚¤ãƒ«åã‚’ data.csv ã«ã™ã‚‹å¿…è¦ãŒã‚ã‚‹(ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ãŒ data.csv ã—ã‹å—ã‘ä»˜ã‘ãªã„ã‚ˆã†ã«ãªã£ã¦ã„ã‚‹)\n",
    "    * è¨“ç·´ç”¨ãƒ‡ãƒ¼ã‚¿ã® `train/data.csv` ã¯å¿…ãšç”¨æ„ã™ã‚‹\n",
    "    * è©•ä¾¡ç”¨ãƒ‡ãƒ¼ã‚¿ã®`validation/data.csv` ã¯ã‚ªãƒ—ã‚·ãƒ§ãƒ³\n",
    "    * ãƒ†ã‚¹ãƒˆç”¨ãƒ‡ãƒ¼ã‚¿ã® `test/data.csv` ã¯ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°æ™‚ã«ã‚‚ã¡ã‚ã‚“ä½¿ã‚ãªã„ãŒã¾ã¨ã‚ã¦ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã—ã¦ã„ã‚‹ã®ã§å‰¯æ¬¡çš„ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã•ã‚Œã‚‹\n",
    "    * ã‚¿ãƒ¼ã‚²ãƒƒãƒˆå¤‰æ•°ã¯å¿…ãš 0 åˆ—ç›®ã«å…¥ã‚Œã‚‹ã“ã¨(ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ãŒ 0 åˆ—ç›®ã‚’ã‚¿ãƒ¼ã‚²ãƒƒãƒˆå¤‰æ•°ã¨ã—ã¦èªè˜ã™ã‚‹ã‚ˆã†ã«å®Ÿè£…ã•ã‚Œã¦ã„ã‚‹)\n",
    "* ã‚«ãƒ†ã‚´ãƒªãƒ¼å¤‰æ•°ã«ã¤ã„ã¦(ã“ã®ãƒ‡ãƒ¼ã‚¿ã«ã‚«ãƒ†ã‚´ãƒªãƒ¼å¤‰æ•°ã¯ãªã„)\n",
    "    * ãƒ‡ãƒ¼ã‚¿ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã®ãƒ«ãƒ¼ãƒˆã«ä»»æ„ã® json ãƒ•ã‚¡ã‚¤ãƒ«ã‚’ï¼‘ã¤ã ã‘å«ã‚€ã“ã¨ã§ã‚«ãƒ†ã‚´ãƒªã‚«ãƒ«å¤‰æ•°ã‚’æ‰±ã†ã“ã¨ãŒã§ãã‚‹\n",
    "    * ã‚«ãƒ†ã‚´ãƒªãƒ¼å¤‰æ•°ã¯ã€0 ä»¥ä¸Šã®æ•´æ•°(Int32ã®ç¯„å›²å†…)ã§ã‚¨ãƒ³ã‚³ãƒ¼ãƒ‰ã•ã‚Œã¦ã„ã‚‹å¿…è¦ãŒã‚ã‚‹\n",
    "    * ã‚«ãƒ†ã‚´ãƒªãƒ¼å¤‰æ•°ã¯åˆ—ã®ã‚¤ãƒ³ãƒ‡ãƒƒã‚¯ã‚¹ã§è¾žæ›¸å½¢å¼ã§ã‚ãƒ¼ã« `cat_index_list` ã§ã€å€¤ã«åˆ—ã®ã‚¤ãƒ³ãƒ‡ãƒƒã‚¯ã‚¹ã‚’ãƒªã‚¹ãƒˆå½¢å¼ã§æ¸¡ã™\n",
    "    * ä»Šå›žã¯ 1 åˆ—ç›®ãŒã‚«ãƒ†ã‚´ãƒªãƒ¼å¤‰æ•°\n",
    "    * å®Ÿéš›ã®ä¾‹ã¯[å›žå¸°ãƒ¢ãƒ‡ãƒ«](./lightgbm_regression.ipynb)ã§åˆ©ç”¨ã—ã¦ã„ã‚‹ã®ã§å‚ç…§ã®ã“ã¨"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "a4c8c32b",
   "metadata": {},
   "source": [
    "ãƒ‡ãƒ¼ã‚¿ã®ç¢ºèª(JumpStart ã‚’å‹•ã‹ã™ã®ã«ã¯ä¸è¦)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "217d125f",
   "metadata": {},
   "outputs": [],
   "source": [
    "# pd.read_csv(f'{data_dir}/train/data.csv',header=None).head()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "aa865131",
   "metadata": {},
   "source": [
    "* ãƒ‡ãƒ¼ã‚¿ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã¯ [upload_data](https://sagemaker.readthedocs.io/en/stable/api/utility/session.html#sagemaker.session.Session.upload_data) ãƒ¡ã‚½ãƒƒãƒ‰ã‚’åˆ©ç”¨ã—ã¦ã€ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã¾ã‚‹ã”ã¨ S3 ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã™ã‚‹\n",
    "* ã“ã“ã§ã¯ SageMaker ã®ãƒ‡ãƒ•ã‚©ãƒ«ãƒˆãƒã‚±ãƒƒãƒˆ(`sagemaker-{region}-{account}`ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã—ã¦ã„ã‚‹ãŒã€ä»»æ„ã®ãƒã‚±ãƒƒãƒˆã‚’é¸æŠžã™ã‚‹ã¨ãã¯ `bucket` å¼•æ•°ã‚’ä½¿ç”¨ã™ã‚‹\n",
    "* ã“ã“ã§å‡ºåŠ›ã•ã‚Œã‚‹ URI ã¯ã€GUI ã§å…¥åŠ›ã™ã‚‹å€¤ã§ã‚‚ã‚ã‚‹ï¼ˆGUI ã®å ´åˆã¯ã€S3 ã® URI ã‚’å…¥åŠ›ã—ãŸã‚ã¨ `Train` ã‚’ã‚¯ãƒªãƒƒã‚¯ã™ã‚Œã°å¦ç¿’ãŒé–‹å§‹ã•ã‚Œã‚‹  "
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f18031ca",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# ä½¿ã†ãƒ‡ãƒ¼ã‚¿ã‚’ S3 ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰\n",
    "input_s3_uri: Final[str] = sagemaker.session.Session().upload_data(\n",
    "    f'./{data_dir}/',\n",
    "    key_prefix = 'sagemaker-jumpstart/lightgbm_classification/data'\n",
    ")\n",
    "print(f'ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰å…ˆ : \\n{input_s3_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e7a4aac4",
   "metadata": {},
   "source": [
    "#### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—\n",
    "* JumpStart ã¯äºˆã‚ã‚³ãƒ³ãƒ†ãƒŠã‚„ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã‚’ç”¨æ„ã—ã¦ã„ã‚‹ã®ã§ã€ãã®ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã‚’å–å¾—ã™ã‚‹\n",
    "\n",
    "##### å®šæ•°ã®è¨å®š"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "eb5f8ddd",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# JumpStart ã§å‹•ã‹ã™ãƒ¢ãƒ‡ãƒ«ã¨ãƒãƒ¼ã‚¸ãƒ§ãƒ³ã€ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ã‚¿ã‚¤ãƒ—ã¨å°æ•°ã‚’è¨å®š\n",
    "model_id: Final[str] = 'lightgbm-classification-model'\n",
    "model_version: Final[str] = '*'\n",
    "training_instance_type: Final[str] = 'ml.m5.xlarge'\n",
    "instance_count: Final[int] = 1"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "4b619554",
   "metadata": {},
   "source": [
    "##### ãƒãƒ¼ãƒ«åã‚’å–å¾—\n",
    "ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–ã‚’å‹•ã‹ã™éš›ã«ã€ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ã«å‰²ã‚Šå½“ã¦ã‚‹ãƒãƒ¼ãƒ«ã‚’å–å¾—"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f8a61840",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# JumpStart ã§å‹•ã‹ã™ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–ã«ã‚¢ã‚¿ãƒƒãƒã™ã‚‹ãƒãƒ¼ãƒ«ã‚’å–å¾—(Notebook ã¨åŒä¸€)\n",
    "role: Final[str] = sagemaker.get_execution_role()\n",
    "print(role)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "76320358",
   "metadata": {},
   "source": [
    "##### Fine-Tune ã®å…ƒã¨ãªã‚‹ãƒ¢ãƒ‡ãƒ«ã® URI ã‚’å–å¾—\n",
    "* JumpStart ã¯ Fine-Tune ãŒåŸºæœ¬ãªã®ã§ã€Fine-Tune ã®å…ƒã¨ãªã‚‹ãƒ¢ãƒ‡ãƒ«ã® URI ã‚’å–å¾—\n",
    "* ãŸã ã—ã€LightGBM ã¯ Fine-Tune ã™ã‚‹ã‚‚ã®ã§ã¯ãªã„ã®ã§ classification ã™ã‚‹ã¨ã„ã†è¨å®šå€¤ã ã‘ãŒæ ¼ç´ã•ã‚Œã¦ã„ã‚‹\n",
    "* [sagemaker.model_uris.retrieve](https://sagemaker.readthedocs.io/en/stable/api/utility/model_uris.html#sagemaker.model_uris.retrieve) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "5f3a59fb",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "base_model_uri: Final[str] = model_uris.retrieve(model_id=model_id, model_version=model_version, model_scope=\"training\")\n",
    "print(f'ãƒ¢ãƒ‡ãƒ«ã® URI:\\n{base_model_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "615fa805",
   "metadata": {},
   "source": [
    "è¨å®šã‚’ç¢ºèªã—ãŸã„å ´åˆã¯ä¸‹è¨˜ã‚’å®Ÿè¡Œ( JumpStart ã‚’å‹•ã‹ã™ã®ã«ã¯ä¸è¦ãªä½œæ¥)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "9be7ac11",
   "metadata": {},
   "outputs": [],
   "source": [
    "# model_dir = 'train-lightgbm-classification-model'\n",
    "# !aws s3 cp {base_model_uri} ./\n",
    "# !if [ -d ./{model_dir} ]; then rm -rf {model_dir}/;fi\n",
    "# !mkdir {model_dir}/\n",
    "# !tar zxvf train-lightgbm-classification-model.tar.gz -C ./{model_dir}/\n",
    "# !cat {model_dir}/train-pytorch-lightgbm-lightgbmmulticlass.json"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "acc3699f",
   "metadata": {},
   "source": [
    "##### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã® S3 URI ã‚’å–å¾—\n",
    "* ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã¯ AWS ãŒç®¡ç†ã™ã‚‹ S3 ã«æ ¼ç´ã•ã‚Œã¦ãŠã‚Šã€ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–ã‚’å®šç¾©ã™ã‚‹æ™‚ã«ä½¿ã†ãŸã‚å–å¾—ã™ã‚‹  \n",
    "* [sagemaker.script_uris.retrieve](https://sagemaker.readthedocs.io/en/stable/api/utility/script_uris.html#sagemaker.script_uris.retrieve) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ca62d152",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "training_script_uri: Final[str] = script_uris.retrieve(\n",
    "    model_id=model_id, model_version=model_version, script_scope=\"training\"\n",
    ")\n",
    "print(f'ã‚³ãƒ¼ãƒ‰ã® URI:\\n{training_script_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "4f180ecb",
   "metadata": {},
   "source": [
    "* ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã‚’ç¢ºèªã—ãŸã„å ´åˆã¯ä¸‹è¨˜ã‚’å®Ÿè¡Œ( JumpStart ã‚’å‹•ã‹ã™ã®ã«ã¯ä¸è¦ãªä½œæ¥)\n",
    "* ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã‚’ã‚«ã‚¹ã‚¿ãƒžã‚¤ã‚ºã—ãŸã„å ´åˆã¯ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰ã—ã¦ç·¨é›†ã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f0431fce",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [],
   "source": [
    "training_script_dir: Final[str] = 'lightgbm_classification_training_script_dir'\n",
    "!aws s3 cp {training_script_uri} ./\n",
    "!if [ -d ./{training_script_dir} ]; then rm -rf ./{training_script_dir}/;fi\n",
    "!mkdir ./{training_script_dir}/\n",
    "!tar zxvf sourcedir.tar.gz -C ./{training_script_dir}/\n",
    "!pygmentize ./{training_script_dir}/transfer_learning.py"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "87e3d3c4",
   "metadata": {},
   "source": [
    "##### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ³ãƒ†ãƒŠã‚¤ãƒ¡ãƒ¼ã‚¸ã® URI ã‚’å–å¾—\n",
    "* AWS ãŒç®¡ç†ã™ã‚‹ ECR ã«æ ¼ç´ã•ã‚Œã¦ãŠã‚Šã€ãã® URI ã‚’å–å¾—ã™ã‚‹\n",
    "* [sagemaker.image_uris.retrieve](https://sagemaker.readthedocs.io/en/stable/api/utility/image_uris.html#sagemaker.image_uris.retrieve) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "1b693bb4",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "training_image_uri: Final[str] = image_uris.retrieve(\n",
    "    region=None,\n",
    "    framework=None,\n",
    "    image_scope=\"training\",\n",
    "    model_id=model_id,\n",
    "    model_version=model_version,\n",
    "    instance_type=training_instance_type,\n",
    ")\n",
    "print(f'ã‚³ãƒ³ãƒ†ãƒŠã® URI:\\n{training_image_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "e6cf5e02",
   "metadata": {},
   "source": [
    "##### ãƒ‡ãƒ•ã‚©ãƒ«ãƒˆã®ãƒã‚¤ãƒ‘ãƒ¼ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã‚’å–å¾—\n",
    "* [sagemaker.hyperparameters.retrieve_default](https://sagemaker.readthedocs.io/en/stable/api/utility/hyperparameters.html#sagemaker.hyperparameters.retrieve_default) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹\n",
    "* ãƒã‚¤ãƒ‘ãƒ¼ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã‚’å¤‰ãˆã‚‹å ´åˆã¯å–å¾—çµæžœã®è¾žæ›¸ã‚’ä¸Šæ›¸ãã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "8d133e81",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "hps = hyperparameters.retrieve_default(\n",
    "    model_id=model_id,\n",
    "    model_version=model_version,\n",
    ")\n",
    "print(f'ãƒã‚¤ãƒ‘ãƒ¼ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿\\n{json.dumps(hps,indent=4)}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "97662751",
   "metadata": {},
   "source": [
    "#### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚¸ãƒ§ãƒ–å®Ÿè¡Œ\n",
    "* é€šå¸¸ã® SageMaker Training ã¨åŒã˜æ§˜ã« [Estimator](https://sagemaker.readthedocs.io/en/stable/api/training/estimators.html#sagemaker.estimator.Estimator) ã‚¯ãƒ©ã‚¹ã‹ã‚‰ `estimator` ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ã‚’ç”Ÿæˆã—ã€ [fit](https://sagemaker.readthedocs.io/en/stable/api/training/estimators.html#sagemaker.estimator.Estimator.fit) ãƒ¡ã‚½ãƒƒãƒ‰ã§å®Ÿè¡Œã™ã‚‹\n",
    "* ä»Šã¾ã§å–å¾—ã—ã¦ããŸè¨å®šå€¤ã‚’å¼•æ•°ã«å…¥ã‚Œã¦ `estimator` ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ã‚’ç”Ÿæˆã™ã‚‹\n",
    "* `training_script_uri` ã«ã¤ã„ã¦ã€ãƒãƒ¼ã‚«ãƒ«ã§æ›¸ãæ›ãˆãŸå ´åˆã¯ãƒãƒ¼ã‚«ãƒ«ã®ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã‚’æŒ‡å®šã™ã‚‹\n",
    "* fit ã®å¼•æ•°ã«ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãƒ‡ãƒ¼ã‚¿ã® S3 URI ã‚’æŒ‡å®šã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "d009d3da",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [],
   "source": [
    "estimator = Estimator(\n",
    "    image_uri=training_image_uri,\n",
    "    source_dir=training_script_uri,\n",
    "    model_uri=base_model_uri,\n",
    "    entry_point=\"transfer_learning.py\",\n",
    "    role=role,\n",
    "    hyperparameters=hps,\n",
    "    instance_count=instance_count,\n",
    "    instance_type=training_instance_type,\n",
    ")\n",
    "estimator.fit({\"training\": input_s3_uri})\n"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "bf22cb5d",
   "metadata": {},
   "source": [
    "### æŽ¨è«–"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c79566a5",
   "metadata": {},
   "source": [
    "#### æŽ¨è«–ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã®å–å¾—\n",
    "* JumpStart ã¯äºˆã‚ã‚³ãƒ³ãƒ†ãƒŠã‚„æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ç”¨æ„ã—ã¦ã„ã‚‹ã®ã§ã€ãã®ãƒ‘ãƒ©ãƒ¡ãƒ¼ã‚¿ã‚’å–å¾—ã™ã‚‹\n",
    "\n",
    "##### ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã® S3 URI ã‚’å–å¾—\n",
    "* æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã¯ AWS ãŒç®¡ç†ã™ã‚‹ S3 ã«æ ¼ç´ã•ã‚Œã¦ãŠã‚Šã€ãƒ¢ãƒ‡ãƒ«ãƒ‡ãƒ—ãƒã‚¤ã«ä½¿ã†ãŸã‚å–å¾—ã™ã‚‹  \n",
    "* [sagemaker.script_uris.retrieve](https://sagemaker.readthedocs.io/en/stable/api/utility/script_uris.html#sagemaker.script_uris.retrieve) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "c3e671ee",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "inference_script_uri: Final[str] = script_uris.retrieve(\n",
    "    model_id=model_id, model_version=model_version, script_scope=\"inference\"\n",
    ")\n",
    "print(f'æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã®URL:\\n{inference_script_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "3ba0b82a",
   "metadata": {},
   "source": [
    "* æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ç¢ºèªã—ãŸã„å ´åˆã¯ä¸‹è¨˜ã‚’å®Ÿè¡Œ( JumpStart ã‚’å‹•ã‹ã™ã®ã«ã¯ä¸è¦ãªä½œæ¥)\n",
    "* æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ã‚«ã‚¹ã‚¿ãƒžã‚¤ã‚ºã—ãŸã„å ´åˆã¯ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰ã—ã¦ç·¨é›†ã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "2e832091",
   "metadata": {
    "scrolled": true
   },
   "outputs": [],
   "source": [
    "# inference_script_dir: Final[str] = 'lightgbm_classification_inference_script_dir'\n",
    "# !aws s3 cp {inference_script_uri} ./\n",
    "# !if [ -d ./{inference_script_dir} ]; then rm -rf ./{inference_script_dir}/;fi\n",
    "# !mkdir ./{inference_script_dir}/\n",
    "# !tar zxvf sourcedir.tar.gz -C ./{inference_script_dir}/\n",
    "# !pygmentize ./{inference_script_dir}/inference.py"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c53b0b4b",
   "metadata": {},
   "source": [
    "##### æŽ¨è«–ã‚³ãƒ³ãƒ†ãƒŠã‚¤ãƒ¡ãƒ¼ã‚¸ã® URI ã‚’å–å¾—\n",
    "* AWS ãŒç®¡ç†ã™ã‚‹ ECR ã«æ ¼ç´ã•ã‚Œã¦ãŠã‚Šã€ãã® URI ã‚’å–å¾—ã™ã‚‹\n",
    "* [sagemaker.image_uris.retrieve](https://sagemaker.readthedocs.io/en/stable/api/utility/image_uris.html#sagemaker.image_uris.retrieve) ãƒ¡ã‚½ãƒƒãƒ‰ã§å–å¾—ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "d8a80d62",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "inference_image_uri: Final[str] = image_uris.retrieve(\n",
    "    region=None,\n",
    "    framework=None,\n",
    "    image_scope=\"inference\",\n",
    "    model_id=model_id,\n",
    "    model_version=model_version,\n",
    "    instance_type=training_instance_type,\n",
    ")\n",
    "print(f'ã‚³ãƒ³ãƒ†ãƒŠã® URI:\\n{inference_image_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "76cbe69f",
   "metadata": {},
   "source": [
    "#### æŽ¨è«–ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä½œæˆ\n",
    "[Estimator](https://sagemaker.readthedocs.io/en/stable/api/training/estimators.html#sagemaker.estimator.Estimator) ã® [deploy](https://sagemaker.readthedocs.io/en/stable/api/training/estimators.html#sagemaker.estimator.EstimatorBase.deploy) ãƒ¡ã‚½ãƒƒãƒ‰ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä½œæˆã‚’è¡Œã†"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "3f099e14",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "predictor = estimator.deploy(\n",
    "    instance_type = 'ml.m5.large',\n",
    "    initial_instance_count  = 1,\n",
    "    entry_point='inference.py',\n",
    "    source_dir=inference_script_uri,\n",
    "    image_uri = inference_image_uri\n",
    "    \n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "c8746166",
   "metadata": {},
   "source": [
    "#### æŽ¨è«–å®Ÿè¡Œ\n",
    "* ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã¯ãƒ‡ãƒ•ã‚©ãƒ«ãƒˆã ã¨ `text/csv` ã—ã‹å—ã‘ä»˜ã‘ãªã„ã®ã§(æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã® `inference.py` ã¨ `constants.py` ã‚’å‚ç…§)ã€å‘¼ã³å‡ºã—ã‚‚ã¨(predictor)å´ã« [CSVSerializer](https://sagemaker.readthedocs.io/en/stable/api/inference/serializers.html#sagemaker.serializers.CSVSerializer) ã‚’è¨å®šã™ã‚‹\n",
    "* `CSVSerializer` ã‚’è¨å®šã™ã‚‹ã¨ã€API ã¸ã®ãƒªã‚¯ã‚¨ã‚¹ãƒˆ([predict](https://sagemaker.readthedocs.io/en/stable/api/inference/predictors.html#sagemaker.predictor.Predictor.predict))æ™‚ã« `content_type='text/csv'` ãŒè¨å®šã•ã‚Œã€ã¾ãŸ ndarray ã‚’æ¸¡ã—ã¦ã‚‚è£å´ã§ csv ã«ã‚·ãƒªã‚¢ãƒ©ã‚¤ã‚ºã—ã¦ãã‚Œã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ccfaa006",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "# csvã«å¤‰æ›ã—ã¦ã€csv å½¢å¼ã§ãƒªã‚¯ã‚¨ã‚¹ãƒˆã‚’æŠ•ã’ã¦ãã‚Œã‚‹ã‚ˆã†ã«ãªã‚‹\n",
    "predictor.serializer = sagemaker.serializers.CSVSerializer()"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f55cd49d",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [],
   "source": [
    "# csv ã§ãƒªã‚¯ã‚¨ã‚¹ãƒˆã™ã‚‹ãƒ‘ã‚¿ãƒ¼ãƒ³\n",
    "np.argmax(json.loads(predictor.predict(pd.read_csv(f'{data_dir}/test/data.csv',header=None).iloc[0:1,1:].to_csv(header=False,index=False)).decode('utf-8'))['probabilities'])\n",
    "# # ndarray ã§ãƒªã‚¯ã‚¨ã‚¹ãƒˆã™ã‚‹ãƒ‘ã‚¿ãƒ¼ãƒ³\n",
    "# np.argmax(json.loads(predictor.predict(pd.read_csv(f'{data_dir}/test/data.csv',header=None).iloc[0:1,1:].values).decode('utf-8'))['probabilities'])"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "7ce3eb42",
   "metadata": {},
   "source": [
    "#### ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆå‰Šé™¤\n",
    "* ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã‚’å‰Šé™¤ã™ã‚‹ã“ã¨ã§ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ãŒåœæ¢ã•ã‚Œã‚‹\n",
    "* [delete_endpoint](https://sagemaker.readthedocs.io/en/stable/api/inference/predictors.html#sagemaker.predictor.Predictor.delete_endpoint) ã§å‰Šé™¤ã§ãã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "a963e645",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "predictor.delete_endpoint()"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "05d00843",
   "metadata": {},
   "source": [
    "## boto3 ã§æŽ¨è«–\n",
    "ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä½œæˆã‚„æŽ¨è«–ã¯ SageMaker SDK ã§ã¯ãªãã€boto3 ã‹ã‚‰ã‚„ã‚‹ã“ã¨ã‚‚å¤šã„ã®ã§ã‚„ã‚Šæ–¹ã‚’ç´¹ä»‹"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "6312cde8",
   "metadata": {},
   "source": [
    "### å®šæ•°ã‚„ã‚¯ãƒ©ã‚¤ã‚¢ãƒ³ãƒˆã®è¨å®š"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "653cca9a",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "import boto3\n",
    "sm_client = boto3.client('sagemaker')\n",
    "smr_client = boto3.client('sagemaker-runtime')\n",
    "endpoint_inservice_waiter = sm_client.get_waiter('endpoint_in_service')"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "68dd961d",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "model_name: Final[str] = 'LightgbmClassification'\n",
    "endpoint_config_name: Final[str] = model_name + 'EndpointConfig'\n",
    "endpoint_name: Final[str] = model_name + 'Endpoint'"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "f6a86d9c",
   "metadata": {},
   "source": [
    "### ãƒ¢ãƒ‡ãƒ«ã¨æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ tar.gz ã«å›ºã‚ã‚‹\n",
    "æŽ¨è«–ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã‚’ç«‹ã¡ä¸Šã’ã‚‹ãŸã‚ã«ã¯ã€SageMaker ä¸Šã§ãƒ¢ãƒ‡ãƒ«ã‚’ç™»éŒ²ã™ã‚‹å¿…è¦ãŒã‚ã‚‹ã€‚  \n",
    "ã“ã“ã§ã„ã†`ãƒ¢ãƒ‡ãƒ«`ã¨ã¯ã€ã€Œæ©Ÿæ¢°å¦ç¿’ãƒ¢ãƒ‡ãƒ«+æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã€ã‚’ tar.gz ã® S3 URI ã¨ã€ãƒ¢ãƒ‡ãƒ«ã‚’å‹•ã‹ã™ã‚³ãƒ³ãƒ†ãƒŠãªã©ã‚’æŒ‡ã™ã€‚  \n",
    "ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ãŒçµ‚ã‚ã£ãŸæ®µéšŽã§ã¯ã€lightgbm ã®å¦ç¿’æ¸ˆãƒ¢ãƒ‡ãƒ«(pkl) ã ã‘ãªã®ã§ã€å½“ç„¶æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’å«ã¾ãªã„ã®ã§ã€  \n",
    "æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’æ¢±åŒ…ã—ã¦ S3 ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã—ãªãŠã™(SageMaker SDK ã ã¨è£å´ã§å‹æ‰‹ã«ã‚„ã£ã¦ãã‚Œã¦ã„ãŸ)ã€‚  \n",
    "  \n",
    "æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã¯ã€`tar.gz` ã®ãƒ«ãƒ¼ãƒˆãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã« `code` ãƒ‡ã‚£ãƒ¬ã‚¯ãƒˆãƒªã‚’é…ç½®ã—ãã®ç›´ä¸‹ã«`inference.py`ã§ç½®ãã¨å‹æ‰‹ã«èªã‚“ã§ãã‚Œã‚‹ã€‚(åå‰ã‚’å¤‰ãˆã‚‹ã“ã¨ã‚‚ã§ãã‚‹ã‹ç’°å¢ƒå¤‰æ•°ã‚’ã„ã˜ã‚‹å¿…è¦ãŒå‡ºã¦ãã‚‹ã®ã§ãŠå‹§ã‚ã—ãªã„ï¼‰"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "af55ad29",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [],
   "source": [
    "# ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã®è¨˜éŒ²ã‹ã‚‰ãƒ¢ãƒ‡ãƒ«ã® URI ã‚’å–å¾—ã—ã¦ã€ãƒãƒ¼ã‚«ãƒ«ã«ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰ã™ã‚‹\n",
    "!aws s3 cp {estimator.latest_training_job.describe()['ModelArtifacts']['S3ModelArtifacts']} ./\n",
    "# å…ˆç¨‹ä½¿ã£ãŸ æŽ¨è«–ã‚³ãƒ¼ãƒ‰ã‚’ãƒ€ã‚¦ãƒ³ãƒãƒ¼ãƒ‰ã™ã‚‹\n",
    "!aws s3 cp {inference_script_uri} ./\n",
    "\n",
    "# ãƒ¢ãƒ‡ãƒ«ã‚’è§£å‡\n",
    "inference_model_dir: Final[str] = 'model'\n",
    "!if [ -d ./{inference_model_dir} ]; then rm -rf ./{inference_model_dir}/;fi\n",
    "!mkdir ./{inference_model_dir}/\n",
    "!tar zxvf ./model.tar.gz -C ./{inference_model_dir}/\n",
    "\n",
    "# ã‚³ãƒ¼ãƒ‰ã‚’è¿½åŠ \n",
    "inference_code_dir: Final[str] = 'code'\n",
    "!if [ -d ./{inference_code_dir} ]; then rm -rf ./{inference_code_dir}/;fi\n",
    "!mkdir ./{inference_code_dir}/\n",
    "!tar zxvf ./sourcedir.tar.gz -C ./{inference_code_dir}/\n",
    "!mv ./code/ model/\n",
    "\n",
    "# å†åœ§ç¸®\n",
    "!rm ./{inference_model_dir}.tar.gz\n",
    "%cd {inference_model_dir}/\n",
    "!tar zcvf model.tar.gz .\n",
    "%cd ..\n",
    "\n",
    "# ãƒ¢ãƒ‡ãƒ«ã¨ãƒˆãƒ¬ãƒ¼ãƒ‹ãƒ³ã‚°ã‚³ãƒ¼ãƒ‰ã® tar.gz ã‚’ S3 ã«ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰\n",
    "inference_model_uri: Final[str] = sagemaker.session.Session().upload_data(\n",
    "    f'./{inference_model_dir}/{inference_model_dir}.tar.gz',\n",
    "    key_prefix = 'sagemaker-jumpstart/lightgbm/model'\n",
    ")\n",
    "print(f'ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰å…ˆ : \\n{inference_model_uri}')"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "8d1bb0af",
   "metadata": {},
   "source": [
    "### boto3 ã§ SageMaker ã§ãƒ¢ãƒ‡ãƒ«ã®ä½œæˆ\n",
    "ã‚¢ãƒƒãƒ—ãƒãƒ¼ãƒ‰ã—ãŸãƒ¢ãƒ‡ãƒ« `model.tar.gz` ã¨ã€ã‚³ãƒ³ãƒ†ãƒŠã‚¤ãƒ¡ãƒ¼ã‚¸ã‚’è¨å®šã™ã‚‹  \n",
    "[create_model](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker.html#SageMaker.Client.create_model) ãƒ¡ã‚½ãƒƒãƒ‰ã§è¨å®šã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "ffbba6b8",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "response = sm_client.create_model(\n",
    "    ModelName=model_name,\n",
    "    PrimaryContainer={\n",
    "        # SageMaker SDK ã®æ™‚ã¨åŒã˜ URI ã‚’æŒ‡å®š\n",
    "        'Image': inference_image_uri,\n",
    "        # SageMaker SDK ã®æ™‚ã¨åŒã˜ URI ã‚’æŒ‡å®š\n",
    "        'ModelDataUrl': inference_model_uri,\n",
    "    },\n",
    "    # SageMaker SDK ã®æ™‚ã¨åŒã˜ role ã‚’æŒ‡å®š\n",
    "    ExecutionRoleArn=role,\n",
    ")\n",
    "print(response)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "7dba8ecb",
   "metadata": {},
   "source": [
    "### boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã®è¨å®šã‚’ä½œæˆ\n",
    "ä½¿ç”¨ã™ã‚‹ãƒ¢ãƒ‡ãƒ«ã€ã‚¤ãƒ³ã‚¹ã‚¿ãƒ³ã‚¹ã®ç¨®é¡žã¨å°æ•°ãªã©ã‚’è¨å®šã™ã‚‹ã€‚  \n",
    "[create_endpoint_config](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker.html#SageMaker.Client.create_endpoint_config) ãƒ¡ã‚½ãƒƒãƒ‰ã§è¨å®šã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "f08b19c5",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "response = sm_client.create_endpoint_config(\n",
    "    EndpointConfigName=endpoint_config_name,\n",
    "    ProductionVariants=[\n",
    "        {\n",
    "            'VariantName': 'AllTrafic',\n",
    "            'ModelName': model_name,\n",
    "            'InitialInstanceCount': 1,\n",
    "            'InstanceType': 'ml.m5.xlarge',\n",
    "        },\n",
    "    ],\n",
    ")\n",
    "print(response)"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "fdb2a026",
   "metadata": {},
   "source": [
    "### boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆã‚’ä½œæˆã™ã‚‹\n",
    "[create_endpoint](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker.html#SageMaker.Client.create_endpoint) ãƒ¡ã‚½ãƒƒãƒ‰ã§ä½œæˆã™ã‚‹"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "62ff5f13",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "response = sm_client.create_endpoint(\n",
    "    EndpointName=endpoint_name,\n",
    "    EndpointConfigName=endpoint_config_name,\n",
    ")\n",
    "# ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆãŒç«‹ã¡ä¸ŠãŒã‚‹ã¾ã§å¾…ã¤\n",
    "endpoint_inservice_waiter.wait(\n",
    "    EndpointName=endpoint_name,\n",
    "    WaiterConfig={'Delay': 5,}\n",
    ")"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "3c127d49",
   "metadata": {},
   "source": [
    "### boto3 ã§æŽ¨è«–ã™ã‚‹\n",
    "[invoke_endpoint](https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/sagemaker-runtime.html#SageMakerRuntime.Client.invoke_endpoint)ã§æŽ¨è«–ã‚’å®Ÿè¡Œã§ãã‚‹ã€‚  \n",
    "client ã¯ `boto3.client('sagemaker')` ã§ã¯ãªãã€`boto3.client('sagemaker-runtime')`ãªã“ã¨ã«æ³¨æ„ã€‚"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "4a68d4a9",
   "metadata": {
    "scrolled": true,
    "tags": []
   },
   "outputs": [],
   "source": [
    "request_args = {\n",
    "    'EndpointName': endpoint_name,\n",
    "    'ContentType' : 'text/csv',\n",
    "    'Body' : pd.read_csv(f'{data_dir}/test/data.csv',header=None).iloc[0:1,1:].to_csv(header=False, index=False)\n",
    "}\n",
    "response = smr_client.invoke_endpoint(**request_args)\n",
    "np.argmax(json.loads(response['Body'].read())['probabilities'])"
   ]
  },
  {
   "cell_type": "markdown",
   "id": "eb141c95",
   "metadata": {},
   "source": [
    "### boto3 ã§ã‚¨ãƒ³ãƒ‰ãƒã‚¤ãƒ³ãƒˆä»–ã‚’å‰Šé™¤"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "5b61a459",
   "metadata": {
    "tags": []
   },
   "outputs": [],
   "source": [
    "r = sm_client.delete_endpoint(EndpointName=endpoint_name)\n",
    "r = sm_client.delete_endpoint_config(EndpointConfigName=endpoint_config_name)\n",
    "r = sm_client.delete_model(ModelName=model_name)"
   ]
  },
  {
   "cell_type": "code",
   "execution_count": null,
   "id": "e1737fcc",
   "metadata": {},
   "outputs": [],
   "source": []
  }
 ],
 "metadata": {
  "instance_type": "ml.t3.medium",
  "kernelspec": {
   "display_name": "Python 3 (Data Science 2.0)",
   "language": "python",
   "name": "python3__SAGEMAKER_INTERNAL__arn:aws:sagemaker:us-east-1:081325390199:image/sagemaker-data-science-38"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.8.13"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 5
}