{ "cells": [ { "cell_type": "markdown", "id": "16f15664-c5c8-4aa9-9ecf-4ad1935825e0", "metadata": { "tags": [] }, "source": [ "# Serverless Data Prep with Glue Interactive Sessions (from `SageMaker Studio`)\n", "\n", "This notebook shows how to use Apache Spark on AWS Glue to prepare two different datasets and build an urban air quality predictor with Amazon SageMaker.\n", "\n", "Data engineers and data scientists can now interactively prepare data at scale using their Studio notebook’s built-in integration with serverless Spark sessions managed by AWS Glue. Starting in seconds and automatically stopping compute when idle, AWS Glue interactive sessions provide an on-demand, highly scalable, serverless Spark backend for data preparation within Studio. Notable benefits of using AWS Glue interactive sessions on Studio notebooks include:\n", "\n", "- No clusters to provision or manage\n", "- No idle clusters to pay for\n", "- No up-front configuration required\n", "- No resource contention for the same development environment\n", "- The exact same serverless Spark runtime and platform as AWS Glue extract, transform, and load (ETL) jobs\n", "\n", "![Intro](img/img1.png)\n" ] }, { "cell_type": "markdown", "id": "91500df2-118c-402a-a356-059c33b43e48", "metadata": { "tags": [] }, "source": [ "# Introduction" ] }, { "cell_type": "markdown", "id": "30fbffff-9ca1-49ce-a5cb-f4d0d53e31f4", "metadata": {}, "source": [ "While some organizations see data science, data engineering, and data analytics as separate, siloed functions, many of our customers increasingly find that data prep and analytics are foundational components of ML workflows.\n", "\n", "For example, although organizations have data engineering teams to clean and prepare data for analytics and ML, the specific data that a data scientist needs for training a particular model may not be available in the repository of data that a data engineering team may 
have prepared. Now both teams can perform their respective tasks from the same unified interface using **SageMaker Studio with Glue interactive sessions**.\n", "\n", "![Intro](img/img4.png)" ] }, { "cell_type": "markdown", "id": "a5f80972-6fa0-4fc3-bb70-ef8042f2dfd3", "metadata": { "tags": [] }, "source": [ "# Problem Statement" ] }, { "cell_type": "markdown", "id": "04dfd3da-6bde-4a15-9fbf-e411913b0ba1", "metadata": {}, "source": [ "Let's take a concrete problem and try to solve it. Air pollution in cities can be an acute problem, with damaging effects on people, animals, plants, and property.\n", "\n", "We need to build a machine learning model that helps predict the amount of NO2 in an area based on weather conditions.\n", "\n", "Ultimately, we want an ML model that takes the weather details of a particular city at a given time, such as mean temperature, maximum temperature, and minimum temperature.\n", "\n", "The model should then predict the NO2 (nitrogen dioxide) concentration level at that time.\n", "\n", "![Intro](img/img3.png)\n" ] }, { "cell_type": "markdown", "id": "54f05e1a-2bf4-4a80-b7d2-aed06ea8acad", "metadata": { "tags": [] }, "source": [ "# Dataset" ] }, { "cell_type": "markdown", "id": "84732ed9-55b1-4d28-bdc5-c3ebc40b9baf", "metadata": { "tags": [] }, "source": [ "For this demo we use the following datasets:\n", "\n", "- `First dataset` **NO2 Air Quality** \n", " - [(OpenAQ physical air quality data)](https://registry.opendata.aws/openaq/) : Global, aggregated physical air quality data from public data sources provided by government, research-grade and other sources.\n", " 42 GB of data\n", "\n", "- `Second dataset` **Weather** \n", " - [(NOAA Global Surface Summary of Day)](https://registry.opendata.aws/noaa-gsod/) : Global summary of day data for 18 surface meteorological elements are derived from the synoptic/hourly observations contained in USAF DATSAV3 Surface data and 
Federal Climate Complex Integrated Surface Hourly (ISH)." ] }, { "cell_type": "markdown", "id": "ec12e7d8-0e72-4fa5-a456-21ee04e6476c", "metadata": { "tags": [] }, "source": [ "# Let's get started\n", "\n", "Check the AWS Glue Interactive Sessions from the [**Glue console**](https://us-east-2.console.aws.amazon.com/glue/home?region=us-east-2#/v2/getting-started)" ] }, { "cell_type": "code", "execution_count": 3, "id": "a15fcb8f-e83e-40bc-8d69-e73d83fcfd13", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Welcome to the Glue Interactive Sessions Kernel\n", "For more information on available magic commands, please type %help in any new cell.\n", "\n", "Please view our Getting Started page to access the most up-to-date information on the Interactive Sessions kernel: https://docs.aws.amazon.com/glue/latest/dg/interactive-sessions.html\n", "It looks like there is a newer version of the kernel available. The latest version is 0.38.1 and you have 0.37.4 installed.\n", "Please run `pip install --upgrade aws-glue-sessions` to upgrade your kernel\n", "There is no current session.\n" ] } ], "source": [ "%stop_session" ] }, { "cell_type": "code", "execution_count": 5, "id": "b85b99a7-d86f-47c5-b2f5-f10e9cd89b50", "metadata": { "tags": [] }, "outputs": [ { "data": { "text/markdown": [ "\n", "# Available Magic Commands\n", "\n", "## Sessions Magic\n", "\n", "----\n", " %help Return a list of descriptions and input types for all magic commands. \n", " %profile String Specify a profile in your aws configuration to use as the credentials provider.\n", " %region String Specify the AWS region in which to initialize a session. \n", " Default from ~/.aws/config on Linux or macOS, \n", " or C:\\Users\\ USERNAME \\.aws\\config\" on Windows.\n", " %idle_timeout Int The number of minutes of inactivity after which a session will timeout. 
\n", " Default: 2880 minutes (48 hours).\n", " %session_id_prefix String Define a String that will precede all session IDs in the format \n", " [session_id_prefix]-[session_id]. If a session ID is not provided,\n", " a random UUID will be generated.\n", " %status Returns the status of the current Glue session including its duration, \n", " configuration and executing user / role.\n", " %session_id Returns the session ID for the running session. \n", " %list_sessions Lists all currently running sessions by ID.\n", " %stop_session Stops the current session.\n", " %glue_version String The version of Glue to be used by this session. \n", " Currently, the only valid options are 2.0 and 3.0. \n", " Default: 2.0.\n", "----\n", "\n", "## Selecting Job Types\n", "\n", "----\n", " %streaming String Sets the session type to Glue Streaming.\n", " %etl String Sets the session type to Glue ETL.\n", " %glue_ray String Sets the session type to Glue Ray.\n", "----\n", "\n", "## Glue Config Magic \n", "*(common across all job types)*\n", "\n", "----\n", "\n", " %%configure Dictionary A json-formatted dictionary consisting of all configuration parameters for \n", " a session. Each parameter can be specified here or through individual magics.\n", " %iam_role String Specify an IAM role ARN to execute your session with.\n", " Default from ~/.aws/config on Linux or macOS, \n", " or C:\\Users\\%USERNAME%\\.aws\\config` on Windows.\n", " %number_of_workers int The number of workers of a defined worker_type that are allocated \n", " when a session runs.\n", " Default: 5.\n", " %additional_python_modules List Comma separated list of additional Python modules to include in your cluster \n", " (can be from Pypi or S3).\n", "----\n", "\n", " \n", "## Magic for Spark Jobs (ETL & Streaming)\n", "\n", "----\n", " %worker_type String Set the type of instances the session will use as workers. \n", " ETL and Streaming support G.1X, G.2X, G.4X and G.8X. 
\n", " Default: G.1X.\n", " %connections List Specify a comma separated list of connections to use in the session.\n", " %extra_py_files List Comma separated list of additional Python files From S3.\n", " %extra_jars List Comma separated list of additional Jars to include in the cluster.\n", " %spark_conf String Specify custom spark configurations for your session. \n", " E.g. %spark_conf spark.serializer=org.apache.spark.serializer.KryoSerializer\n", "----\n", " \n", "## Magic for Ray Job\n", "\n", "----\n", " %min_workers Int The minimum number of workers that are allocated to a Ray job. \n", " Default: 1.\n", " %object_memory_head Int The percentage of free memory on the instance head node after a warm start. \n", " Minimum: 0. Maximum: 100.\n", " %object_memory_worker Int The percentage of free memory on the instance worker nodes after a warm start. \n", " Minimum: 0. Maximum: 100.\n", "----\n", "\n", "## Action Magic\n", "\n", "----\n", "\n", " %%sql String Run SQL code. All lines after the initial %%sql magic will be passed\n", " as part of the SQL code. 
\n", "----\n", "\n" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "%help" ] }, { "cell_type": "code", "execution_count": 11, "id": "c1673d5c-9297-448e-8e31-2a6d42809496", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Setting session ID prefix to my-smgluedemo\n", "Setting Glue version to: 3.0\n", "Previous number of workers: 5\n", "Setting new number of workers to: 50\n", "Current idle_timeout is 2880 minutes.\n", "idle_timeout has been set to 600 minutes.\n", "Additional python modules to be included:\n", "sagemaker\n" ] } ], "source": [ "%session_id_prefix my-smgluedemo\n", "%glue_version 3.0\n", "%number_of_workers 50\n", "%idle_timeout 600\n", "%additional_python_modules sagemaker" ] }, { "cell_type": "code", "execution_count": 1, "id": "e4780304-97ff-459d-ac2b-029b83c0257e", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Authenticating with environment variables and user-defined glue_role_arn: arn:aws:iam::507922848584:role/service-role/SageMaker-myRoleforSMStudio\n", "Trying to create a Glue session for the kernel.\n", "Worker Type: G.1X\n", "Number of Workers: 50\n", "Session ID: my-smgluedemo-31f96e60-2cdb-401b-9d00-52a77c55c6c9\n", "Job Type: glueetl\n", "Applying the following default arguments:\n", "--glue_kernel_version 0.37.4\n", "--enable-glue-datacatalog true\n", "--additional-python-modules sagemaker\n", "Waiting for session my-smgluedemo-31f96e60-2cdb-401b-9d00-52a77c55c6c9 to get into ready status...\n", "Session my-smgluedemo-31f96e60-2cdb-401b-9d00-52a77c55c6c9 has been created.\n", "3.1.1-amzn-0\n" ] } ], "source": [ "print(spark.version)" ] }, { "cell_type": "markdown", "id": "e6bdcc48-8304-4661-a6a9-f6f03f618cd3", "metadata": { "tags": [] }, "source": [ "# Part 1: Data preparation and cleaning using Spark " ] }, { "cell_type": "markdown", "id": "f5b9b01e-1bdc-4537-875f-1dda545827fc", "metadata": {}, "source": [ "## 1.1 
Data preparation and cleaning of the `first` dataset (NO2 Air Quality)\n", "\n", "In the cells below, we're going to perform the following operations:\n", "\n", "- Use Spark to read our data from the `OpenAQ` S3 Bucket.\n", "- Filter the available data to `Boston` and `NO2 readings` (indicative of air quality).\n", "- Group the readings by day." ] }, { "cell_type": "code", "execution_count": 2, "id": "b722be6a-f815-4d32-954c-631629e7e104", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+--------------------+---------------+------------------+--------------------+-------+--------------------+--------+------+---------+-----------+----------+-----+-----+\n", "| attribution|averagingPeriod| city| coordinates|country| date|location|mobile|parameter| sourceName|sourceType| unit|value|\n", "+--------------------+---------------+------------------+--------------------+-------+--------------------+--------+------+---------+-----------+----------+-----+-----+\n", "|[{EEA, http://www...| {hours, 1.0}|Escaldes-Engordany|{42.5096939994651...| AD|{2023-02-21T02:00...| AD0942A| false| pm10|EEA Andorra|government|µg/m³| 18.0|\n", "|[{EEA, http://www...| {hours, 1.0}|Escaldes-Engordany|{42.5096939994651...| AD|{2023-02-21T03:00...| AD0942A| false| pm10|EEA Andorra|government|µg/m³| 20.0|\n", "+--------------------+---------------+------------------+--------------------+-------+--------------------+--------+------+---------+-----------+----------+-----+-----+\n", "only showing top 2 rows\n" ] } ], "source": [ "df = spark.read.json(\"s3://openaq-fetches/realtime-gzipped/2022-01-05/1641409725.ndjson.gz\")\n", "df2 = spark.read.schema(df.schema).json(\"s3://openaq-fetches/realtime-gzipped/202*\")\n", "\n", "df2.show(2, truncate=True)" ] }, { "cell_type": "code", "execution_count": 3, "id": "51fb24ce-0949-40e5-8e70-32fb28649e21", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ 
"+--------------------+---------------+--------------------+--------------------+-------+--------------------+--------------------+------+---------+----------+----------+----+------+----+\n", "| attribution|averagingPeriod| city| coordinates|country| date| location|mobile|parameter|sourceName|sourceType|unit| value|year|\n", "+--------------------+---------------+--------------------+--------------------+-------+--------------------+--------------------+------+---------+----------+----------+----+------+----+\n", "|[{US EPA AirNow, ...| {hours, 1.0}|Boston-Cambridge-...|{42.474701, -70.9...| US|{2023-04-05T17:00...| LYNN| false| no2| AirNow|government| ppm|0.0024|2023|\n", "|[{US EPA AirNow, ...| {hours, 1.0}|Boston-Cambridge-...|{42.2117, -71.114...| US|{2023-04-05T17:00...|E. Milton - Blue Hil| false| no2| AirNow|government| ppm| 0.002|2023|\n", "|[{US EPA AirNow, ...| {hours, 1.0}|Boston-Cambridge-...|{42.3489, -71.097...| US|{2023-04-05T17:00...| BOSTON-KENMORE| false| no2| AirNow|government| ppm| 0.016|2023|\n", "|[{US EPA AirNow, ...| {hours, 1.0}|Boston-Cambridge-...|{42.329399, -71.0...| US|{2023-04-05T17:00...| Boston - Roxbury| false| no2| AirNow|government| ppm|0.0068|2023|\n", "|[{US EPA AirNow, ...| {hours, 1.0}|Boston-Cambridge-...|{42.474701, -70.9...| US|{2023-04-07T16:00...| LYNN| false| no2| AirNow|government| ppm|9.0E-4|2023|\n", "+--------------------+---------------+--------------------+--------------------+-------+--------------------+--------------------+------+---------+----------+----------+----+------+----+\n", "only showing top 5 rows\n" ] } ], "source": [ "import pyspark.sql.functions as F\n", "\n", "'''\n", "Filtering Data ONLY for \n", " City : Boston\n", " Parameter : no2\n", "\n", "Adding a new Column 'YEAR' \n", "'''\n", "dfBos = df2.filter(F.lower((df2.city)) \\\n", " .contains('boston')) \\\n", " .filter(df2.parameter == \"no2\") \\\n", " .withColumn(\"year\", F.substring(df2.date.utc, 1, 4)) \\\n", " .cache()\n", "\n", 
"dfBos.show(5, truncate=True)" ] }, { "cell_type": "code", "execution_count": 4, "id": "7953d5d5-86e3-4bff-9ebf-ac905b58d49f", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+----------+--------------------+\n", "| ymd| no2_avg|\n", "+----------+--------------------+\n", "|2023-02-23|0.009197023809523808|\n", "|2020-12-26|0.003088235294117...|\n", "|2022-12-04|0.003643782837127...|\n", "|2021-11-10|0.010848101265822781|\n", "|2021-11-23|0.004562500000000002|\n", "|2022-05-31|0.005007751937984496|\n", "|2021-12-13| 0.01046226415094339|\n", "|2020-07-14|0.005851851851851853|\n", "|2020-04-12|0.004528089887640451|\n", "|2020-01-16|0.014249999999999997|\n", "+----------+--------------------+\n", "only showing top 10 rows\n" ] } ], "source": [ "'''\n", "Aggregate the data by day, taking the average `no2` value for each day.\n", "This reduces the number of data points from ~2.5M to ~2K.\n", "'''\n", "\n", "dfNoAvg = (dfBos.withColumn(\"ymd\", F.to_date(dfBos.date.utc)) \n", " .groupBy(\"ymd\") \n", " .avg(\"value\") \n", " .withColumnRenamed(\"avg(value)\", \"no2_avg\")\n", " )\n", "\n", "dfNoAvg.show(10, truncate=True)" ] }, { "cell_type": "code", "execution_count": 5, "id": "45eeb4a8-63d8-481d-817c-bf4ec73082ed", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "(2019, 2023)\n" ] } ], "source": [ "import pyspark.sql.functions as F\n", "\n", "year_min = dfNoAvg.select(F.year(\"ymd\").alias(\"year\")).agg({'year':'min'}).first()[0]\n", "year_max = dfNoAvg.select(F.year(\"ymd\").alias(\"year\")).agg({'year':'max'}).first()[0]\n", "\n", "(year_min, year_max)" ] }, { "cell_type": "markdown", "id": "9c82f762-3183-404c-bdbf-737c0897845c", "metadata": { "tags": [] }, "source": [ "## 1.2 Data preparation and cleaning of the `second` dataset (Weather)\n", "\n", "\n", "Now that our first dataset looks good, we use the **year_min** and **year_max** variables to limit the data 
we want to read from the second dataset.\n" ] }, { "cell_type": "markdown", "id": "8ebedd47-fee9-4375-987b-c2523780a45a", "metadata": {}, "source": [ "### And now the weather dataset" ] }, { "cell_type": "code", "execution_count": 6, "id": "3520ed79-7174-43f8-8c38-f52bfbed0e03", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "\n" ] } ], "source": [ "from pyspark.sql.types import DoubleType\n", "from pyspark.sql import functions as F\n", "\n", "# For reading the schema \n", "dfSchema = spark.read.csv(\"s3://noaa-gsod-pds/2022/32509099999.csv\", header=True, inferSchema=True)\n", "\n", "# Boston, MA, USA\n", "longLeft, latBottom, longRight, latTop = [-71.191162,42.227087,-70.986022,42.397057]\n", "\n", "\n", "# We read our first year, then union the rest of the years :)\n", "def read_year(year):\n", " return spark.read.csv(f\"s3://noaa-gsod-pds/{year}/\", header=True, schema=dfSchema.schema)\n", "\n", "year_range = range(int(year_min), int(year_max)+1)\n", "df = read_year(year_range[0])\n", "for year in year_range[1:]:\n", " df = df.union(read_year(year))\n", "\n", "df = df \\\n", " .withColumn('LATITUDE', df.LATITUDE.cast(DoubleType())) \\\n", " .withColumn('LONGITUDE', df.LONGITUDE.cast(DoubleType()))\n", "\n", "bostondf = df \\\n", " .filter(df.LATITUDE >= latBottom) \\\n", " .filter(df.LATITUDE <= latTop) \\\n", " .filter(df.LONGITUDE >= longLeft) \\\n", " .filter(df.LONGITUDE <= longRight)\n", "\n", "# Rename columns so they're easier to read\n", "bostonfeatures = bostondf.selectExpr(\"Date as date\", \"MAX as temp_max\", \"MIN as temp_min\", \"WDSP as wind_avg\", \"SLP as pressure_sea_level\", \"STP as pressure_station\")\n", "\n", "# Remove invalid readings\n", "no_data_mappings = [\n", " [\"temp_max\", 9999.9],\n", " [\"temp_min\", 9999.9],\n", " [\"wind_avg\", 999.9],\n", " [\"pressure_sea_level\", 9999.9],\n", " [\"pressure_station\", 9999.9]\n", "]\n", "\n", "for [name, val] in no_data_mappings:\n", " 
bostonfeatures = bostonfeatures \\\n", " .withColumn(name, F.when(F.col(name)==val, None) \\\n", " .otherwise(F.col(name)))\n", " \n", "# Now average each reading per day\n", "bostonfeatures = bostonfeatures \\\n", " .groupBy(\"date\") \\\n", " .agg(*[F.mean(c).alias(c) for c in bostonfeatures.columns[1:]]) \\\n", " .cache()\n", "\n", "bostonfeatures = bostonfeatures.withColumn(\"date\", F.to_date(F.col(\"date\"), \"yyyy-MM-dd\"))" ] }, { "cell_type": "markdown", "id": "04b2e17b-696a-4c1d-b294-b263b27dcea2", "metadata": {}, "source": [ "## 1.3. Join the `two processed datasets` and make them ready for `ML training`\n", "\n", "Now that we've taken a quick look at our data and done some initial exploration of both datasets, let's merge them." ] }, { "cell_type": "code", "execution_count": 7, "id": "969544e0-5a15-4e2e-bb24-e93814dae7a8", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+----------+--------------------+\n", "| ymd| no2_avg|\n", "+----------+--------------------+\n", "|2021-11-10|0.010848101265822783|\n", "|2023-02-23| 0.00919702380952381|\n", "|2022-12-04|0.003643782837127...|\n", "|2022-05-31|0.005007751937984495|\n", "|2021-12-13|0.010462264150943394|\n", "+----------+--------------------+\n", "only showing top 5 rows\n" ] } ], "source": [ "# First Dataset - NO2 Dataset\n", "dfNoAvg.show(5)" ] }, { "cell_type": "code", "execution_count": 8, "id": "e22f9965-4e52-415f-a4fd-c2c5afaad7e0", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+----------+------------------+------------------+--------+------------------+----------------+\n", "| date| temp_max| temp_min|wind_avg|pressure_sea_level|pressure_station|\n", "+----------+------------------+------------------+--------+------------------+----------------+\n", "|2022-03-18| 68.1|42.349999999999994| 4.7| 1013.3| 506.15|\n", "|2023-04-18| 57.85| 48.8| 10.3| 1003.55| 501.3|\n", "|2022-06-02| 63.0| 
54.0| 7.1| 1009.2| 8.2|\n", "|2020-01-07| 41.35| 30.6| 4.3| 1015.0| 507.25|\n", "|2020-04-13|61.150000000000006| 43.55| 8.85| 1013.05| 505.95|\n", "+----------+------------------+------------------+--------+------------------+----------------+\n", "only showing top 5 rows\n" ] } ], "source": [ "# Second Dataset - Weather Dataset \n", "bostonfeatures.show(5)" ] }, { "cell_type": "code", "execution_count": 9, "id": "9c7c5af5-3e55-4c78-a19c-d9eb07622521", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "+--------------------+----------+------------------+--------+--------+------------------+------------------+\n", "| no2_avg| date| temp_max|temp_min|wind_avg|pressure_sea_level| pressure_station|\n", "+--------------------+----------+------------------+--------+--------+------------------+------------------+\n", "| 0.011|2019-12-31|43.150000000000006| 36.2| 7.3|1004.8499999999999| 502.45|\n", "|0.005319148936170215|2020-01-01| 42.55| 36.0| 6.05| 1001.2| 500.3|\n", "|0.009977011494252877|2020-01-02| 47.65| 34.15| 5.85|1009.6500000000001| 504.55|\n", "| 0.01679166666666667|2020-01-03| 50.8| 38.95| 3.9|1007.8499999999999| 503.7|\n", "| 0.01897872340425532|2020-01-04| 50.35| 42.8| 2.0|1005.5999999999999| 502.5|\n", "|0.003416666666666...|2020-01-05| 50.1| 35.0| 7.1| 1001.45| 500.25|\n", "| 0.01388541666666667|2020-01-06| 38.65| 27.6| 2.85|1012.1500000000001| 505.8|\n", "|0.015556962025316456|2020-01-07| 41.35| 30.6| 4.3| 1015.0| 507.25|\n", "| 0.011425|2020-01-08| 44.25| 33.35| 4.65| 1009.45| 504.45|\n", "|0.005458333333333335|2020-01-09| 40.4| 22.1| 7.45|1029.8000000000002| 514.6|\n", "|0.016882978723404257|2020-01-10| 51.0| 24.9| 5.3| 1034.6| 517.05|\n", "|0.006465909090909092|2020-01-11| 67.85| 49.25| 7.25| 1023.6|511.59999999999997|\n", "|0.001406250000000...|2020-01-12| 71.95| 51.25| 9.85| 1012.05|505.59999999999997|\n", "| 0.00792553191489362|2020-01-13| 50.75| 35.95| 4.5| 1030.1| 514.7|\n", 
"|0.012370370370370374|2020-01-14| 41.35| 35.55| 2.5| 1026.65| 513.05|\n", "| 0.01078125|2020-01-15| 50.1| 38.2| 3.6| 1018.75|509.09999999999997|\n", "|0.014249999999999999|2020-01-16| 48.75| 36.85| 5.1| 1011.65| 505.4|\n", "|0.004300000000000002|2020-01-17| 41.05| 16.65| 9.0| 1025.5| 512.5|\n", "|0.009635416666666669|2020-01-18| 26.15| 13.65| 4.45| 1032.55| 516.0|\n", "|0.014961538461538462|2020-01-19| 44.0| 17.35| 5.0| 1005.75| 502.7|\n", "+--------------------+----------+------------------+--------+--------+------------------+------------------+\n", "only showing top 20 rows\n" ] } ], "source": [ "# Merge the two datasets on date (inner join keeps only days present in both)\n", "merged_df = dfNoAvg.join(bostonfeatures, dfNoAvg['ymd'] == bostonfeatures['date'], 'inner') \\\n", " .drop('ymd').orderBy('date', ascending=True)\n", "merged_df.show()" ] }, { "cell_type": "markdown", "id": "073effb6-de26-4e14-8cac-fd6c471997bc", "metadata": {}, "source": [ "## 1.4. Save the data to S3\n" ] }, { "cell_type": "code", "execution_count": 10, "id": "db3e64bf-5413-419d-bf29-072b2a73d960", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "\n" ] } ], "source": [ "# Use an S3 bucket that your session role can write to\n", "bucket = 'fcc-spark-example'\n", "merged_df.coalesce(1).write.parquet(f\"s3://{bucket}/subset-boston-data.parquet\")" ] }, { "cell_type": "markdown", "id": "2638a7ad-46b1-4c61-bb67-e9cb4bac4c0d", "metadata": { "tags": [] }, "source": [ "
\n", "End of Data Processing and Cleaning\n", "
\n" ] }, { "cell_type": "markdown", "id": "63cfb07c-96d6-4fe3-81c7-8d7b75e3854f", "metadata": {}, "source": [ "![Intro](img/img5.png)" ] }, { "cell_type": "markdown", "id": "cf9c268e-fc0e-4b59-962a-8dccfd12e48b", "metadata": { "tags": [] }, "source": [ "# Part 2: Model Building, Training and Deployment" ] }, { "cell_type": "markdown", "id": "278daa2c-c55e-49b4-b847-56583a6cb584", "metadata": {}, "source": [ "## Splitting the data for training and testing" ] }, { "cell_type": "code", "execution_count": 11, "id": "8a396437-dc12-4063-a31d-81c480229ade", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "(1071, 5) (1071, 1)\n", "(170, 5) (170, 1)\n", " temp_max temp_min wind_avg pressure_sea_level pressure_station\n", "2020-01-01 42.55 36.00 6.05 1001.20 500.30\n", "2020-01-02 47.65 34.15 5.85 1009.65 504.55\n", "2020-01-03 50.80 38.95 3.90 1007.85 503.70\n", "2020-01-04 50.35 42.80 2.00 1005.60 502.50\n", "2020-01-05 50.10 35.00 7.10 1001.45 500.25\n" ] } ], "source": [ "import 
pandas as pd\n", "import numpy as np\n", "import matplotlib.pyplot as plt\n", "\n", "import warnings\n", "warnings.simplefilter(action='ignore', category=FutureWarning)\n", "\n", "# Load the processed data into pandas\n", "comp_pd_df = merged_df.toPandas()\n", "comp_pd_df = comp_pd_df.set_index('date')\n", "comp_pd_df.index.name = None\n", "\n", "# Drop the first row (2019-12-31, the only day from 2019)\n", "aq_df = comp_pd_df.iloc[1:].copy()\n", "aq_df.index = pd.to_datetime(aq_df.index)\n", "\n", "# Drop rows with NaN values\n", "aq_df = aq_df.dropna()\n", "\n", "# Use the data from years 2020 up to 2022 as training, and the year 2023 as our candidate year for testing and validating our model.\n", "aq_train_df = aq_df[aq_df.index.year < 2023]\n", "aq_test_df = aq_df[aq_df.index.year == 2023]\n", "\n", "# Input features\n", "x_train = aq_train_df.drop('no2_avg',axis=1)\n", "x_test = aq_test_df.drop('no2_avg',axis=1)\n", "\n", "# Target values\n", "y_train = aq_train_df[[\"no2_avg\"]]\n", "y_test = aq_test_df[[\"no2_avg\"]]\n", "\n", "print(x_train.shape, y_train.shape)\n", "print(x_test.shape, y_test.shape)\n", "print(x_train.head())" ] }, { "cell_type": "markdown", "id": "bf6e53ac-db10-44be-b0a3-f673589c90f6", "metadata": {}, "source": [ "## A few helper functions" ] }, { "cell_type": "code", "execution_count": 12, "id": "4a9949cd-96c8-4cf7-ba70-e28785b3035b", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "\n" ] } ], "source": [ "from math import sqrt\n", "from sklearn.metrics import mean_squared_error, r2_score, explained_variance_score\n", "\n", "# sMAPE is used in the KDD Air Quality challenge: https://biendata.com/competition/kdd_2018/evaluation/ \n", "def smape(actual, predicted):\n", " dividend = np.abs(np.array(actual) - np.array(predicted))\n", " denominator = np.array(actual) + np.array(predicted)\n", " \n", " return 2 * np.mean(np.divide(dividend, denominator, out=np.zeros_like(dividend), where=denominator!=0, casting='unsafe'))\n", "\n", "def 
print_metrics(y_test, y_pred):\n", " print(\"RMSE: %.4f\" % sqrt(mean_squared_error(y_test, y_pred)))\n", " print('Variance score: %.4f' % r2_score(y_test, y_pred))\n", " print('Explained variance score: %.4f' % explained_variance_score(y_test, y_pred))\n", " forecast_err = np.array(y_test) - np.array(y_pred)\n", " print('Forecast bias: %.4f' % (np.sum(forecast_err) * 1.0/len(y_pred) ))\n", " print('sMAPE: %.4f' % smape(y_test, y_pred))" ] }, { "cell_type": "markdown", "id": "4d314108-3bea-4663-823d-73123139cd4e", "metadata": {}, "source": [ "## Training the model \n", "\n", "In the section below, we create a new training job using the Linear Learner algorithm. Once that job completes, we deploy an endpoint and run some validation tests against it. \n", "\n", "**NOTE**: You only need to create this training job and deploy it once. You can use the same endpoint, even in future runs of this notebook, without re-training or re-deploying. " ] }, { "cell_type": "code", "execution_count": 13, "id": "31d24ff9-169f-4330-a5e4-f917b16e35a0", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "2023-06-20 23:10:41 Starting - Starting the training job...\n", "2023-06-20 23:10:56 Starting - Preparing the instances for training......\n", "2023-06-20 23:11:49 Downloading - Downloading input data...\n", "2023-06-20 23:12:15 Training - Downloading the training image...\n", "2023-06-20 23:13:10 Training - Training image download completed. 
Training in progress...Docker entrypoint called with argument(s): train\n", "Running default environment configuration script\n", "[06/20/2023 23:13:18 INFO 140324763719488] Reading default configuration from /opt/amazon/lib/python3.7/site-packages/algorithm/resources/default-input.json: {'mini_batch_size': '1000', 'epochs': '15', 'feature_dim': 'auto', 'use_bias': 'true', 'binary_classifier_model_selection_criteria': 'accuracy', 'f_beta': '1.0', 'target_recall': '0.8', 'target_precision': '0.8', 'num_models': 'auto', 'num_calibration_samples': '10000000', 'init_method': 'uniform', 'init_scale': '0.07', 'init_sigma': '0.01', 'init_bias': '0.0', 'optimizer': 'auto', 'loss': 'auto', 'margin': '1.0', 'quantile': '0.5', 'loss_insensitivity': '0.01', 'huber_delta': '1.0', 'num_classes': '1', 'accuracy_top_k': '3', 'wd': 'auto', 'l1': 'auto', 'momentum': 'auto', 'learning_rate': 'auto', 'beta_1': 'auto', 'beta_2': 'auto', 'bias_lr_mult': 'auto', 'bias_wd_mult': 'auto', 'use_lr_scheduler': 'true', 'lr_scheduler_step': 'auto', 'lr_scheduler_factor': 'auto', 'lr_scheduler_minimum_lr': 'auto', 'positive_example_weight_mult': '1.0', 'balance_multiclass_weights': 'false', 'normalize_data': 'true', 'normalize_label': 'auto', 'unbias_data': 'auto', 'unbias_label': 'auto', 'num_point_for_scaler': '10000', '_kvstore': 'auto', '_num_gpus': 'auto', '_num_kv_servers': 'auto', '_log_level': 'info', '_tuning_objective_metric': '', 'early_stopping_patience': '3', 'early_stopping_tolerance': '0.001', '_enable_profiler': 'false'}\n", "[06/20/2023 23:13:18 INFO 140324763719488] Merging with provided configuration from /opt/ml/input/config/hyperparameters.json: {'feature_dim': '5', 'mini_batch_size': '1000', 'normalize_data': 'True', 'normalize_label': 'True', 'predictor_type': 'regressor'}\n", "[06/20/2023 23:13:18 INFO 140324763719488] Final configuration: {'mini_batch_size': '1000', 'epochs': '15', 'feature_dim': '5', 'use_bias': 'true', 'binary_classifier_model_selection_criteria': 
'accuracy', 'f_beta': '1.0', 'target_recall': '0.8', 'target_precision': '0.8', 'num_models': 'auto', 'num_calibration_samples': '10000000', 'init_method': 'uniform', 'init_scale': '0.07', 'init_sigma': '0.01', 'init_bias': '0.0', 'optimizer': 'auto', 'loss': 'auto', 'margin': '1.0', 'quantile': '0.5', 'loss_insensitivity': '0.01', 'huber_delta': '1.0', 'num_classes': '1', 'accuracy_top_k': '3', 'wd': 'auto', 'l1': 'auto', 'momentum': 'auto', 'learning_rate': 'auto', 'beta_1': 'auto', 'beta_2': 'auto', 'bias_lr_mult': 'auto', 'bias_wd_mult': 'auto', 'use_lr_scheduler': 'true', 'lr_scheduler_step': 'auto', 'lr_scheduler_factor': 'auto', 'lr_scheduler_minimum_lr': 'auto', 'positive_example_weight_mult': '1.0', 'balance_multiclass_weights': 'false', 'normalize_data': 'True', 'normalize_label': 'True', 'unbias_data': 'auto', 'unbias_label': 'auto', 'num_point_for_scaler': '10000', '_kvstore': 'auto', '_num_gpus': 'auto', '_num_kv_servers': 'auto', '_log_level': 'info', '_tuning_objective_metric': '', 'early_stopping_patience': '3', 'early_stopping_tolerance': '0.001', '_enable_profiler': 'false', 'predictor_type': 'regressor'}\n", "[06/20/2023 23:13:20 WARNING 140324763719488] Loggers have already been setup.\n", "[06/20/2023 23:13:20 INFO 140324763719488] Final configuration: {'mini_batch_size': '1000', 'epochs': '15', 'feature_dim': '5', 'use_bias': 'true', 'binary_classifier_model_selection_criteria': 'accuracy', 'f_beta': '1.0', 'target_recall': '0.8', 'target_precision': '0.8', 'num_models': 'auto', 'num_calibration_samples': '10000000', 'init_method': 'uniform', 'init_scale': '0.07', 'init_sigma': '0.01', 'init_bias': '0.0', 'optimizer': 'auto', 'loss': 'auto', 'margin': '1.0', 'quantile': '0.5', 'loss_insensitivity': '0.01', 'huber_delta': '1.0', 'num_classes': '1', 'accuracy_top_k': '3', 'wd': 'auto', 'l1': 'auto', 'momentum': 'auto', 'learning_rate': 'auto', 'beta_1': 'auto', 'beta_2': 'auto', 'bias_lr_mult': 'auto', 'bias_wd_mult': 'auto', 'use_lr_scheduler': 
'true', 'lr_scheduler_step': 'auto', 'lr_scheduler_factor': 'auto', 'lr_scheduler_minimum_lr': 'auto', 'positive_example_weight_mult': '1.0', 'balance_multiclass_weights': 'false', 'normalize_data': 'True', 'normalize_label': 'True', 'unbias_data': 'auto', 'unbias_label': 'auto', 'num_point_for_scaler': '10000', '_kvstore': 'auto', '_num_gpus': 'auto', '_num_kv_servers': 'auto', '_log_level': 'info', '_tuning_objective_metric': '', 'early_stopping_patience': '3', 'early_stopping_tolerance': '0.001', '_enable_profiler': 'false', 'predictor_type': 'regressor'}\n", "[06/20/2023 23:13:20 WARNING 140324763719488] Loggers have already been setup.\n", "Process 7 is a worker.\n", "[06/20/2023 23:13:20 INFO 140324763719488] Using default worker.\n", "[06/20/2023 23:13:20 INFO 140324763719488] Checkpoint loading and saving are disabled.\n", "[2023-06-20 23:13:20.912] [tensorio] [warning] TensorIO is already initialized; ignoring the initialization routine.\n", "[2023-06-20 23:13:20.924] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 0, \"duration\": 13, \"num_examples\": 1, \"num_bytes\": 64000}\n", "[06/20/2023 23:13:20 INFO 140324763719488] Create Store: local\n", "[2023-06-20 23:13:20.936] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 1, \"duration\": 11, \"num_examples\": 2, \"num_bytes\": 68544}\n", "[06/20/2023 23:13:20 INFO 140324763719488] Scaler algorithm parameters\n", " \n", "[06/20/2023 23:13:20 INFO 140324763719488] Scaling model computed with parameters:\n", " {'stdev_label': \n", "[0.00439914]\n", ", 'stdev_weight': \n", "[ 17.317747 16.486868 2.9431407 7.611876 107.63156 ]\n", ", 'mean_label': \n", "[0.00713437]\n", ", 'mean_weight': \n", "[ 63.2533 47.22165 5.8689 1015.7924 503.70746]\n", "}\n", "[06/20/2023 23:13:20 INFO 140324763719488] nvidia-smi: took 0.031 seconds to run.\n", "[06/20/2023 23:13:20 INFO 140324763719488] nvidia-smi identified 0 GPUs.\n", "[06/20/2023 
23:13:20 INFO 140324763719488] Number of GPUs being used: 0\n", "#metrics {\"StartTime\": 1687302801.018846, \"EndTime\": 1687302801.0188751, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"Meta\": \"init_train_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 2071.0, \"count\": 1, \"min\": 2071, \"max\": 2071}, \"Total Batches Seen\": {\"sum\": 3.0, \"count\": 1, \"min\": 3, \"max\": 3}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Number of Records Since Last Reset\": {\"sum\": 0.0, \"count\": 1, \"min\": 0, \"max\": 0}, \"Number of Batches Since Last Reset\": {\"sum\": 0.0, \"count\": 1, \"min\": 0, \"max\": 0}}}\n", "[2023-06-20 23:13:21.057] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 4, \"duration\": 38, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.057702, \"EndTime\": 1687302801.0577638, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.02216162109375, \"count\": 1, \"min\": 1.02216162109375, \"max\": 1.02216162109375}}}\n", "#metrics {\"StartTime\": 1687302801.0578303, \"EndTime\": 1687302801.0578413, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0017120361328125, \"count\": 1, \"min\": 1.0017120361328125, \"max\": 1.0017120361328125}}}\n", "#metrics {\"StartTime\": 1687302801.0578754, \"EndTime\": 1687302801.0578835, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", 
\"epoch\": 0, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0297730712890625, \"count\": 1, \"min\": 1.0297730712890625, \"max\": 1.0297730712890625}}}\n", "#metrics {\"StartTime\": 1687302801.0579112, \"EndTime\": 1687302801.0579183, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0047203369140625, \"count\": 1, \"min\": 1.0047203369140625, \"max\": 1.0047203369140625}}}\n", "#metrics {\"StartTime\": 1687302801.0579443, \"EndTime\": 1687302801.0579495, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0763988037109375, \"count\": 1, \"min\": 1.0763988037109375, \"max\": 1.0763988037109375}}}\n", "#metrics {\"StartTime\": 1687302801.0579677, \"EndTime\": 1687302801.057974, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.134108154296875, \"count\": 1, \"min\": 1.134108154296875, \"max\": 1.134108154296875}}}\n", "#metrics {\"StartTime\": 1687302801.0580008, \"EndTime\": 1687302801.0580075, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1350196533203125, \"count\": 1, \"min\": 1.1350196533203125, \"max\": 1.1350196533203125}}}\n", "#metrics {\"StartTime\": 1687302801.0580392, \"EndTime\": 1687302801.0580475, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9340640258789062, \"count\": 1, \"min\": 0.9340640258789062, \"max\": 0.9340640258789062}}}\n", "#metrics {\"StartTime\": 
1687302801.0580733, \"EndTime\": 1687302801.058082, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8850604248046875, \"count\": 1, \"min\": 0.8850604248046875, \"max\": 0.8850604248046875}}}\n", "#metrics {\"StartTime\": 1687302801.0581121, \"EndTime\": 1687302801.0581203, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9903997802734374, \"count\": 1, \"min\": 0.9903997802734374, \"max\": 0.9903997802734374}}}\n", "#metrics {\"StartTime\": 1687302801.058148, \"EndTime\": 1687302801.0581555, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.104272216796875, \"count\": 1, \"min\": 1.104272216796875, \"max\": 1.104272216796875}}}\n", "#metrics {\"StartTime\": 1687302801.0581877, \"EndTime\": 1687302801.0581963, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0242237548828126, \"count\": 1, \"min\": 1.0242237548828126, \"max\": 1.0242237548828126}}}\n", "#metrics {\"StartTime\": 1687302801.0582228, \"EndTime\": 1687302801.05823, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.98870703125, \"count\": 1, \"min\": 0.98870703125, \"max\": 0.98870703125}}}\n", "#metrics {\"StartTime\": 1687302801.0582576, \"EndTime\": 1687302801.0582647, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": 
{\"sum\": 1.0908687744140626, \"count\": 1, \"min\": 1.0908687744140626, \"max\": 1.0908687744140626}}}\n", "#metrics {\"StartTime\": 1687302801.058294, \"EndTime\": 1687302801.0583017, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1164923095703125, \"count\": 1, \"min\": 1.1164923095703125, \"max\": 1.1164923095703125}}}\n", "#metrics {\"StartTime\": 1687302801.0583332, \"EndTime\": 1687302801.058342, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9398934326171875, \"count\": 1, \"min\": 0.9398934326171875, \"max\": 0.9398934326171875}}}\n", "#metrics {\"StartTime\": 1687302801.0583663, \"EndTime\": 1687302801.058374, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0530172119140624, \"count\": 1, \"min\": 1.0530172119140624, \"max\": 1.0530172119140624}}}\n", "#metrics {\"StartTime\": 1687302801.0584068, \"EndTime\": 1687302801.058416, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.07394384765625, \"count\": 1, \"min\": 1.07394384765625, \"max\": 1.07394384765625}}}\n", "#metrics {\"StartTime\": 1687302801.0584397, \"EndTime\": 1687302801.0584471, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.099530517578125, \"count\": 1, \"min\": 1.099530517578125, \"max\": 1.099530517578125}}}\n", "#metrics {\"StartTime\": 1687302801.058472, \"EndTime\": 1687302801.0584798, \"Dimensions\": {\"Algorithm\": 
\"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0041085205078124, \"count\": 1, \"min\": 1.0041085205078124, \"max\": 1.0041085205078124}}}\n", "#metrics {\"StartTime\": 1687302801.0585144, \"EndTime\": 1687302801.0585217, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9506248779296875, \"count\": 1, \"min\": 0.9506248779296875, \"max\": 0.9506248779296875}}}\n", "#metrics {\"StartTime\": 1687302801.0585446, \"EndTime\": 1687302801.0585513, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9690664672851562, \"count\": 1, \"min\": 0.9690664672851562, \"max\": 0.9690664672851562}}}\n", "#metrics {\"StartTime\": 1687302801.0585792, \"EndTime\": 1687302801.0585864, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0072550659179687, \"count\": 1, \"min\": 1.0072550659179687, \"max\": 1.0072550659179687}}}\n", "#metrics {\"StartTime\": 1687302801.0586145, \"EndTime\": 1687302801.0586228, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0789600830078125, \"count\": 1, \"min\": 1.0789600830078125, \"max\": 1.0789600830078125}}}\n", "#metrics {\"StartTime\": 1687302801.0586529, \"EndTime\": 1687302801.0586607, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9693126220703125, \"count\": 1, \"min\": 
0.9693126220703125, \"max\": 0.9693126220703125}}}\n", "#metrics {\"StartTime\": 1687302801.0586863, \"EndTime\": 1687302801.058694, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9535215454101562, \"count\": 1, \"min\": 0.9535215454101562, \"max\": 0.9535215454101562}}}\n", "#metrics {\"StartTime\": 1687302801.058723, \"EndTime\": 1687302801.058731, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9526671752929687, \"count\": 1, \"min\": 0.9526671752929687, \"max\": 0.9526671752929687}}}\n", "#metrics {\"StartTime\": 1687302801.0587606, \"EndTime\": 1687302801.0587687, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0269730224609375, \"count\": 1, \"min\": 1.0269730224609375, \"max\": 1.0269730224609375}}}\n", "#metrics {\"StartTime\": 1687302801.0588012, \"EndTime\": 1687302801.0588088, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9508773193359376, \"count\": 1, \"min\": 0.9508773193359376, \"max\": 0.9508773193359376}}}\n", "#metrics {\"StartTime\": 1687302801.0588334, \"EndTime\": 1687302801.0588408, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.050606689453125, \"count\": 1, \"min\": 1.050606689453125, \"max\": 1.050606689453125}}}\n", "#metrics {\"StartTime\": 1687302801.0588696, \"EndTime\": 1687302801.058877, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", 
\"Operation\": \"training\", \"epoch\": 0, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0049104614257813, \"count\": 1, \"min\": 1.0049104614257813, \"max\": 1.0049104614257813}}}\n", "#metrics {\"StartTime\": 1687302801.05891, \"EndTime\": 1687302801.0589185, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8938972778320312, \"count\": 1, \"min\": 0.8938972778320312, \"max\": 0.8938972778320312}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=0, train mse_objective =1.02216162109375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=0, criteria=mse_objective, value=0.8850604248046875\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 0: Loss improved. Updating best model\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 0\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpianesp2x/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 6.666666666666667 % of epochs\n", "#metrics {\"StartTime\": 1687302801.019128, \"EndTime\": 1687302801.0663705, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 0, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 3142.0, \"count\": 1, \"min\": 3142, \"max\": 3142}, \"Total Batches Seen\": {\"sum\": 5.0, \"count\": 1, \"min\": 5, \"max\": 5}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 3.0, \"count\": 1, \"min\": 3, \"max\": 3}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 
1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=22618.829728096676 records/second\n", "[2023-06-20 23:13:21.078] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 6, \"duration\": 11, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.0784597, \"EndTime\": 1687302801.0785086, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.006066162109375, \"count\": 1, \"min\": 1.006066162109375, \"max\": 1.006066162109375}}}\n", "#metrics {\"StartTime\": 1687302801.078565, \"EndTime\": 1687302801.0785763, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.985068603515625, \"count\": 1, \"min\": 0.985068603515625, \"max\": 0.985068603515625}}}\n", "#metrics {\"StartTime\": 1687302801.0786102, \"EndTime\": 1687302801.0786183, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0134525756835937, \"count\": 1, \"min\": 1.0134525756835937, \"max\": 1.0134525756835937}}}\n", "#metrics {\"StartTime\": 1687302801.0786555, \"EndTime\": 1687302801.0786638, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9882579956054688, \"count\": 1, \"min\": 0.9882579956054688, \"max\": 0.9882579956054688}}}\n", "#metrics {\"StartTime\": 1687302801.0786986, \"EndTime\": 1687302801.0787065, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": 
\"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9089137573242188, \"count\": 1, \"min\": 0.9089137573242188, \"max\": 0.9089137573242188}}}\n", "#metrics {\"StartTime\": 1687302801.0787408, \"EndTime\": 1687302801.0787497, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8862894897460938, \"count\": 1, \"min\": 0.8862894897460938, \"max\": 0.8862894897460938}}}\n", "#metrics {\"StartTime\": 1687302801.0787833, \"EndTime\": 1687302801.0787916, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8347337646484375, \"count\": 1, \"min\": 0.8347337646484375, \"max\": 0.8347337646484375}}}\n", "#metrics {\"StartTime\": 1687302801.0788262, \"EndTime\": 1687302801.078835, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.807532470703125, \"count\": 1, \"min\": 0.807532470703125, \"max\": 0.807532470703125}}}\n", "#metrics {\"StartTime\": 1687302801.078868, \"EndTime\": 1687302801.078876, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8729747314453125, \"count\": 1, \"min\": 0.8729747314453125, \"max\": 0.8729747314453125}}}\n", "#metrics {\"StartTime\": 1687302801.0789084, \"EndTime\": 1687302801.0789165, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9741072998046875, \"count\": 1, \"min\": 0.9741072998046875, \"max\": 
0.9741072998046875}}}\n", "#metrics {\"StartTime\": 1687302801.0789502, \"EndTime\": 1687302801.0789585, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0858118896484374, \"count\": 1, \"min\": 1.0858118896484374, \"max\": 1.0858118896484374}}}\n", "#metrics {\"StartTime\": 1687302801.078993, \"EndTime\": 1687302801.079002, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0068366088867187, \"count\": 1, \"min\": 1.0068366088867187, \"max\": 1.0068366088867187}}}\n", "#metrics {\"StartTime\": 1687302801.0790343, \"EndTime\": 1687302801.079043, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7468883056640625, \"count\": 1, \"min\": 0.7468883056640625, \"max\": 0.7468883056640625}}}\n", "#metrics {\"StartTime\": 1687302801.0790746, \"EndTime\": 1687302801.0790827, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8101138916015626, \"count\": 1, \"min\": 0.8101138916015626, \"max\": 0.8101138916015626}}}\n", "#metrics {\"StartTime\": 1687302801.0791144, \"EndTime\": 1687302801.0791228, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9052518310546875, \"count\": 1, \"min\": 0.9052518310546875, \"max\": 0.9052518310546875}}}\n", "#metrics {\"StartTime\": 1687302801.0791519, \"EndTime\": 1687302801.0791593, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", 
\"epoch\": 1, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7107618408203125, \"count\": 1, \"min\": 0.7107618408203125, \"max\": 0.7107618408203125}}}\n", "#metrics {\"StartTime\": 1687302801.0791898, \"EndTime\": 1687302801.0791974, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0356744384765626, \"count\": 1, \"min\": 1.0356744384765626, \"max\": 1.0356744384765626}}}\n", "#metrics {\"StartTime\": 1687302801.0792289, \"EndTime\": 1687302801.0792375, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0568455810546875, \"count\": 1, \"min\": 1.0568455810546875, \"max\": 1.0568455810546875}}}\n", "#metrics {\"StartTime\": 1687302801.079269, \"EndTime\": 1687302801.0792768, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0808736572265625, \"count\": 1, \"min\": 1.0808736572265625, \"max\": 1.0808736572265625}}}\n", "#metrics {\"StartTime\": 1687302801.079311, \"EndTime\": 1687302801.0793192, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9871458129882813, \"count\": 1, \"min\": 0.9871458129882813, \"max\": 0.9871458129882813}}}\n", "#metrics {\"StartTime\": 1687302801.0793529, \"EndTime\": 1687302801.079361, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.816015869140625, \"count\": 1, \"min\": 0.816015869140625, \"max\": 0.816015869140625}}}\n", "#metrics {\"StartTime\": 
1687302801.079394, \"EndTime\": 1687302801.0794024, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8192308959960938, \"count\": 1, \"min\": 0.8192308959960938, \"max\": 0.8192308959960938}}}\n", "#metrics {\"StartTime\": 1687302801.0794353, \"EndTime\": 1687302801.0794432, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7830580444335937, \"count\": 1, \"min\": 0.7830580444335937, \"max\": 0.7830580444335937}}}\n", "#metrics {\"StartTime\": 1687302801.0794754, \"EndTime\": 1687302801.0794837, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8611998901367187, \"count\": 1, \"min\": 0.8611998901367187, \"max\": 0.8611998901367187}}}\n", "#metrics {\"StartTime\": 1687302801.0795126, \"EndTime\": 1687302801.0795207, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9743131103515625, \"count\": 1, \"min\": 0.9743131103515625, \"max\": 0.9743131103515625}}}\n", "#metrics {\"StartTime\": 1687302801.079551, \"EndTime\": 1687302801.0795588, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9576375122070313, \"count\": 1, \"min\": 0.9576375122070313, \"max\": 0.9576375122070313}}}\n", "#metrics {\"StartTime\": 1687302801.079593, \"EndTime\": 1687302801.0796018, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 26}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 0.9543069458007812, \"count\": 1, \"min\": 0.9543069458007812, \"max\": 0.9543069458007812}}}\n", "#metrics {\"StartTime\": 1687302801.0796366, \"EndTime\": 1687302801.0796447, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0207877197265625, \"count\": 1, \"min\": 1.0207877197265625, \"max\": 1.0207877197265625}}}\n", "#metrics {\"StartTime\": 1687302801.0796764, \"EndTime\": 1687302801.079684, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1274150390625, \"count\": 1, \"min\": 1.1274150390625, \"max\": 1.1274150390625}}}\n", "#metrics {\"StartTime\": 1687302801.0797172, \"EndTime\": 1687302801.079725, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9528507690429687, \"count\": 1, \"min\": 0.9528507690429687, \"max\": 0.9528507690429687}}}\n", "#metrics {\"StartTime\": 1687302801.0797596, \"EndTime\": 1687302801.0797675, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.112237060546875, \"count\": 1, \"min\": 1.112237060546875, \"max\": 1.112237060546875}}}\n", "#metrics {\"StartTime\": 1687302801.0798094, \"EndTime\": 1687302801.0798178, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.26973046875, \"count\": 1, \"min\": 1.26973046875, \"max\": 1.26973046875}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=1, train 
mse_objective =1.006066162109375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=1, criteria=mse_objective, value=0.7107618408203125\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 1: Loss improved. Updating best model\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 1\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpxdds337n/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 13.333333333333334 % of epochs\n", "#metrics {\"StartTime\": 1687302801.0666046, \"EndTime\": 1687302801.0853987, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 1, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 4213.0, \"count\": 1, \"min\": 4213, \"max\": 4213}, \"Total Batches Seen\": {\"sum\": 7.0, \"count\": 1, \"min\": 7, \"max\": 7}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 4.0, \"count\": 1, \"min\": 4, \"max\": 4}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=56681.92936366733 records/second\n", "[2023-06-20 23:13:21.097] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 8, \"duration\": 11, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.097549, \"EndTime\": 1687302801.097597, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 0}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 0.9903468017578125, \"count\": 1, \"min\": 0.9903468017578125, \"max\": 0.9903468017578125}}}\n", "#metrics {\"StartTime\": 1687302801.0976484, \"EndTime\": 1687302801.0976589, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.97018798828125, \"count\": 1, \"min\": 0.97018798828125, \"max\": 0.97018798828125}}}\n", "#metrics {\"StartTime\": 1687302801.0976927, \"EndTime\": 1687302801.0977006, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.998368896484375, \"count\": 1, \"min\": 0.998368896484375, \"max\": 0.998368896484375}}}\n", "#metrics {\"StartTime\": 1687302801.097735, \"EndTime\": 1687302801.0977433, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9728997802734375, \"count\": 1, \"min\": 0.9728997802734375, \"max\": 0.9728997802734375}}}\n", "#metrics {\"StartTime\": 1687302801.0977771, \"EndTime\": 1687302801.0977857, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.86045703125, \"count\": 1, \"min\": 0.86045703125, \"max\": 0.86045703125}}}\n", "#metrics {\"StartTime\": 1687302801.0978177, \"EndTime\": 1687302801.097825, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9148922729492187, \"count\": 1, \"min\": 0.9148922729492187, \"max\": 0.9148922729492187}}}\n", "#metrics {\"StartTime\": 1687302801.0978572, \"EndTime\": 1687302801.097865, \"Dimensions\": 
{\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0937572021484374, \"count\": 1, \"min\": 1.0937572021484374, \"max\": 1.0937572021484374}}}\n", "#metrics {\"StartTime\": 1687302801.0979018, \"EndTime\": 1687302801.0979106, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8990514526367187, \"count\": 1, \"min\": 0.8990514526367187, \"max\": 0.8990514526367187}}}\n", "#metrics {\"StartTime\": 1687302801.0979445, \"EndTime\": 1687302801.097953, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8609124145507813, \"count\": 1, \"min\": 0.8609124145507813, \"max\": 0.8609124145507813}}}\n", "#metrics {\"StartTime\": 1687302801.097985, \"EndTime\": 1687302801.097993, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9586372680664063, \"count\": 1, \"min\": 0.9586372680664063, \"max\": 0.9586372680664063}}}\n", "#metrics {\"StartTime\": 1687302801.098026, \"EndTime\": 1687302801.098034, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.06913330078125, \"count\": 1, \"min\": 1.06913330078125, \"max\": 1.06913330078125}}}\n", "#metrics {\"StartTime\": 1687302801.098067, \"EndTime\": 1687302801.098075, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9909069213867188, \"count\": 1, \"min\": 
0.9909069213867188, \"max\": 0.9909069213867188}}}\n", "#metrics {\"StartTime\": 1687302801.0981095, \"EndTime\": 1687302801.098118, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1690634765625, \"count\": 1, \"min\": 1.1690634765625, \"max\": 1.1690634765625}}}\n", "#metrics {\"StartTime\": 1687302801.0981507, \"EndTime\": 1687302801.098159, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1389429931640624, \"count\": 1, \"min\": 1.1389429931640624, \"max\": 1.1389429931640624}}}\n", "#metrics {\"StartTime\": 1687302801.098191, \"EndTime\": 1687302801.0981994, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8999092407226562, \"count\": 1, \"min\": 0.8999092407226562, \"max\": 0.8999092407226562}}}\n", "#metrics {\"StartTime\": 1687302801.0982306, \"EndTime\": 1687302801.0982387, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1725975341796875, \"count\": 1, \"min\": 1.1725975341796875, \"max\": 1.1725975341796875}}}\n", "#metrics {\"StartTime\": 1687302801.098271, \"EndTime\": 1687302801.0982795, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0186034545898437, \"count\": 1, \"min\": 1.0186034545898437, \"max\": 1.0186034545898437}}}\n", "#metrics {\"StartTime\": 1687302801.098311, \"EndTime\": 1687302801.0983188, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", 
\"Operation\": \"training\", \"epoch\": 2, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0402691650390625, \"count\": 1, \"min\": 1.0402691650390625, \"max\": 1.0402691650390625}}}\n", "#metrics {\"StartTime\": 1687302801.0983486, \"EndTime\": 1687302801.0983562, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.062996826171875, \"count\": 1, \"min\": 1.062996826171875, \"max\": 1.062996826171875}}}\n", "#metrics {\"StartTime\": 1687302801.0983872, \"EndTime\": 1687302801.098395, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9714458618164062, \"count\": 1, \"min\": 0.9714458618164062, \"max\": 0.9714458618164062}}}\n", "#metrics {\"StartTime\": 1687302801.098427, \"EndTime\": 1687302801.098435, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.897747314453125, \"count\": 1, \"min\": 0.897747314453125, \"max\": 0.897747314453125}}}\n", "#metrics {\"StartTime\": 1687302801.098467, \"EndTime\": 1687302801.098475, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8461341552734375, \"count\": 1, \"min\": 0.8461341552734375, \"max\": 0.8461341552734375}}}\n", "#metrics {\"StartTime\": 1687302801.0985067, \"EndTime\": 1687302801.0985153, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9999383544921875, \"count\": 1, \"min\": 0.9999383544921875, \"max\": 0.9999383544921875}}}\n", "#metrics 
{\"StartTime\": 1687302801.0985441, \"EndTime\": 1687302801.0985518, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9619004516601563, \"count\": 1, \"min\": 0.9619004516601563, \"max\": 0.9619004516601563}}}\n", "#metrics {\"StartTime\": 1687302801.0985801, \"EndTime\": 1687302801.0985885, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9779675903320313, \"count\": 1, \"min\": 0.9779675903320313, \"max\": 0.9779675903320313}}}\n", "#metrics {\"StartTime\": 1687302801.0986216, \"EndTime\": 1687302801.09863, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.96242529296875, \"count\": 1, \"min\": 0.96242529296875, \"max\": 0.96242529296875}}}\n", "#metrics {\"StartTime\": 1687302801.0986652, \"EndTime\": 1687302801.0986736, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.95726220703125, \"count\": 1, \"min\": 0.95726220703125, \"max\": 0.95726220703125}}}\n", "#metrics {\"StartTime\": 1687302801.0987065, \"EndTime\": 1687302801.0987148, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0160242919921876, \"count\": 1, \"min\": 1.0160242919921876, \"max\": 1.0160242919921876}}}\n", "#metrics {\"StartTime\": 1687302801.0987473, \"EndTime\": 1687302801.0987556, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 28}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 1.3235482177734375, \"count\": 1, \"min\": 1.3235482177734375, \"max\": 1.3235482177734375}}}\n", "#metrics {\"StartTime\": 1687302801.0987897, \"EndTime\": 1687302801.0987985, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.2599205322265625, \"count\": 1, \"min\": 1.2599205322265625, \"max\": 1.2599205322265625}}}\n", "#metrics {\"StartTime\": 1687302801.0988312, \"EndTime\": 1687302801.098839, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.3516844482421875, \"count\": 1, \"min\": 1.3516844482421875, \"max\": 1.3516844482421875}}}\n", "#metrics {\"StartTime\": 1687302801.0988715, \"EndTime\": 1687302801.0988793, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.3527298583984375, \"count\": 1, \"min\": 1.3527298583984375, \"max\": 1.3527298583984375}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=2, train mse_objective =0.9903468017578125\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=2, criteria=mse_objective, value=0.8461341552734375\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 2\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpl1v3yw8j/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 20.0 % of epochs\n", "#metrics {\"StartTime\": 1687302801.085604, \"EndTime\": 1687302801.103807, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 2, \"Meta\": 
\"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 5284.0, \"count\": 1, \"min\": 5284, \"max\": 5284}, \"Total Batches Seen\": {\"sum\": 9.0, \"count\": 1, \"min\": 9, \"max\": 9}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 5.0, \"count\": 1, \"min\": 5, \"max\": 5}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=58522.88470257172 records/second\n", "[2023-06-20 23:13:21.115] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 10, \"duration\": 11, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.1156852, \"EndTime\": 1687302801.115734, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.974605224609375, \"count\": 1, \"min\": 0.974605224609375, \"max\": 0.974605224609375}}}\n", "#metrics {\"StartTime\": 1687302801.1158004, \"EndTime\": 1687302801.1158116, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.953135498046875, \"count\": 1, \"min\": 0.953135498046875, \"max\": 0.953135498046875}}}\n", "#metrics {\"StartTime\": 1687302801.115845, \"EndTime\": 1687302801.1158526, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.982017578125, \"count\": 1, \"min\": 
0.982017578125, \"max\": 0.982017578125}}}\n", "#metrics {\"StartTime\": 1687302801.1158805, \"EndTime\": 1687302801.115888, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.956477294921875, \"count\": 1, \"min\": 0.956477294921875, \"max\": 0.956477294921875}}}\n", "#metrics {\"StartTime\": 1687302801.1159203, \"EndTime\": 1687302801.115929, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7241810302734375, \"count\": 1, \"min\": 0.7241810302734375, \"max\": 0.7241810302734375}}}\n", "#metrics {\"StartTime\": 1687302801.1159606, \"EndTime\": 1687302801.115969, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6986046752929688, \"count\": 1, \"min\": 0.6986046752929688, \"max\": 0.6986046752929688}}}\n", "#metrics {\"StartTime\": 1687302801.116, \"EndTime\": 1687302801.1160076, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6737341918945312, \"count\": 1, \"min\": 0.6737341918945312, \"max\": 0.6737341918945312}}}\n", "#metrics {\"StartTime\": 1687302801.1160414, \"EndTime\": 1687302801.1160498, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7074252319335937, \"count\": 1, \"min\": 0.7074252319335937, \"max\": 0.7074252319335937}}}\n", "#metrics {\"StartTime\": 1687302801.116082, \"EndTime\": 1687302801.11609, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": 
\"training\", \"epoch\": 3, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.84929736328125, \"count\": 1, \"min\": 0.84929736328125, \"max\": 0.84929736328125}}}\n", "#metrics {\"StartTime\": 1687302801.1161237, \"EndTime\": 1687302801.116131, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9425160522460938, \"count\": 1, \"min\": 0.9425160522460938, \"max\": 0.9425160522460938}}}\n", "#metrics {\"StartTime\": 1687302801.1161628, \"EndTime\": 1687302801.1161704, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0502464599609376, \"count\": 1, \"min\": 1.0502464599609376, \"max\": 1.0502464599609376}}}\n", "#metrics {\"StartTime\": 1687302801.1162026, \"EndTime\": 1687302801.1162112, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9733524169921876, \"count\": 1, \"min\": 0.9733524169921876, \"max\": 0.9733524169921876}}}\n", "#metrics {\"StartTime\": 1687302801.1162453, \"EndTime\": 1687302801.1162536, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6739257202148438, \"count\": 1, \"min\": 0.6739257202148438, \"max\": 0.6739257202148438}}}\n", "#metrics {\"StartTime\": 1687302801.1162875, \"EndTime\": 1687302801.116295, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6684318237304687, \"count\": 1, \"min\": 0.6684318237304687, \"max\": 0.6684318237304687}}}\n", "#metrics 
{\"StartTime\": 1687302801.1163294, \"EndTime\": 1687302801.1163383, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7045728759765625, \"count\": 1, \"min\": 0.7045728759765625, \"max\": 0.7045728759765625}}}\n", "#metrics {\"StartTime\": 1687302801.116371, \"EndTime\": 1687302801.116379, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6841575317382812, \"count\": 1, \"min\": 0.6841575317382812, \"max\": 0.6841575317382812}}}\n", "#metrics {\"StartTime\": 1687302801.11644, \"EndTime\": 1687302801.11645, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0016835327148437, \"count\": 1, \"min\": 1.0016835327148437, \"max\": 1.0016835327148437}}}\n", "#metrics {\"StartTime\": 1687302801.1164856, \"EndTime\": 1687302801.1164944, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0234996337890625, \"count\": 1, \"min\": 1.0234996337890625, \"max\": 1.0234996337890625}}}\n", "#metrics {\"StartTime\": 1687302801.1165304, \"EndTime\": 1687302801.1165392, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0445482177734375, \"count\": 1, \"min\": 1.0445482177734375, \"max\": 1.0445482177734375}}}\n", "#metrics {\"StartTime\": 1687302801.116571, \"EndTime\": 1687302801.116579, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 19}, 
\"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9544625244140625, \"count\": 1, \"min\": 0.9544625244140625, \"max\": 0.9544625244140625}}}\n", "#metrics {\"StartTime\": 1687302801.1166077, \"EndTime\": 1687302801.1166162, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6853966064453125, \"count\": 1, \"min\": 0.6853966064453125, \"max\": 0.6853966064453125}}}\n", "#metrics {\"StartTime\": 1687302801.1166515, \"EndTime\": 1687302801.1166599, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6920762329101563, \"count\": 1, \"min\": 0.6920762329101563, \"max\": 0.6920762329101563}}}\n", "#metrics {\"StartTime\": 1687302801.1166942, \"EndTime\": 1687302801.1167028, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6565408325195312, \"count\": 1, \"min\": 0.6565408325195312, \"max\": 0.6565408325195312}}}\n", "#metrics {\"StartTime\": 1687302801.1167374, \"EndTime\": 1687302801.1167457, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6762831420898437, \"count\": 1, \"min\": 0.6762831420898437, \"max\": 0.6762831420898437}}}\n", "#metrics {\"StartTime\": 1687302801.116779, \"EndTime\": 1687302801.1167874, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9790867309570312, \"count\": 1, \"min\": 0.9790867309570312, \"max\": 0.9790867309570312}}}\n", "#metrics {\"StartTime\": 1687302801.116816, \"EndTime\": 
1687302801.1168242, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9662904052734375, \"count\": 1, \"min\": 0.9662904052734375, \"max\": 0.9662904052734375}}}\n", "#metrics {\"StartTime\": 1687302801.116856, \"EndTime\": 1687302801.116865, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9580093994140625, \"count\": 1, \"min\": 0.9580093994140625, \"max\": 0.9580093994140625}}}\n", "#metrics {\"StartTime\": 1687302801.1168995, \"EndTime\": 1687302801.116908, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0091856689453125, \"count\": 1, \"min\": 1.0091856689453125, \"max\": 1.0091856689453125}}}\n", "#metrics {\"StartTime\": 1687302801.1169438, \"EndTime\": 1687302801.1169522, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.018538818359375, \"count\": 1, \"min\": 1.018538818359375, \"max\": 1.018538818359375}}}\n", "#metrics {\"StartTime\": 1687302801.1169844, \"EndTime\": 1687302801.1169927, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0114102172851562, \"count\": 1, \"min\": 1.0114102172851562, \"max\": 1.0114102172851562}}}\n", "#metrics {\"StartTime\": 1687302801.1170273, \"EndTime\": 1687302801.1170356, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 
0.97855810546875, \"count\": 1, \"min\": 0.97855810546875, \"max\": 0.97855810546875}}}\n", "#metrics {\"StartTime\": 1687302801.1170688, \"EndTime\": 1687302801.1170774, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9770177001953125, \"count\": 1, \"min\": 0.9770177001953125, \"max\": 0.9770177001953125}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=3, train mse_objective =0.974605224609375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=3, criteria=mse_objective, value=0.6565408325195312\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 3: Loss improved. Updating best model\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 3\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpoqgtezyr/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 26.666666666666668 % of epochs\n", "#metrics {\"StartTime\": 1687302801.104007, \"EndTime\": 1687302801.1239574, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 3, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 6355.0, \"count\": 1, \"min\": 6355, \"max\": 6355}, \"Total Batches Seen\": {\"sum\": 11.0, \"count\": 1, \"min\": 11, \"max\": 11}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 6.0, \"count\": 1, \"min\": 6, \"max\": 6}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", 
"[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=53415.69359192361 records/second\n", "[2023-06-20 23:13:21.144] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 12, \"duration\": 19, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.144264, \"EndTime\": 1687302801.1443243, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9594590454101563, \"count\": 1, \"min\": 0.9594590454101563, \"max\": 0.9594590454101563}}}\n", "#metrics {\"StartTime\": 1687302801.1443899, \"EndTime\": 1687302801.1444023, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9384002685546875, \"count\": 1, \"min\": 0.9384002685546875, \"max\": 0.9384002685546875}}}\n", "#metrics {\"StartTime\": 1687302801.144468, \"EndTime\": 1687302801.1444788, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9673231201171875, \"count\": 1, \"min\": 0.9673231201171875, \"max\": 0.9673231201171875}}}\n", "#metrics {\"StartTime\": 1687302801.144514, \"EndTime\": 1687302801.1445236, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9415489501953125, \"count\": 1, \"min\": 0.9415489501953125, \"max\": 0.9415489501953125}}}\n", "#metrics {\"StartTime\": 1687302801.14456, \"EndTime\": 1687302801.1445696, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": 
{\"sum\": 0.7144944458007813, \"count\": 1, \"min\": 0.7144944458007813, \"max\": 0.7144944458007813}}}\n", "#metrics {\"StartTime\": 1687302801.144604, \"EndTime\": 1687302801.1446123, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7529827270507813, \"count\": 1, \"min\": 0.7529827270507813, \"max\": 0.7529827270507813}}}\n", "#metrics {\"StartTime\": 1687302801.1446476, \"EndTime\": 1687302801.1446564, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8737408447265625, \"count\": 1, \"min\": 0.8737408447265625, \"max\": 0.8737408447265625}}}\n", "#metrics {\"StartTime\": 1687302801.1446888, \"EndTime\": 1687302801.1446977, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7503407592773438, \"count\": 1, \"min\": 0.7503407592773438, \"max\": 0.7503407592773438}}}\n", "#metrics {\"StartTime\": 1687302801.14473, \"EndTime\": 1687302801.144739, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8378130493164062, \"count\": 1, \"min\": 0.8378130493164062, \"max\": 0.8378130493164062}}}\n", "#metrics {\"StartTime\": 1687302801.144775, \"EndTime\": 1687302801.144784, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9275397338867187, \"count\": 1, \"min\": 0.9275397338867187, \"max\": 0.9275397338867187}}}\n", "#metrics {\"StartTime\": 1687302801.144818, \"EndTime\": 1687302801.1448267, \"Dimensions\": 
{\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.033701171875, \"count\": 1, \"min\": 1.033701171875, \"max\": 1.033701171875}}}\n", "#metrics {\"StartTime\": 1687302801.144861, \"EndTime\": 1687302801.1448693, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.957732666015625, \"count\": 1, \"min\": 0.957732666015625, \"max\": 0.957732666015625}}}\n", "#metrics {\"StartTime\": 1687302801.1449027, \"EndTime\": 1687302801.144911, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9273709716796875, \"count\": 1, \"min\": 0.9273709716796875, \"max\": 0.9273709716796875}}}\n", "#metrics {\"StartTime\": 1687302801.144945, \"EndTime\": 1687302801.144953, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8939179077148437, \"count\": 1, \"min\": 0.8939179077148437, \"max\": 0.8939179077148437}}}\n", "#metrics {\"StartTime\": 1687302801.1449869, \"EndTime\": 1687302801.1449957, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7269241333007812, \"count\": 1, \"min\": 0.7269241333007812, \"max\": 0.7269241333007812}}}\n", "#metrics {\"StartTime\": 1687302801.1450284, \"EndTime\": 1687302801.1450365, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9409971923828125, \"count\": 1, \"min\": 
0.9409971923828125, \"max\": 0.9409971923828125}}}\n", "#metrics {\"StartTime\": 1687302801.1450703, \"EndTime\": 1687302801.1450794, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9852452392578125, \"count\": 1, \"min\": 0.9852452392578125, \"max\": 0.9852452392578125}}}\n", "#metrics {\"StartTime\": 1687302801.145115, \"EndTime\": 1687302801.145123, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0075147705078125, \"count\": 1, \"min\": 1.0075147705078125, \"max\": 1.0075147705078125}}}\n", "#metrics {\"StartTime\": 1687302801.1451576, \"EndTime\": 1687302801.1451664, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.027198974609375, \"count\": 1, \"min\": 1.027198974609375, \"max\": 1.027198974609375}}}\n", "#metrics {\"StartTime\": 1687302801.1451993, \"EndTime\": 1687302801.1452081, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9391788940429687, \"count\": 1, \"min\": 0.9391788940429687, \"max\": 0.9391788940429687}}}\n", "#metrics {\"StartTime\": 1687302801.1452408, \"EndTime\": 1687302801.1452487, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6889517822265625, \"count\": 1, \"min\": 0.6889517822265625, \"max\": 0.6889517822265625}}}\n", "#metrics {\"StartTime\": 1687302801.1452818, \"EndTime\": 1687302801.1452916, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", 
\"Operation\": \"training\", \"epoch\": 4, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6707691650390625, \"count\": 1, \"min\": 0.6707691650390625, \"max\": 0.6707691650390625}}}\n", "#metrics {\"StartTime\": 1687302801.1453261, \"EndTime\": 1687302801.1453347, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7683834838867187, \"count\": 1, \"min\": 0.7683834838867187, \"max\": 0.7683834838867187}}}\n", "#metrics {\"StartTime\": 1687302801.1453688, \"EndTime\": 1687302801.1453774, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7269542236328125, \"count\": 1, \"min\": 0.7269542236328125, \"max\": 0.7269542236328125}}}\n", "#metrics {\"StartTime\": 1687302801.1454115, \"EndTime\": 1687302801.1454196, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9786641845703125, \"count\": 1, \"min\": 0.9786641845703125, \"max\": 0.9786641845703125}}}\n", "#metrics {\"StartTime\": 1687302801.145454, \"EndTime\": 1687302801.1454618, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9710018920898438, \"count\": 1, \"min\": 0.9710018920898438, \"max\": 0.9710018920898438}}}\n", "#metrics {\"StartTime\": 1687302801.145494, \"EndTime\": 1687302801.1455026, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.960674560546875, \"count\": 1, \"min\": 0.960674560546875, \"max\": 0.960674560546875}}}\n", 
"#metrics {\"StartTime\": 1687302801.1455367, \"EndTime\": 1687302801.1455462, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0044139404296875, \"count\": 1, \"min\": 1.0044139404296875, \"max\": 1.0044139404296875}}}\n", "#metrics {\"StartTime\": 1687302801.1455803, \"EndTime\": 1687302801.1455886, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0411546630859374, \"count\": 1, \"min\": 1.0411546630859374, \"max\": 1.0411546630859374}}}\n", "#metrics {\"StartTime\": 1687302801.1456227, \"EndTime\": 1687302801.1456313, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1614339599609376, \"count\": 1, \"min\": 1.1614339599609376, \"max\": 1.1614339599609376}}}\n", "#metrics {\"StartTime\": 1687302801.1456637, \"EndTime\": 1687302801.1456718, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1201153564453126, \"count\": 1, \"min\": 1.1201153564453126, \"max\": 1.1201153564453126}}}\n", "#metrics {\"StartTime\": 1687302801.145706, \"EndTime\": 1687302801.1457145, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9459694213867188, \"count\": 1, \"min\": 0.9459694213867188, \"max\": 0.9459694213867188}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=4, train mse_objective =0.9594590454101563\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: 
host=algo-1, epoch=4, criteria=mse_objective, value=0.6707691650390625\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 4\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmp7wurr3tr/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 33.333333333333336 % of epochs\n", "#metrics {\"StartTime\": 1687302801.1241736, \"EndTime\": 1687302801.1575854, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 4, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 7426.0, \"count\": 1, \"min\": 7426, \"max\": 7426}, \"Total Batches Seen\": {\"sum\": 13.0, \"count\": 1, \"min\": 13, \"max\": 13}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 7.0, \"count\": 1, \"min\": 7, \"max\": 7}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=31970.646188446128 records/second\n", "[2023-06-20 23:13:21.172] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 14, \"duration\": 14, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.1721487, \"EndTime\": 1687302801.1721923, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9451370849609375, \"count\": 1, \"min\": 0.9451370849609375, \"max\": 0.9451370849609375}}}\n", "#metrics {\"StartTime\": 1687302801.1722748, \"EndTime\": 
1687302801.1722946, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9238663940429688, \"count\": 1, \"min\": 0.9238663940429688, \"max\": 0.9238663940429688}}}\n", "#metrics {\"StartTime\": 1687302801.1723247, \"EndTime\": 1687302801.172331, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.95290185546875, \"count\": 1, \"min\": 0.95290185546875, \"max\": 0.95290185546875}}}\n", "#metrics {\"StartTime\": 1687302801.1723673, \"EndTime\": 1687302801.1723742, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9269659423828125, \"count\": 1, \"min\": 0.9269659423828125, \"max\": 0.9269659423828125}}}\n", "#metrics {\"StartTime\": 1687302801.1724253, \"EndTime\": 1687302801.1724334, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8364293212890626, \"count\": 1, \"min\": 0.8364293212890626, \"max\": 0.8364293212890626}}}\n", "#metrics {\"StartTime\": 1687302801.1724598, \"EndTime\": 1687302801.172465, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8619542236328125, \"count\": 1, \"min\": 0.8619542236328125, \"max\": 0.8619542236328125}}}\n", "#metrics {\"StartTime\": 1687302801.1724856, \"EndTime\": 1687302801.1725023, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 
0.9758377685546875, \"count\": 1, \"min\": 0.9758377685546875, \"max\": 0.9758377685546875}}}\n", "#metrics {\"StartTime\": 1687302801.1725254, \"EndTime\": 1687302801.172531, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8314310302734375, \"count\": 1, \"min\": 0.8314310302734375, \"max\": 0.8314310302734375}}}\n", "#metrics {\"StartTime\": 1687302801.1725643, \"EndTime\": 1687302801.1725702, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8271822509765625, \"count\": 1, \"min\": 0.8271822509765625, \"max\": 0.8271822509765625}}}\n", "#metrics {\"StartTime\": 1687302801.1725934, \"EndTime\": 1687302801.1725986, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.91309033203125, \"count\": 1, \"min\": 0.91309033203125, \"max\": 0.91309033203125}}}\n", "#metrics {\"StartTime\": 1687302801.1726394, \"EndTime\": 1687302801.1726458, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.01736181640625, \"count\": 1, \"min\": 1.01736181640625, \"max\": 1.01736181640625}}}\n", "#metrics {\"StartTime\": 1687302801.1726713, \"EndTime\": 1687302801.1726763, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9423170166015625, \"count\": 1, \"min\": 0.9423170166015625, \"max\": 0.9423170166015625}}}\n", "#metrics {\"StartTime\": 1687302801.172711, \"EndTime\": 1687302801.1727252, \"Dimensions\": {\"Algorithm\": \"Linear 
Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.058690673828125, \"count\": 1, \"min\": 1.058690673828125, \"max\": 1.058690673828125}}}\n", "#metrics {\"StartTime\": 1687302801.1727512, \"EndTime\": 1687302801.172757, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0132245483398437, \"count\": 1, \"min\": 1.0132245483398437, \"max\": 1.0132245483398437}}}\n", "#metrics {\"StartTime\": 1687302801.1727808, \"EndTime\": 1687302801.1727862, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8470865478515625, \"count\": 1, \"min\": 0.8470865478515625, \"max\": 0.8470865478515625}}}\n", "#metrics {\"StartTime\": 1687302801.1728194, \"EndTime\": 1687302801.1728253, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0637578125, \"count\": 1, \"min\": 1.0637578125, \"max\": 1.0637578125}}}\n", "#metrics {\"StartTime\": 1687302801.1728451, \"EndTime\": 1687302801.1728501, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.96971435546875, \"count\": 1, \"min\": 0.96971435546875, \"max\": 0.96971435546875}}}\n", "#metrics {\"StartTime\": 1687302801.1728806, \"EndTime\": 1687302801.172887, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9923033447265625, \"count\": 1, \"min\": 0.9923033447265625, \"max\": 
0.9923033447265625}}}\n", "#metrics {\"StartTime\": 1687302801.1729116, \"EndTime\": 1687302801.172917, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0104291381835937, \"count\": 1, \"min\": 1.0104291381835937, \"max\": 1.0104291381835937}}}\n", "#metrics {\"StartTime\": 1687302801.1729372, \"EndTime\": 1687302801.1729531, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.92418798828125, \"count\": 1, \"min\": 0.92418798828125, \"max\": 0.92418798828125}}}\n", "#metrics {\"StartTime\": 1687302801.1729755, \"EndTime\": 1687302801.172981, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7785425415039062, \"count\": 1, \"min\": 0.7785425415039062, \"max\": 0.7785425415039062}}}\n", "#metrics {\"StartTime\": 1687302801.1730149, \"EndTime\": 1687302801.1730208, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.760369384765625, \"count\": 1, \"min\": 0.760369384765625, \"max\": 0.760369384765625}}}\n", "#metrics {\"StartTime\": 1687302801.1730435, \"EndTime\": 1687302801.1730487, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8294029541015625, \"count\": 1, \"min\": 0.8294029541015625, \"max\": 0.8294029541015625}}}\n", "#metrics {\"StartTime\": 1687302801.1730828, \"EndTime\": 1687302801.173089, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", 
\"epoch\": 5, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7884896240234375, \"count\": 1, \"min\": 0.7884896240234375, \"max\": 0.7884896240234375}}}\n", "#metrics {\"StartTime\": 1687302801.173112, \"EndTime\": 1687302801.1731172, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9778370361328125, \"count\": 1, \"min\": 0.9778370361328125, \"max\": 0.9778370361328125}}}\n", "#metrics {\"StartTime\": 1687302801.173151, \"EndTime\": 1687302801.1731575, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9757728881835938, \"count\": 1, \"min\": 0.9757728881835938, \"max\": 0.9757728881835938}}}\n", "#metrics {\"StartTime\": 1687302801.1731904, \"EndTime\": 1687302801.1731963, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.963368408203125, \"count\": 1, \"min\": 0.963368408203125, \"max\": 0.963368408203125}}}\n", "#metrics {\"StartTime\": 1687302801.1732168, \"EndTime\": 1687302801.1732218, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.99982763671875, \"count\": 1, \"min\": 0.99982763671875, \"max\": 0.99982763671875}}}\n", "#metrics {\"StartTime\": 1687302801.1732554, \"EndTime\": 1687302801.1732616, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.088587890625, \"count\": 1, \"min\": 1.088587890625, \"max\": 1.088587890625}}}\n", "#metrics {\"StartTime\": 1687302801.1732962, 
\"EndTime\": 1687302801.173302, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.2275877685546874, \"count\": 1, \"min\": 1.2275877685546874, \"max\": 1.2275877685546874}}}\n", "#metrics {\"StartTime\": 1687302801.1733246, \"EndTime\": 1687302801.1733296, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.18158935546875, \"count\": 1, \"min\": 1.18158935546875, \"max\": 1.18158935546875}}}\n", "#metrics {\"StartTime\": 1687302801.1733613, \"EndTime\": 1687302801.1733677, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0184425048828125, \"count\": 1, \"min\": 1.0184425048828125, \"max\": 1.0184425048828125}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=5, train mse_objective =0.9451370849609375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=5, criteria=mse_objective, value=0.760369384765625\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 5\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpbstleg6c/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 40.0 % of epochs\n", "#metrics {\"StartTime\": 1687302801.1577897, \"EndTime\": 1687302801.1793075, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 5, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 8497.0, \"count\": 1, \"min\": 8497, \"max\": 8497}, \"Total Batches Seen\": {\"sum\": 15.0, \"count\": 1, \"min\": 15, \"max\": 
15}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 8.0, \"count\": 1, \"min\": 8, \"max\": 8}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=49535.74593092497 records/second\n", "[2023-06-20 23:13:21.194] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 16, \"duration\": 15, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.1947472, \"EndTime\": 1687302801.1947877, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9309275512695312, \"count\": 1, \"min\": 0.9309275512695312, \"max\": 0.9309275512695312}}}\n", "#metrics {\"StartTime\": 1687302801.1948328, \"EndTime\": 1687302801.1948707, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9089638061523437, \"count\": 1, \"min\": 0.9089638061523437, \"max\": 0.9089638061523437}}}\n", "#metrics {\"StartTime\": 1687302801.1949139, \"EndTime\": 1687302801.194921, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.938397705078125, \"count\": 1, \"min\": 0.938397705078125, \"max\": 0.938397705078125}}}\n", "#metrics {\"StartTime\": 1687302801.1949487, \"EndTime\": 1687302801.1949544, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": 
\"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.912354248046875, \"count\": 1, \"min\": 0.912354248046875, \"max\": 0.912354248046875}}}\n", "#metrics {\"StartTime\": 1687302801.1949875, \"EndTime\": 1687302801.1949937, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7815601196289063, \"count\": 1, \"min\": 0.7815601196289063, \"max\": 0.7815601196289063}}}\n", "#metrics {\"StartTime\": 1687302801.1950197, \"EndTime\": 1687302801.1950254, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7760836181640625, \"count\": 1, \"min\": 0.7760836181640625, \"max\": 0.7760836181640625}}}\n", "#metrics {\"StartTime\": 1687302801.1950626, \"EndTime\": 1687302801.1950684, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8089533081054687, \"count\": 1, \"min\": 0.8089533081054687, \"max\": 0.8089533081054687}}}\n", "#metrics {\"StartTime\": 1687302801.195105, \"EndTime\": 1687302801.1951106, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7221036987304688, \"count\": 1, \"min\": 0.7221036987304688, \"max\": 0.7221036987304688}}}\n", "#metrics {\"StartTime\": 1687302801.1951468, \"EndTime\": 1687302801.1951523, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8167568359375, \"count\": 1, \"min\": 0.8167568359375, \"max\": 0.8167568359375}}}\n", 
"#metrics {\"StartTime\": 1687302801.1951756, \"EndTime\": 1687302801.195181, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8986716918945312, \"count\": 1, \"min\": 0.8986716918945312, \"max\": 0.8986716918945312}}}\n", "#metrics {\"StartTime\": 1687302801.195201, \"EndTime\": 1687302801.1952062, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.000641357421875, \"count\": 1, \"min\": 1.000641357421875, \"max\": 1.000641357421875}}}\n", "#metrics {\"StartTime\": 1687302801.195227, \"EndTime\": 1687302801.195232, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9267261352539062, \"count\": 1, \"min\": 0.9267261352539062, \"max\": 0.9267261352539062}}}\n", "#metrics {\"StartTime\": 1687302801.195252, \"EndTime\": 1687302801.1952572, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8447373657226562, \"count\": 1, \"min\": 0.8447373657226562, \"max\": 0.8447373657226562}}}\n", "#metrics {\"StartTime\": 1687302801.1952794, \"EndTime\": 1687302801.1952844, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8298294067382812, \"count\": 1, \"min\": 0.8298294067382812, \"max\": 0.8298294067382812}}}\n", "#metrics {\"StartTime\": 1687302801.195307, \"EndTime\": 1687302801.1953125, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 14}, 
\"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7799320678710937, \"count\": 1, \"min\": 0.7799320678710937, \"max\": 0.7799320678710937}}}\n", "#metrics {\"StartTime\": 1687302801.1953347, \"EndTime\": 1687302801.1953397, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.83313525390625, \"count\": 1, \"min\": 0.83313525390625, \"max\": 0.83313525390625}}}\n", "#metrics {\"StartTime\": 1687302801.1953595, \"EndTime\": 1687302801.1953642, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9543753662109375, \"count\": 1, \"min\": 0.9543753662109375, \"max\": 0.9543753662109375}}}\n", "#metrics {\"StartTime\": 1687302801.1953843, \"EndTime\": 1687302801.1953893, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9772023315429688, \"count\": 1, \"min\": 0.9772023315429688, \"max\": 0.9772023315429688}}}\n", "#metrics {\"StartTime\": 1687302801.195409, \"EndTime\": 1687302801.1954143, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9937194213867188, \"count\": 1, \"min\": 0.9937194213867188, \"max\": 0.9937194213867188}}}\n", "#metrics {\"StartTime\": 1687302801.1954343, \"EndTime\": 1687302801.1954393, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9091443481445313, \"count\": 1, \"min\": 0.9091443481445313, \"max\": 0.9091443481445313}}}\n", "#metrics {\"StartTime\": 1687302801.1954594, \"EndTime\": 
1687302801.1954641, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7145595703125, \"count\": 1, \"min\": 0.7145595703125, \"max\": 0.7145595703125}}}\n", "#metrics {\"StartTime\": 1687302801.195484, \"EndTime\": 1687302801.195489, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.70767626953125, \"count\": 1, \"min\": 0.70767626953125, \"max\": 0.70767626953125}}}\n", "#metrics {\"StartTime\": 1687302801.1955087, \"EndTime\": 1687302801.195514, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6988079833984375, \"count\": 1, \"min\": 0.6988079833984375, \"max\": 0.6988079833984375}}}\n", "#metrics {\"StartTime\": 1687302801.1955335, \"EndTime\": 1687302801.1955388, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6768855590820313, \"count\": 1, \"min\": 0.6768855590820313, \"max\": 0.6768855590820313}}}\n", "#metrics {\"StartTime\": 1687302801.1955585, \"EndTime\": 1687302801.1955638, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9764688720703125, \"count\": 1, \"min\": 0.9764688720703125, \"max\": 0.9764688720703125}}}\n", "#metrics {\"StartTime\": 1687302801.1955836, \"EndTime\": 1687302801.1955886, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.979979248046875, 
\"count\": 1, \"min\": 0.979979248046875, \"max\": 0.979979248046875}}}\n", "#metrics {\"StartTime\": 1687302801.1956084, \"EndTime\": 1687302801.1956134, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9657122192382812, \"count\": 1, \"min\": 0.9657122192382812, \"max\": 0.9657122192382812}}}\n", "#metrics {\"StartTime\": 1687302801.1956356, \"EndTime\": 1687302801.1956406, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9950216064453125, \"count\": 1, \"min\": 0.9950216064453125, \"max\": 0.9950216064453125}}}\n", "#metrics {\"StartTime\": 1687302801.1956642, \"EndTime\": 1687302801.1956692, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9844647827148437, \"count\": 1, \"min\": 0.9844647827148437, \"max\": 0.9844647827148437}}}\n", "#metrics {\"StartTime\": 1687302801.1956894, \"EndTime\": 1687302801.1956947, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0702366943359376, \"count\": 1, \"min\": 1.0702366943359376, \"max\": 1.0702366943359376}}}\n", "#metrics {\"StartTime\": 1687302801.1957145, \"EndTime\": 1687302801.1957197, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.028525146484375, \"count\": 1, \"min\": 1.028525146484375, \"max\": 1.028525146484375}}}\n", "#metrics {\"StartTime\": 1687302801.1957412, \"EndTime\": 1687302801.195746, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", 
\"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9692490234375, \"count\": 1, \"min\": 0.9692490234375, \"max\": 0.9692490234375}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=6, train mse_objective =0.9309275512695312\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=6, criteria=mse_objective, value=0.6768855590820313\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 6\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpbuown91l/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 46.666666666666664 % of epochs\n", "#metrics {\"StartTime\": 1687302801.1795163, \"EndTime\": 1687302801.201611, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 6, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 9568.0, \"count\": 1, \"min\": 9568, \"max\": 9568}, \"Total Batches Seen\": {\"sum\": 17.0, \"count\": 1, \"min\": 17, \"max\": 17}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 9.0, \"count\": 1, \"min\": 9, \"max\": 9}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=48283.45569456984 records/second\n", "[2023-06-20 23:13:21.215] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 18, \"duration\": 14, \"num_examples\": 2, \"num_bytes\": 68544}\n", 
"#metrics {\"StartTime\": 1687302801.2159395, \"EndTime\": 1687302801.2159803, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9171712646484375, \"count\": 1, \"min\": 0.9171712646484375, \"max\": 0.9171712646484375}}}\n", "#metrics {\"StartTime\": 1687302801.2160478, \"EndTime\": 1687302801.2160563, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.894685302734375, \"count\": 1, \"min\": 0.894685302734375, \"max\": 0.894685302734375}}}\n", "#metrics {\"StartTime\": 1687302801.216086, \"EndTime\": 1687302801.216092, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9246111450195312, \"count\": 1, \"min\": 0.9246111450195312, \"max\": 0.9246111450195312}}}\n", "#metrics {\"StartTime\": 1687302801.2161171, \"EndTime\": 1687302801.2161229, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8984644165039063, \"count\": 1, \"min\": 0.8984644165039063, \"max\": 0.8984644165039063}}}\n", "#metrics {\"StartTime\": 1687302801.2161465, \"EndTime\": 1687302801.2161517, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7032120971679687, \"count\": 1, \"min\": 0.7032120971679687, \"max\": 0.7032120971679687}}}\n", "#metrics {\"StartTime\": 1687302801.2161727, \"EndTime\": 1687302801.216178, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 5}, 
\"Metrics\": {\"train_mse_objective\": {\"sum\": 0.725182373046875, \"count\": 1, \"min\": 0.725182373046875, \"max\": 0.725182373046875}}}\n", "#metrics {\"StartTime\": 1687302801.2162, \"EndTime\": 1687302801.2162056, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7236519165039063, \"count\": 1, \"min\": 0.7236519165039063, \"max\": 0.7236519165039063}}}\n", "#metrics {\"StartTime\": 1687302801.216226, \"EndTime\": 1687302801.216231, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6636129760742188, \"count\": 1, \"min\": 0.6636129760742188, \"max\": 0.6636129760742188}}}\n", "#metrics {\"StartTime\": 1687302801.216253, \"EndTime\": 1687302801.216258, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8064939575195312, \"count\": 1, \"min\": 0.8064939575195312, \"max\": 0.8064939575195312}}}\n", "#metrics {\"StartTime\": 1687302801.21628, \"EndTime\": 1687302801.2162855, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8849080810546875, \"count\": 1, \"min\": 0.8849080810546875, \"max\": 0.8849080810546875}}}\n", "#metrics {\"StartTime\": 1687302801.2163086, \"EndTime\": 1687302801.2163136, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9845374145507813, \"count\": 1, \"min\": 0.9845374145507813, \"max\": 0.9845374145507813}}}\n", "#metrics {\"StartTime\": 1687302801.2163377, \"EndTime\": 
1687302801.2163427, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9118574829101562, \"count\": 1, \"min\": 0.9118574829101562, \"max\": 0.9118574829101562}}}\n", "#metrics {\"StartTime\": 1687302801.2163663, \"EndTime\": 1687302801.2163718, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6837935791015625, \"count\": 1, \"min\": 0.6837935791015625, \"max\": 0.6837935791015625}}}\n", "#metrics {\"StartTime\": 1687302801.2163913, \"EndTime\": 1687302801.2163966, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.713102783203125, \"count\": 1, \"min\": 0.713102783203125, \"max\": 0.713102783203125}}}\n", "#metrics {\"StartTime\": 1687302801.216459, \"EndTime\": 1687302801.2164664, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7206134033203125, \"count\": 1, \"min\": 0.7206134033203125, \"max\": 0.7206134033203125}}}\n", "#metrics {\"StartTime\": 1687302801.216491, \"EndTime\": 1687302801.2164965, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6623385620117187, \"count\": 1, \"min\": 0.6623385620117187, \"max\": 0.6623385620117187}}}\n", "#metrics {\"StartTime\": 1687302801.216516, \"EndTime\": 1687302801.2165215, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 
0.9394664916992187, \"count\": 1, \"min\": 0.9394664916992187, \"max\": 0.9394664916992187}}}\n", "#metrics {\"StartTime\": 1687302801.2165413, \"EndTime\": 1687302801.2165463, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.962658935546875, \"count\": 1, \"min\": 0.962658935546875, \"max\": 0.962658935546875}}}\n", "#metrics {\"StartTime\": 1687302801.2165658, \"EndTime\": 1687302801.216571, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9776746215820312, \"count\": 1, \"min\": 0.9776746215820312, \"max\": 0.9776746215820312}}}\n", "#metrics {\"StartTime\": 1687302801.2165945, \"EndTime\": 1687302801.2165995, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.894867919921875, \"count\": 1, \"min\": 0.894867919921875, \"max\": 0.894867919921875}}}\n", "#metrics {\"StartTime\": 1687302801.2166197, \"EndTime\": 1687302801.2166247, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.701564453125, \"count\": 1, \"min\": 0.701564453125, \"max\": 0.701564453125}}}\n", "#metrics {\"StartTime\": 1687302801.2166445, \"EndTime\": 1687302801.2166495, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6980833129882813, \"count\": 1, \"min\": 0.6980833129882813, \"max\": 0.6980833129882813}}}\n", "#metrics {\"StartTime\": 1687302801.2166693, \"EndTime\": 1687302801.2166743, \"Dimensions\": {\"Algorithm\": \"Linear 
Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6879876098632812, \"count\": 1, \"min\": 0.6879876098632812, \"max\": 0.6879876098632812}}}\n", "#metrics {\"StartTime\": 1687302801.2166965, \"EndTime\": 1687302801.216702, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.659395751953125, \"count\": 1, \"min\": 0.659395751953125, \"max\": 0.659395751953125}}}\n", "#metrics {\"StartTime\": 1687302801.216724, \"EndTime\": 1687302801.2167292, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9754028930664063, \"count\": 1, \"min\": 0.9754028930664063, \"max\": 0.9754028930664063}}}\n", "#metrics {\"StartTime\": 1687302801.216749, \"EndTime\": 1687302801.216754, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9841283569335938, \"count\": 1, \"min\": 0.9841283569335938, \"max\": 0.9841283569335938}}}\n", "#metrics {\"StartTime\": 1687302801.2167737, \"EndTime\": 1687302801.2167788, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9687301025390626, \"count\": 1, \"min\": 0.9687301025390626, \"max\": 0.9687301025390626}}}\n", "#metrics {\"StartTime\": 1687302801.2167993, \"EndTime\": 1687302801.2168043, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9909588623046875, \"count\": 1, \"min\": 0.9909588623046875, 
\"max\": 0.9909588623046875}}}\n", "#metrics {\"StartTime\": 1687302801.2168264, \"EndTime\": 1687302801.2168314, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9844620971679687, \"count\": 1, \"min\": 0.9844620971679687, \"max\": 0.9844620971679687}}}\n", "#metrics {\"StartTime\": 1687302801.2168515, \"EndTime\": 1687302801.2168565, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0048580932617188, \"count\": 1, \"min\": 1.0048580932617188, \"max\": 1.0048580932617188}}}\n", "#metrics {\"StartTime\": 1687302801.2168763, \"EndTime\": 1687302801.2168815, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9952666625976563, \"count\": 1, \"min\": 0.9952666625976563, \"max\": 0.9952666625976563}}}\n", "#metrics {\"StartTime\": 1687302801.2169044, \"EndTime\": 1687302801.2169096, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0209095458984374, \"count\": 1, \"min\": 1.0209095458984374, \"max\": 1.0209095458984374}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=7, train mse_objective =0.9171712646484375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=7, criteria=mse_objective, value=0.659395751953125\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 7: Loss has not improved for 0 epochs.\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 7\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to 
\"/tmp/tmp0c3gpaql/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 53.333333333333336 % of epochs\n", "#metrics {\"StartTime\": 1687302801.2017632, \"EndTime\": 1687302801.2227693, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 7, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 10639.0, \"count\": 1, \"min\": 10639, \"max\": 10639}, \"Total Batches Seen\": {\"sum\": 19.0, \"count\": 1, \"min\": 19, \"max\": 19}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 10.0, \"count\": 1, \"min\": 10, \"max\": 10}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=50801.81380620646 records/second\n", "[2023-06-20 23:13:21.237] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 20, \"duration\": 14, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.2370536, \"EndTime\": 1687302801.2370925, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9041302490234375, \"count\": 1, \"min\": 0.9041302490234375, \"max\": 0.9041302490234375}}}\n", "#metrics {\"StartTime\": 1687302801.2371347, \"EndTime\": 1687302801.237142, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 
0.8814446411132812, \"count\": 1, \"min\": 0.8814446411132812, \"max\": 0.8814446411132812}}}\n", "#metrics {\"StartTime\": 1687302801.2371705, \"EndTime\": 1687302801.2371767, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9116272583007813, \"count\": 1, \"min\": 0.9116272583007813, \"max\": 0.9116272583007813}}}\n", "#metrics {\"StartTime\": 1687302801.237202, \"EndTime\": 1687302801.2372077, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8853553466796875, \"count\": 1, \"min\": 0.8853553466796875, \"max\": 0.8853553466796875}}}\n", "#metrics {\"StartTime\": 1687302801.237232, \"EndTime\": 1687302801.2372372, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7173060302734375, \"count\": 1, \"min\": 0.7173060302734375, \"max\": 0.7173060302734375}}}\n", "#metrics {\"StartTime\": 1687302801.2372606, \"EndTime\": 1687302801.2372656, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7613701782226563, \"count\": 1, \"min\": 0.7613701782226563, \"max\": 0.7613701782226563}}}\n", "#metrics {\"StartTime\": 1687302801.2372894, \"EndTime\": 1687302801.2372947, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.799380126953125, \"count\": 1, \"min\": 0.799380126953125, \"max\": 0.799380126953125}}}\n", "#metrics {\"StartTime\": 1687302801.2373145, \"EndTime\": 1687302801.2373197, \"Dimensions\": {\"Algorithm\": 
\"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7109956665039062, \"count\": 1, \"min\": 0.7109956665039062, \"max\": 0.7109956665039062}}}\n", "#metrics {\"StartTime\": 1687302801.2373397, \"EndTime\": 1687302801.2373447, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7968128662109375, \"count\": 1, \"min\": 0.7968128662109375, \"max\": 0.7968128662109375}}}\n", "#metrics {\"StartTime\": 1687302801.237365, \"EndTime\": 1687302801.2373698, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.87186474609375, \"count\": 1, \"min\": 0.87186474609375, \"max\": 0.87186474609375}}}\n", "#metrics {\"StartTime\": 1687302801.23739, \"EndTime\": 1687302801.237395, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.96947900390625, \"count\": 1, \"min\": 0.96947900390625, \"max\": 0.96947900390625}}}\n", "#metrics {\"StartTime\": 1687302801.237417, \"EndTime\": 1687302801.237422, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8979135131835938, \"count\": 1, \"min\": 0.8979135131835938, \"max\": 0.8979135131835938}}}\n", "#metrics {\"StartTime\": 1687302801.2374425, \"EndTime\": 1687302801.2374475, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7415409545898437, \"count\": 1, \"min\": 0.7415409545898437, \"max\": 
0.7415409545898437}}}\n", "#metrics {\"StartTime\": 1687302801.2374697, \"EndTime\": 1687302801.2374744, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7811779174804687, \"count\": 1, \"min\": 0.7811779174804687, \"max\": 0.7811779174804687}}}\n", "#metrics {\"StartTime\": 1687302801.2374947, \"EndTime\": 1687302801.2374997, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7442326049804687, \"count\": 1, \"min\": 0.7442326049804687, \"max\": 0.7442326049804687}}}\n", "#metrics {\"StartTime\": 1687302801.237519, \"EndTime\": 1687302801.2375243, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7263248901367187, \"count\": 1, \"min\": 0.7263248901367187, \"max\": 0.7263248901367187}}}\n", "#metrics {\"StartTime\": 1687302801.2375438, \"EndTime\": 1687302801.2375488, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9253173828125, \"count\": 1, \"min\": 0.9253173828125, \"max\": 0.9253173828125}}}\n", "#metrics {\"StartTime\": 1687302801.2375681, \"EndTime\": 1687302801.2375731, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9488567504882812, \"count\": 1, \"min\": 0.9488567504882812, \"max\": 0.9488567504882812}}}\n", "#metrics {\"StartTime\": 1687302801.2375932, \"EndTime\": 1687302801.237598, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", 
\"epoch\": 8, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9623759155273437, \"count\": 1, \"min\": 0.9623759155273437, \"max\": 0.9623759155273437}}}\n", "#metrics {\"StartTime\": 1687302801.2376175, \"EndTime\": 1687302801.2376225, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8814644775390625, \"count\": 1, \"min\": 0.8814644775390625, \"max\": 0.8814644775390625}}}\n", "#metrics {\"StartTime\": 1687302801.2376425, \"EndTime\": 1687302801.2376473, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7748320922851563, \"count\": 1, \"min\": 0.7748320922851563, \"max\": 0.7748320922851563}}}\n", "#metrics {\"StartTime\": 1687302801.2376692, \"EndTime\": 1687302801.2376742, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7945308837890624, \"count\": 1, \"min\": 0.7945308837890624, \"max\": 0.7945308837890624}}}\n", "#metrics {\"StartTime\": 1687302801.237694, \"EndTime\": 1687302801.2376988, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8151787719726562, \"count\": 1, \"min\": 0.8151787719726562, \"max\": 0.8151787719726562}}}\n", "#metrics {\"StartTime\": 1687302801.2377183, \"EndTime\": 1687302801.237723, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.756558837890625, \"count\": 1, \"min\": 0.756558837890625, \"max\": 0.756558837890625}}}\n", "#metrics {\"StartTime\": 
1687302801.237743, \"EndTime\": 1687302801.2377477, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9754213256835937, \"count\": 1, \"min\": 0.9754213256835937, \"max\": 0.9754213256835937}}}\n", "#metrics {\"StartTime\": 1687302801.2377675, \"EndTime\": 1687302801.2377722, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9881942138671875, \"count\": 1, \"min\": 0.9881942138671875, \"max\": 0.9881942138671875}}}\n", "#metrics {\"StartTime\": 1687302801.2377918, \"EndTime\": 1687302801.2377965, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9727488403320312, \"count\": 1, \"min\": 0.9727488403320312, \"max\": 0.9727488403320312}}}\n", "#metrics {\"StartTime\": 1687302801.2378163, \"EndTime\": 1687302801.237821, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9879215087890625, \"count\": 1, \"min\": 0.9879215087890625, \"max\": 0.9879215087890625}}}\n", "#metrics {\"StartTime\": 1687302801.2378407, \"EndTime\": 1687302801.2378454, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0943367919921876, \"count\": 1, \"min\": 1.0943367919921876, \"max\": 1.0943367919921876}}}\n", "#metrics {\"StartTime\": 1687302801.2378652, \"EndTime\": 1687302801.23787, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 29}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 1.0675760498046876, \"count\": 1, \"min\": 1.0675760498046876, \"max\": 1.0675760498046876}}}\n", "#metrics {\"StartTime\": 1687302801.2378898, \"EndTime\": 1687302801.2378948, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1136256103515625, \"count\": 1, \"min\": 1.1136256103515625, \"max\": 1.1136256103515625}}}\n", "#metrics {\"StartTime\": 1687302801.237914, \"EndTime\": 1687302801.237919, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.159076171875, \"count\": 1, \"min\": 1.159076171875, \"max\": 1.159076171875}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=8, train mse_objective =0.9041302490234375\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=8, criteria=mse_objective, value=0.7109956665039062\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 8\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpxsxu8gxr/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 60.0 % of epochs\n", "#metrics {\"StartTime\": 1687302801.222916, \"EndTime\": 1687302801.2432523, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 8, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 11710.0, \"count\": 1, \"min\": 11710, \"max\": 11710}, \"Total Batches Seen\": {\"sum\": 21.0, \"count\": 1, \"min\": 21, \"max\": 21}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, 
\"max\": 2}, \"Reset Count\": {\"sum\": 11.0, \"count\": 1, \"min\": 11, \"max\": 11}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=52468.60461367751 records/second\n", "[2023-06-20 23:13:21.258] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 22, \"duration\": 15, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.2589958, \"EndTime\": 1687302801.259035, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8914915161132813, \"count\": 1, \"min\": 0.8914915161132813, \"max\": 0.8914915161132813}}}\n", "#metrics {\"StartTime\": 1687302801.2590773, \"EndTime\": 1687302801.2590845, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8687643432617187, \"count\": 1, \"min\": 0.8687643432617187, \"max\": 0.8687643432617187}}}\n", "#metrics {\"StartTime\": 1687302801.259134, \"EndTime\": 1687302801.2591405, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8988980102539063, \"count\": 1, \"min\": 0.8988980102539063, \"max\": 0.8988980102539063}}}\n", "#metrics {\"StartTime\": 1687302801.2591653, \"EndTime\": 1687302801.2591708, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8725224609375, \"count\": 1, \"min\": 0.8725224609375, 
\"max\": 0.8725224609375}}}\n", "#metrics {\"StartTime\": 1687302801.2591915, \"EndTime\": 1687302801.2591968, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7096409301757812, \"count\": 1, \"min\": 0.7096409301757812, \"max\": 0.7096409301757812}}}\n", "#metrics {\"StartTime\": 1687302801.2592297, \"EndTime\": 1687302801.2592354, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7339646606445313, \"count\": 1, \"min\": 0.7339646606445313, \"max\": 0.7339646606445313}}}\n", "#metrics {\"StartTime\": 1687302801.259256, \"EndTime\": 1687302801.2592719, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8223572998046875, \"count\": 1, \"min\": 0.8223572998046875, \"max\": 0.8223572998046875}}}\n", "#metrics {\"StartTime\": 1687302801.259293, \"EndTime\": 1687302801.2592983, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7090670166015625, \"count\": 1, \"min\": 0.7090670166015625, \"max\": 0.7090670166015625}}}\n", "#metrics {\"StartTime\": 1687302801.2593205, \"EndTime\": 1687302801.259326, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7876348876953125, \"count\": 1, \"min\": 0.7876348876953125, \"max\": 0.7876348876953125}}}\n", "#metrics {\"StartTime\": 1687302801.259346, \"EndTime\": 1687302801.259351, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", 
\"epoch\": 9, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8591118774414063, \"count\": 1, \"min\": 0.8591118774414063, \"max\": 0.8591118774414063}}}\n", "#metrics {\"StartTime\": 1687302801.2593827, \"EndTime\": 1687302801.2593884, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9549996337890625, \"count\": 1, \"min\": 0.9549996337890625, \"max\": 0.9549996337890625}}}\n", "#metrics {\"StartTime\": 1687302801.259409, \"EndTime\": 1687302801.259414, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8843358154296875, \"count\": 1, \"min\": 0.8843358154296875, \"max\": 0.8843358154296875}}}\n", "#metrics {\"StartTime\": 1687302801.2594528, \"EndTime\": 1687302801.259473, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8310491943359375, \"count\": 1, \"min\": 0.8310491943359375, \"max\": 0.8310491943359375}}}\n", "#metrics {\"StartTime\": 1687302801.2594943, \"EndTime\": 1687302801.2595093, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8300687866210937, \"count\": 1, \"min\": 0.8300687866210937, \"max\": 0.8300687866210937}}}\n", "#metrics {\"StartTime\": 1687302801.2595303, \"EndTime\": 1687302801.2595456, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7173473510742188, \"count\": 1, \"min\": 0.7173473510742188, \"max\": 0.7173473510742188}}}\n", "#metrics {\"StartTime\": 
1687302801.2595663, \"EndTime\": 1687302801.2595716, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.836541015625, \"count\": 1, \"min\": 0.836541015625, \"max\": 0.836541015625}}}\n", "#metrics {\"StartTime\": 1687302801.2595913, \"EndTime\": 1687302801.2595966, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9116759643554687, \"count\": 1, \"min\": 0.9116759643554687, \"max\": 0.9116759643554687}}}\n", "#metrics {\"StartTime\": 1687302801.2596183, \"EndTime\": 1687302801.2596238, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9354309692382813, \"count\": 1, \"min\": 0.9354309692382813, \"max\": 0.9354309692382813}}}\n", "#metrics {\"StartTime\": 1687302801.2596433, \"EndTime\": 1687302801.2596486, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9474083862304687, \"count\": 1, \"min\": 0.9474083862304687, \"max\": 0.9474083862304687}}}\n", "#metrics {\"StartTime\": 1687302801.259679, \"EndTime\": 1687302801.2596843, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.86839794921875, \"count\": 1, \"min\": 0.86839794921875, \"max\": 0.86839794921875}}}\n", "#metrics {\"StartTime\": 1687302801.2597144, \"EndTime\": 1687302801.2597196, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 20}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 0.7741822509765625, \"count\": 1, \"min\": 0.7741822509765625, \"max\": 0.7741822509765625}}}\n", "#metrics {\"StartTime\": 1687302801.2597501, \"EndTime\": 1687302801.2597556, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8315996704101563, \"count\": 1, \"min\": 0.8315996704101563, \"max\": 0.8315996704101563}}}\n", "#metrics {\"StartTime\": 1687302801.2597857, \"EndTime\": 1687302801.2597997, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8564396362304687, \"count\": 1, \"min\": 0.8564396362304687, \"max\": 0.8564396362304687}}}\n", "#metrics {\"StartTime\": 1687302801.2598329, \"EndTime\": 1687302801.2598388, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7822568969726562, \"count\": 1, \"min\": 0.7822568969726562, \"max\": 0.7822568969726562}}}\n", "#metrics {\"StartTime\": 1687302801.2598689, \"EndTime\": 1687302801.2598746, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9763377075195312, \"count\": 1, \"min\": 0.9763377075195312, \"max\": 0.9763377075195312}}}\n", "#metrics {\"StartTime\": 1687302801.2599056, \"EndTime\": 1687302801.259911, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9916995239257812, \"count\": 1, \"min\": 0.9916995239257812, \"max\": 0.9916995239257812}}}\n", "#metrics {\"StartTime\": 1687302801.2599413, \"EndTime\": 
1687302801.2599468, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9771697998046875, \"count\": 1, \"min\": 0.9771697998046875, \"max\": 0.9771697998046875}}}\n", "#metrics {\"StartTime\": 1687302801.2599766, \"EndTime\": 1687302801.2599823, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9853470458984375, \"count\": 1, \"min\": 0.9853470458984375, \"max\": 0.9853470458984375}}}\n", "#metrics {\"StartTime\": 1687302801.2600024, \"EndTime\": 1687302801.2600074, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.1163533935546874, \"count\": 1, \"min\": 1.1163533935546874, \"max\": 1.1163533935546874}}}\n", "#metrics {\"StartTime\": 1687302801.2600305, \"EndTime\": 1687302801.2600355, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.059918212890625, \"count\": 1, \"min\": 1.059918212890625, \"max\": 1.059918212890625}}}\n", "#metrics {\"StartTime\": 1687302801.2600555, \"EndTime\": 1687302801.2600608, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.151463134765625, \"count\": 1, \"min\": 1.151463134765625, \"max\": 1.151463134765625}}}\n", "#metrics {\"StartTime\": 1687302801.2600803, \"EndTime\": 1687302801.2600853, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 
1.1741480712890624, \"count\": 1, \"min\": 1.1741480712890624, \"max\": 1.1741480712890624}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=9, train mse_objective =0.8914915161132813\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=9, criteria=mse_objective, value=0.7090670166015625\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 9\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpk4emfl2y/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 66.66666666666667 % of epochs\n", "#metrics {\"StartTime\": 1687302801.2433968, \"EndTime\": 1687302801.2659898, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 9, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 12781.0, \"count\": 1, \"min\": 12781, \"max\": 12781}, \"Total Batches Seen\": {\"sum\": 23.0, \"count\": 1, \"min\": 23, \"max\": 23}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 12.0, \"count\": 1, \"min\": 12, \"max\": 12}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=47250.442663300724 records/second\n", "[2023-06-20 23:13:21.281] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 24, \"duration\": 15, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.2819588, \"EndTime\": 1687302801.2819989, \"Dimensions\": {\"Algorithm\": \"Linear 
Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8791598510742188, \"count\": 1, \"min\": 0.8791598510742188, \"max\": 0.8791598510742188}}}\n", "#metrics {\"StartTime\": 1687302801.2820463, \"EndTime\": 1687302801.2820542, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8561797485351562, \"count\": 1, \"min\": 0.8561797485351562, \"max\": 0.8561797485351562}}}\n", "#metrics {\"StartTime\": 1687302801.2821038, \"EndTime\": 1687302801.2821107, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.88634228515625, \"count\": 1, \"min\": 0.88634228515625, \"max\": 0.88634228515625}}}\n", "#metrics {\"StartTime\": 1687302801.282133, \"EndTime\": 1687302801.2821383, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8599588623046875, \"count\": 1, \"min\": 0.8599588623046875, \"max\": 0.8599588623046875}}}\n", "#metrics {\"StartTime\": 1687302801.282159, \"EndTime\": 1687302801.2821646, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6561416015625, \"count\": 1, \"min\": 0.6561416015625, \"max\": 0.6561416015625}}}\n", "#metrics {\"StartTime\": 1687302801.282187, \"EndTime\": 1687302801.2821925, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.66132080078125, \"count\": 1, \"min\": 0.66132080078125, \"max\": 
0.66132080078125}}}\n", "#metrics {\"StartTime\": 1687302801.2822125, \"EndTime\": 1687302801.2822177, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7268419799804687, \"count\": 1, \"min\": 0.7268419799804687, \"max\": 0.7268419799804687}}}\n", "#metrics {\"StartTime\": 1687302801.2822497, \"EndTime\": 1687302801.2822554, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.664253662109375, \"count\": 1, \"min\": 0.664253662109375, \"max\": 0.664253662109375}}}\n", "#metrics {\"StartTime\": 1687302801.282276, \"EndTime\": 1687302801.282281, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.77868798828125, \"count\": 1, \"min\": 0.77868798828125, \"max\": 0.77868798828125}}}\n", "#metrics {\"StartTime\": 1687302801.2823117, \"EndTime\": 1687302801.2823172, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8466903686523437, \"count\": 1, \"min\": 0.8466903686523437, \"max\": 0.8466903686523437}}}\n", "#metrics {\"StartTime\": 1687302801.282348, \"EndTime\": 1687302801.2823536, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9406214599609375, \"count\": 1, \"min\": 0.9406214599609375, \"max\": 0.9406214599609375}}}\n", "#metrics {\"StartTime\": 1687302801.282384, \"EndTime\": 1687302801.2823892, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 
10, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8709078369140625, \"count\": 1, \"min\": 0.8709078369140625, \"max\": 0.8709078369140625}}}\n", "#metrics {\"StartTime\": 1687302801.2824101, \"EndTime\": 1687302801.2824152, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7868310546875, \"count\": 1, \"min\": 0.7868310546875, \"max\": 0.7868310546875}}}\n", "#metrics {\"StartTime\": 1687302801.2824354, \"EndTime\": 1687302801.2824404, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7488751220703125, \"count\": 1, \"min\": 0.7488751220703125, \"max\": 0.7488751220703125}}}\n", "#metrics {\"StartTime\": 1687302801.2824628, \"EndTime\": 1687302801.2824678, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6501358032226563, \"count\": 1, \"min\": 0.6501358032226563, \"max\": 0.6501358032226563}}}\n", "#metrics {\"StartTime\": 1687302801.2824879, \"EndTime\": 1687302801.2824926, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8065079345703124, \"count\": 1, \"min\": 0.8065079345703124, \"max\": 0.8065079345703124}}}\n", "#metrics {\"StartTime\": 1687302801.2825236, \"EndTime\": 1687302801.282529, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8983902587890625, \"count\": 1, \"min\": 0.8983902587890625, \"max\": 0.8983902587890625}}}\n", "#metrics {\"StartTime\": 
1687302801.2825673, \"EndTime\": 1687302801.282573, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.92234423828125, \"count\": 1, \"min\": 0.92234423828125, \"max\": 0.92234423828125}}}\n", "#metrics {\"StartTime\": 1687302801.2826092, \"EndTime\": 1687302801.2826145, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9328131103515624, \"count\": 1, \"min\": 0.9328131103515624, \"max\": 0.9328131103515624}}}\n", "#metrics {\"StartTime\": 1687302801.282645, \"EndTime\": 1687302801.2826505, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8555933837890625, \"count\": 1, \"min\": 0.8555933837890625, \"max\": 0.8555933837890625}}}\n", "#metrics {\"StartTime\": 1687302801.2826703, \"EndTime\": 1687302801.2826755, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7120524291992187, \"count\": 1, \"min\": 0.7120524291992187, \"max\": 0.7120524291992187}}}\n", "#metrics {\"StartTime\": 1687302801.2827053, \"EndTime\": 1687302801.2827108, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7839982299804688, \"count\": 1, \"min\": 0.7839982299804688, \"max\": 0.7839982299804688}}}\n", "#metrics {\"StartTime\": 1687302801.2827313, \"EndTime\": 1687302801.2827363, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 22}, \"Metrics\": 
{\"train_mse_objective\": {\"sum\": 0.789564453125, \"count\": 1, \"min\": 0.789564453125, \"max\": 0.789564453125}}}\n", "#metrics {\"StartTime\": 1687302801.2827663, \"EndTime\": 1687302801.2827718, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.741755615234375, \"count\": 1, \"min\": 0.741755615234375, \"max\": 0.741755615234375}}}\n", "#metrics {\"StartTime\": 1687302801.2828014, \"EndTime\": 1687302801.282807, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9777828979492188, \"count\": 1, \"min\": 0.9777828979492188, \"max\": 0.9777828979492188}}}\n", "#metrics {\"StartTime\": 1687302801.2828274, \"EndTime\": 1687302801.2828321, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9946696166992187, \"count\": 1, \"min\": 0.9946696166992187, \"max\": 0.9946696166992187}}}\n", "#metrics {\"StartTime\": 1687302801.2828636, \"EndTime\": 1687302801.282869, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9814978637695313, \"count\": 1, \"min\": 0.9814978637695313, \"max\": 0.9814978637695313}}}\n", "#metrics {\"StartTime\": 1687302801.2828891, \"EndTime\": 1687302801.2829044, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9828726806640625, \"count\": 1, \"min\": 0.9828726806640625, \"max\": 0.9828726806640625}}}\n", "#metrics {\"StartTime\": 1687302801.282925, \"EndTime\": 1687302801.28293, 
\"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0558179931640626, \"count\": 1, \"min\": 1.0558179931640626, \"max\": 1.0558179931640626}}}\n", "#metrics {\"StartTime\": 1687302801.2829504, \"EndTime\": 1687302801.2829554, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9818250122070312, \"count\": 1, \"min\": 0.9818250122070312, \"max\": 0.9818250122070312}}}\n", "#metrics {\"StartTime\": 1687302801.2829776, \"EndTime\": 1687302801.2829828, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.07366552734375, \"count\": 1, \"min\": 1.07366552734375, \"max\": 1.07366552734375}}}\n", "#metrics {\"StartTime\": 1687302801.2830026, \"EndTime\": 1687302801.2830079, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0806964111328126, \"count\": 1, \"min\": 1.0806964111328126, \"max\": 1.0806964111328126}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=10, train mse_objective =0.8791598510742188\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=10, criteria=mse_objective, value=0.6501358032226563\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 10: Loss improved. 
Updating best model\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 10\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmp8oc222qw/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 73.33333333333333 % of epochs\n", "#metrics {\"StartTime\": 1687302801.266157, \"EndTime\": 1687302801.2895145, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 10, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 13852.0, \"count\": 1, \"min\": 13852, \"max\": 13852}, \"Total Batches Seen\": {\"sum\": 25.0, \"count\": 1, \"min\": 25, \"max\": 25}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 13.0, \"count\": 1, \"min\": 13, \"max\": 13}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=45706.69390828339 records/second\n", "[2023-06-20 23:13:21.305] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 26, \"duration\": 15, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.3050885, \"EndTime\": 1687302801.305128, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8674849243164062, \"count\": 1, \"min\": 0.8674849243164062, \"max\": 0.8674849243164062}}}\n", "#metrics {\"StartTime\": 1687302801.305173, \"EndTime\": 1687302801.305183, \"Dimensions\": {\"Algorithm\": 
\"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8438611450195312, \"count\": 1, \"min\": 0.8438611450195312, \"max\": 0.8438611450195312}}}\n", "#metrics {\"StartTime\": 1687302801.3052475, \"EndTime\": 1687302801.305258, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8744031372070312, \"count\": 1, \"min\": 0.8744031372070312, \"max\": 0.8744031372070312}}}\n", "#metrics {\"StartTime\": 1687302801.3052924, \"EndTime\": 1687302801.3053005, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8479910888671875, \"count\": 1, \"min\": 0.8479910888671875, \"max\": 0.8479910888671875}}}\n", "#metrics {\"StartTime\": 1687302801.3053317, \"EndTime\": 1687302801.30534, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6508237915039062, \"count\": 1, \"min\": 0.6508237915039062, \"max\": 0.6508237915039062}}}\n", "#metrics {\"StartTime\": 1687302801.3053672, \"EndTime\": 1687302801.3053746, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6531045532226563, \"count\": 1, \"min\": 0.6531045532226563, \"max\": 0.6531045532226563}}}\n", "#metrics {\"StartTime\": 1687302801.3054044, \"EndTime\": 1687302801.305412, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6477562866210937, \"count\": 1, \"min\": 
0.6477562866210937, \"max\": 0.6477562866210937}}}\n", "#metrics {\"StartTime\": 1687302801.305443, \"EndTime\": 1687302801.3054516, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6726028442382812, \"count\": 1, \"min\": 0.6726028442382812, \"max\": 0.6726028442382812}}}\n", "#metrics {\"StartTime\": 1687302801.3054821, \"EndTime\": 1687302801.3054905, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7701724853515625, \"count\": 1, \"min\": 0.7701724853515625, \"max\": 0.7701724853515625}}}\n", "#metrics {\"StartTime\": 1687302801.3055196, \"EndTime\": 1687302801.3055274, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8348819580078125, \"count\": 1, \"min\": 0.8348819580078125, \"max\": 0.8348819580078125}}}\n", "#metrics {\"StartTime\": 1687302801.3055558, \"EndTime\": 1687302801.3055632, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.926640625, \"count\": 1, \"min\": 0.926640625, \"max\": 0.926640625}}}\n", "#metrics {\"StartTime\": 1687302801.3055944, \"EndTime\": 1687302801.3056028, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8579482421875, \"count\": 1, \"min\": 0.8579482421875, \"max\": 0.8579482421875}}}\n", "#metrics {\"StartTime\": 1687302801.305633, \"EndTime\": 1687302801.305641, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": 
\"training\", \"epoch\": 11, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6874268188476562, \"count\": 1, \"min\": 0.6874268188476562, \"max\": 0.6874268188476562}}}\n", "#metrics {\"StartTime\": 1687302801.3056703, \"EndTime\": 1687302801.3056777, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6541807861328125, \"count\": 1, \"min\": 0.6541807861328125, \"max\": 0.6541807861328125}}}\n", "#metrics {\"StartTime\": 1687302801.3057055, \"EndTime\": 1687302801.3057127, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6472337036132813, \"count\": 1, \"min\": 0.6472337036132813, \"max\": 0.6472337036132813}}}\n", "#metrics {\"StartTime\": 1687302801.305741, \"EndTime\": 1687302801.3057485, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7001089477539062, \"count\": 1, \"min\": 0.7001089477539062, \"max\": 0.7001089477539062}}}\n", "#metrics {\"StartTime\": 1687302801.305781, \"EndTime\": 1687302801.305789, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.885803955078125, \"count\": 1, \"min\": 0.885803955078125, \"max\": 0.885803955078125}}}\n", "#metrics {\"StartTime\": 1687302801.3058171, \"EndTime\": 1687302801.305825, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9098401489257812, \"count\": 1, \"min\": 0.9098401489257812, \"max\": 0.9098401489257812}}}\n", "#metrics 
{\"StartTime\": 1687302801.3058543, \"EndTime\": 1687302801.305862, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9190045166015625, \"count\": 1, \"min\": 0.9190045166015625, \"max\": 0.9190045166015625}}}\n", "#metrics {\"StartTime\": 1687302801.30589, \"EndTime\": 1687302801.3058972, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8433924560546875, \"count\": 1, \"min\": 0.8433924560546875, \"max\": 0.8433924560546875}}}\n", "#metrics {\"StartTime\": 1687302801.3059251, \"EndTime\": 1687302801.3059332, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7006604614257812, \"count\": 1, \"min\": 0.7006604614257812, \"max\": 0.7006604614257812}}}\n", "#metrics {\"StartTime\": 1687302801.3059626, \"EndTime\": 1687302801.3059707, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7552601928710938, \"count\": 1, \"min\": 0.7552601928710938, \"max\": 0.7552601928710938}}}\n", "#metrics {\"StartTime\": 1687302801.3060007, \"EndTime\": 1687302801.3060086, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.74734716796875, \"count\": 1, \"min\": 0.74734716796875, \"max\": 0.74734716796875}}}\n", "#metrics {\"StartTime\": 1687302801.30604, \"EndTime\": 1687302801.306048, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 23}, 
\"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7451403198242188, \"count\": 1, \"min\": 0.7451403198242188, \"max\": 0.7451403198242188}}}\n", "#metrics {\"StartTime\": 1687302801.306077, \"EndTime\": 1687302801.306085, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9796765747070313, \"count\": 1, \"min\": 0.9796765747070313, \"max\": 0.9796765747070313}}}\n", "#metrics {\"StartTime\": 1687302801.3061168, \"EndTime\": 1687302801.3061254, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.99738232421875, \"count\": 1, \"min\": 0.99738232421875, \"max\": 0.99738232421875}}}\n", "#metrics {\"StartTime\": 1687302801.3061533, \"EndTime\": 1687302801.3061604, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.985761474609375, \"count\": 1, \"min\": 0.985761474609375, \"max\": 0.985761474609375}}}\n", "#metrics {\"StartTime\": 1687302801.3061883, \"EndTime\": 1687302801.306196, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9807617797851562, \"count\": 1, \"min\": 0.9807617797851562, \"max\": 0.9807617797851562}}}\n", "#metrics {\"StartTime\": 1687302801.3062232, \"EndTime\": 1687302801.306231, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0324927978515626, \"count\": 1, \"min\": 1.0324927978515626, \"max\": 1.0324927978515626}}}\n", "#metrics {\"StartTime\": 1687302801.306261, \"EndTime\": 
1687302801.3062687, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9663002319335937, \"count\": 1, \"min\": 0.9663002319335937, \"max\": 0.9663002319335937}}}\n", "#metrics {\"StartTime\": 1687302801.3062968, \"EndTime\": 1687302801.3063047, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0195751953125, \"count\": 1, \"min\": 1.0195751953125, \"max\": 1.0195751953125}}}\n", "#metrics {\"StartTime\": 1687302801.306333, \"EndTime\": 1687302801.3063412, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.01674462890625, \"count\": 1, \"min\": 1.01674462890625, \"max\": 1.01674462890625}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=11, train mse_objective =0.8674849243164062\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=11, criteria=mse_objective, value=0.6472337036132813\n", "[06/20/2023 23:13:21 INFO 140324763719488] Epoch 11: Loss improved. 
Updating best model\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 11\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmp0c9jgnh3/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 80.0 % of epochs\n", "#metrics {\"StartTime\": 1687302801.2896862, \"EndTime\": 1687302801.312142, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 11, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 14923.0, \"count\": 1, \"min\": 14923, \"max\": 14923}, \"Total Batches Seen\": {\"sum\": 27.0, \"count\": 1, \"min\": 27, \"max\": 27}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 14.0, \"count\": 1, \"min\": 14, \"max\": 14}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=47503.77614923385 records/second\n", "[2023-06-20 23:13:21.326] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 28, \"duration\": 13, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.3264565, \"EndTime\": 1687302801.3265002, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.85637353515625, \"count\": 1, \"min\": 0.85637353515625, \"max\": 0.85637353515625}}}\n", "#metrics {\"StartTime\": 1687302801.3265476, \"EndTime\": 1687302801.326558, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", 
\"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8321697387695313, \"count\": 1, \"min\": 0.8321697387695313, \"max\": 0.8321697387695313}}}\n", "#metrics {\"StartTime\": 1687302801.3265977, \"EndTime\": 1687302801.326607, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8630693969726563, \"count\": 1, \"min\": 0.8630693969726563, \"max\": 0.8630693969726563}}}\n", "#metrics {\"StartTime\": 1687302801.3266401, \"EndTime\": 1687302801.3266485, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8366801147460937, \"count\": 1, \"min\": 0.8366801147460937, \"max\": 0.8366801147460937}}}\n", "#metrics {\"StartTime\": 1687302801.326684, \"EndTime\": 1687302801.3266923, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6955277709960938, \"count\": 1, \"min\": 0.6955277709960938, \"max\": 0.6955277709960938}}}\n", "#metrics {\"StartTime\": 1687302801.3267233, \"EndTime\": 1687302801.326731, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6950498657226563, \"count\": 1, \"min\": 0.6950498657226563, \"max\": 0.6950498657226563}}}\n", "#metrics {\"StartTime\": 1687302801.3267543, \"EndTime\": 1687302801.326761, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.672427001953125, \"count\": 1, \"min\": 0.672427001953125, \"max\": 
0.672427001953125}}}\n", "#metrics {\"StartTime\": 1687302801.3267875, \"EndTime\": 1687302801.3267953, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7131567993164063, \"count\": 1, \"min\": 0.7131567993164063, \"max\": 0.7131567993164063}}}\n", "#metrics {\"StartTime\": 1687302801.3268247, \"EndTime\": 1687302801.326833, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7621722412109375, \"count\": 1, \"min\": 0.7621722412109375, \"max\": 0.7621722412109375}}}\n", "#metrics {\"StartTime\": 1687302801.3268578, \"EndTime\": 1687302801.3268652, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8236617431640625, \"count\": 1, \"min\": 0.8236617431640625, \"max\": 0.8236617431640625}}}\n", "#metrics {\"StartTime\": 1687302801.3268955, \"EndTime\": 1687302801.326903, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9132716674804687, \"count\": 1, \"min\": 0.9132716674804687, \"max\": 0.9132716674804687}}}\n", "#metrics {\"StartTime\": 1687302801.3269377, \"EndTime\": 1687302801.326946, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8457061157226563, \"count\": 1, \"min\": 0.8457061157226563, \"max\": 0.8457061157226563}}}\n", "#metrics {\"StartTime\": 1687302801.3269684, \"EndTime\": 1687302801.3269749, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": 
\"training\", \"epoch\": 12, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6622344360351563, \"count\": 1, \"min\": 0.6622344360351563, \"max\": 0.6622344360351563}}}\n", "#metrics {\"StartTime\": 1687302801.3270032, \"EndTime\": 1687302801.3270109, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6622960205078126, \"count\": 1, \"min\": 0.6622960205078126, \"max\": 0.6622960205078126}}}\n", "#metrics {\"StartTime\": 1687302801.3270392, \"EndTime\": 1687302801.3270469, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6948753051757812, \"count\": 1, \"min\": 0.6948753051757812, \"max\": 0.6948753051757812}}}\n", "#metrics {\"StartTime\": 1687302801.3270762, \"EndTime\": 1687302801.327084, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6568095092773437, \"count\": 1, \"min\": 0.6568095092773437, \"max\": 0.6568095092773437}}}\n", "#metrics {\"StartTime\": 1687302801.327112, \"EndTime\": 1687302801.3271194, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8738639526367188, \"count\": 1, \"min\": 0.8738639526367188, \"max\": 0.8738639526367188}}}\n", "#metrics {\"StartTime\": 1687302801.327148, \"EndTime\": 1687302801.327156, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8979080810546874, \"count\": 1, \"min\": 0.8979080810546874, \"max\": 0.8979080810546874}}}\n", "#metrics 
{\"StartTime\": 1687302801.327184, \"EndTime\": 1687302801.3271923, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9058086547851563, \"count\": 1, \"min\": 0.9058086547851563, \"max\": 0.9058086547851563}}}\n", "#metrics {\"StartTime\": 1687302801.32722, \"EndTime\": 1687302801.3272274, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8319354248046875, \"count\": 1, \"min\": 0.8319354248046875, \"max\": 0.8319354248046875}}}\n", "#metrics {\"StartTime\": 1687302801.3272576, \"EndTime\": 1687302801.3272665, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7334386596679687, \"count\": 1, \"min\": 0.7334386596679687, \"max\": 0.7334386596679687}}}\n", "#metrics {\"StartTime\": 1687302801.327297, \"EndTime\": 1687302801.327305, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7564253540039062, \"count\": 1, \"min\": 0.7564253540039062, \"max\": 0.7564253540039062}}}\n", "#metrics {\"StartTime\": 1687302801.3273313, \"EndTime\": 1687302801.32734, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.771276611328125, \"count\": 1, \"min\": 0.771276611328125, \"max\": 0.771276611328125}}}\n", "#metrics {\"StartTime\": 1687302801.3273687, \"EndTime\": 1687302801.3273761, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 23}, 
\"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7873569946289063, \"count\": 1, \"min\": 0.7873569946289063, \"max\": 0.7873569946289063}}}\n", "#metrics {\"StartTime\": 1687302801.3274033, \"EndTime\": 1687302801.3274117, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.981862548828125, \"count\": 1, \"min\": 0.981862548828125, \"max\": 0.981862548828125}}}\n", "#metrics {\"StartTime\": 1687302801.3274372, \"EndTime\": 1687302801.3274443, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9998084106445313, \"count\": 1, \"min\": 0.9998084106445313, \"max\": 0.9998084106445313}}}\n", "#metrics {\"StartTime\": 1687302801.3274739, \"EndTime\": 1687302801.3274815, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9901381225585938, \"count\": 1, \"min\": 0.9901381225585938, \"max\": 0.9901381225585938}}}\n", "#metrics {\"StartTime\": 1687302801.3275094, \"EndTime\": 1687302801.3275175, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9793463745117188, \"count\": 1, \"min\": 0.9793463745117188, \"max\": 0.9793463745117188}}}\n", "#metrics {\"StartTime\": 1687302801.3275478, \"EndTime\": 1687302801.3275561, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0514930419921875, \"count\": 1, \"min\": 1.0514930419921875, \"max\": 1.0514930419921875}}}\n", "#metrics {\"StartTime\": 1687302801.3275867, 
\"EndTime\": 1687302801.3275945, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.032868896484375, \"count\": 1, \"min\": 1.032868896484375, \"max\": 1.032868896484375}}}\n", "#metrics {\"StartTime\": 1687302801.327624, \"EndTime\": 1687302801.327632, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0449483642578126, \"count\": 1, \"min\": 1.0449483642578126, \"max\": 1.0449483642578126}}}\n", "#metrics {\"StartTime\": 1687302801.3276632, \"EndTime\": 1687302801.3276708, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0017442016601563, \"count\": 1, \"min\": 1.0017442016601563, \"max\": 1.0017442016601563}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=12, train mse_objective =0.85637353515625\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=12, criteria=mse_objective, value=0.6568095092773437\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 12\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmp15e3n2yu/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 86.66666666666667 % of epochs\n", "#metrics {\"StartTime\": 1687302801.3123474, \"EndTime\": 1687302801.3323867, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 12, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 15994.0, \"count\": 1, \"min\": 15994, \"max\": 15994}, \"Total Batches Seen\": {\"sum\": 29.0, \"count\": 1, 
\"min\": 29, \"max\": 29}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 15.0, \"count\": 1, \"min\": 15, \"max\": 15}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=53153.39341158652 records/second\n", "[2023-06-20 23:13:21.346] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 30, \"duration\": 13, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.3464518, \"EndTime\": 1687302801.346503, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8456355590820313, \"count\": 1, \"min\": 0.8456355590820313, \"max\": 0.8456355590820313}}}\n", "#metrics {\"StartTime\": 1687302801.3465524, \"EndTime\": 1687302801.3465633, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8211912841796875, \"count\": 1, \"min\": 0.8211912841796875, \"max\": 0.8211912841796875}}}\n", "#metrics {\"StartTime\": 1687302801.3465996, \"EndTime\": 1687302801.3466086, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.852114501953125, \"count\": 1, \"min\": 0.852114501953125, \"max\": 0.852114501953125}}}\n", "#metrics {\"StartTime\": 1687302801.346639, \"EndTime\": 1687302801.3466475, \"Dimensions\": {\"Algorithm\": 
\"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8257815551757812, \"count\": 1, \"min\": 0.8257815551757812, \"max\": 0.8257815551757812}}}\n", "#metrics {\"StartTime\": 1687302801.3466823, \"EndTime\": 1687302801.3466916, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7067615966796875, \"count\": 1, \"min\": 0.7067615966796875, \"max\": 0.7067615966796875}}}\n", "#metrics {\"StartTime\": 1687302801.3467212, \"EndTime\": 1687302801.3467295, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6984523315429687, \"count\": 1, \"min\": 0.6984523315429687, \"max\": 0.6984523315429687}}}\n", "#metrics {\"StartTime\": 1687302801.3467543, \"EndTime\": 1687302801.346762, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7371155395507812, \"count\": 1, \"min\": 0.7371155395507812, \"max\": 0.7371155395507812}}}\n", "#metrics {\"StartTime\": 1687302801.346796, \"EndTime\": 1687302801.3468046, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7008563232421875, \"count\": 1, \"min\": 0.7008563232421875, \"max\": 0.7008563232421875}}}\n", "#metrics {\"StartTime\": 1687302801.3468344, \"EndTime\": 1687302801.3468418, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7545886840820313, \"count\": 1, \"min\": 
0.7545886840820313, \"max\": 0.7545886840820313}}}\n", "#metrics {\"StartTime\": 1687302801.3468702, \"EndTime\": 1687302801.3468778, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8128051147460937, \"count\": 1, \"min\": 0.8128051147460937, \"max\": 0.8128051147460937}}}\n", "#metrics {\"StartTime\": 1687302801.3469064, \"EndTime\": 1687302801.346914, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.900618896484375, \"count\": 1, \"min\": 0.900618896484375, \"max\": 0.900618896484375}}}\n", "#metrics {\"StartTime\": 1687302801.3469443, \"EndTime\": 1687302801.3469522, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8340492553710938, \"count\": 1, \"min\": 0.8340492553710938, \"max\": 0.8340492553710938}}}\n", "#metrics {\"StartTime\": 1687302801.3469858, \"EndTime\": 1687302801.3469937, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7248660278320312, \"count\": 1, \"min\": 0.7248660278320312, \"max\": 0.7248660278320312}}}\n", "#metrics {\"StartTime\": 1687302801.3470218, \"EndTime\": 1687302801.347029, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7405442504882812, \"count\": 1, \"min\": 0.7405442504882812, \"max\": 0.7405442504882812}}}\n", "#metrics {\"StartTime\": 1687302801.3470564, \"EndTime\": 1687302801.347064, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": 
\"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7050415649414062, \"count\": 1, \"min\": 0.7050415649414062, \"max\": 0.7050415649414062}}}\n", "#metrics {\"StartTime\": 1687302801.3470922, \"EndTime\": 1687302801.3471003, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7087579345703126, \"count\": 1, \"min\": 0.7087579345703126, \"max\": 0.7087579345703126}}}\n", "#metrics {\"StartTime\": 1687302801.3471305, \"EndTime\": 1687302801.3471386, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8623970947265625, \"count\": 1, \"min\": 0.8623970947265625, \"max\": 0.8623970947265625}}}\n", "#metrics {\"StartTime\": 1687302801.347167, \"EndTime\": 1687302801.3471746, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.886348876953125, \"count\": 1, \"min\": 0.886348876953125, \"max\": 0.886348876953125}}}\n", "#metrics {\"StartTime\": 1687302801.3472056, \"EndTime\": 1687302801.3472137, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8930130615234375, \"count\": 1, \"min\": 0.8930130615234375, \"max\": 0.8930130615234375}}}\n", "#metrics {\"StartTime\": 1687302801.3472443, \"EndTime\": 1687302801.3472521, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8210028686523437, \"count\": 1, \"min\": 0.8210028686523437, \"max\": 
0.8210028686523437}}}\n", "#metrics {\"StartTime\": 1687302801.3472817, \"EndTime\": 1687302801.3472898, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7306174926757812, \"count\": 1, \"min\": 0.7306174926757812, \"max\": 0.7306174926757812}}}\n", "#metrics {\"StartTime\": 1687302801.3473148, \"EndTime\": 1687302801.3473225, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7227771606445312, \"count\": 1, \"min\": 0.7227771606445312, \"max\": 0.7227771606445312}}}\n", "#metrics {\"StartTime\": 1687302801.3473513, \"EndTime\": 1687302801.3473592, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7784200439453125, \"count\": 1, \"min\": 0.7784200439453125, \"max\": 0.7784200439453125}}}\n", "#metrics {\"StartTime\": 1687302801.3473852, \"EndTime\": 1687302801.3473928, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7812161865234375, \"count\": 1, \"min\": 0.7812161865234375, \"max\": 0.7812161865234375}}}\n", "#metrics {\"StartTime\": 1687302801.3474202, \"EndTime\": 1687302801.347428, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9839522094726563, \"count\": 1, \"min\": 0.9839522094726563, \"max\": 0.9839522094726563}}}\n", "#metrics {\"StartTime\": 1687302801.347452, \"EndTime\": 1687302801.3474588, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": 
\"training\", \"epoch\": 13, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0017119750976562, \"count\": 1, \"min\": 1.0017119750976562, \"max\": 1.0017119750976562}}}\n", "#metrics {\"StartTime\": 1687302801.3474867, \"EndTime\": 1687302801.3474946, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9944478149414062, \"count\": 1, \"min\": 0.9944478149414062, \"max\": 0.9944478149414062}}}\n", "#metrics {\"StartTime\": 1687302801.347524, \"EndTime\": 1687302801.347531, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9786015014648437, \"count\": 1, \"min\": 0.9786015014648437, \"max\": 0.9786015014648437}}}\n", "#metrics {\"StartTime\": 1687302801.3475587, \"EndTime\": 1687302801.3475673, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0315396728515625, \"count\": 1, \"min\": 1.0315396728515625, \"max\": 1.0315396728515625}}}\n", "#metrics {\"StartTime\": 1687302801.347595, \"EndTime\": 1687302801.3476024, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0745924072265625, \"count\": 1, \"min\": 1.0745924072265625, \"max\": 1.0745924072265625}}}\n", "#metrics {\"StartTime\": 1687302801.347632, \"EndTime\": 1687302801.3476398, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0667286376953125, \"count\": 1, \"min\": 1.0667286376953125, \"max\": 1.0667286376953125}}}\n", "#metrics 
{\"StartTime\": 1687302801.347669, \"EndTime\": 1687302801.347677, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9674108276367187, \"count\": 1, \"min\": 0.9674108276367187, \"max\": 0.9674108276367187}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=13, train mse_objective =0.8456355590820313\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=13, criteria=mse_objective, value=0.6984523315429687\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 13\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmpidxxwav4/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 93.33333333333333 % of epochs\n", "#metrics {\"StartTime\": 1687302801.3325934, \"EndTime\": 1687302801.3526533, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 13, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 17065.0, \"count\": 1, \"min\": 17065, \"max\": 17065}, \"Total Batches Seen\": {\"sum\": 31.0, \"count\": 1, \"min\": 31, \"max\": 31}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 16.0, \"count\": 1, \"min\": 16, \"max\": 16}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=53143.96091189797 records/second\n", "[2023-06-20 23:13:21.367] [tensorio] [info] 
epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 32, \"duration\": 14, \"num_examples\": 2, \"num_bytes\": 68544}\n", "#metrics {\"StartTime\": 1687302801.3674743, \"EndTime\": 1687302801.36752, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 0}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8352506713867187, \"count\": 1, \"min\": 0.8352506713867187, \"max\": 0.8352506713867187}}}\n", "#metrics {\"StartTime\": 1687302801.367566, \"EndTime\": 1687302801.3675764, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 1}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8106806640625, \"count\": 1, \"min\": 0.8106806640625, \"max\": 0.8106806640625}}}\n", "#metrics {\"StartTime\": 1687302801.3676102, \"EndTime\": 1687302801.3676193, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 2}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.84137158203125, \"count\": 1, \"min\": 0.84137158203125, \"max\": 0.84137158203125}}}\n", "#metrics {\"StartTime\": 1687302801.367651, \"EndTime\": 1687302801.367659, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 3}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8152022094726562, \"count\": 1, \"min\": 0.8152022094726562, \"max\": 0.8152022094726562}}}\n", "#metrics {\"StartTime\": 1687302801.367689, \"EndTime\": 1687302801.3676975, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 4}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6773630981445312, \"count\": 1, \"min\": 0.6773630981445312, \"max\": 0.6773630981445312}}}\n", "#metrics {\"StartTime\": 1687302801.367726, \"EndTime\": 1687302801.367734, 
\"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 5}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6698917236328125, \"count\": 1, \"min\": 0.6698917236328125, \"max\": 0.6698917236328125}}}\n", "#metrics {\"StartTime\": 1687302801.3677623, \"EndTime\": 1687302801.3677697, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 6}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7411163330078125, \"count\": 1, \"min\": 0.7411163330078125, \"max\": 0.7411163330078125}}}\n", "#metrics {\"StartTime\": 1687302801.3678083, \"EndTime\": 1687302801.3678164, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 7}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.652086181640625, \"count\": 1, \"min\": 0.652086181640625, \"max\": 0.652086181640625}}}\n", "#metrics {\"StartTime\": 1687302801.3678455, \"EndTime\": 1687302801.3678534, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 8}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7472750244140625, \"count\": 1, \"min\": 0.7472750244140625, \"max\": 0.7472750244140625}}}\n", "#metrics {\"StartTime\": 1687302801.3678818, \"EndTime\": 1687302801.3678896, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 9}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8023219604492188, \"count\": 1, \"min\": 0.8023219604492188, \"max\": 0.8023219604492188}}}\n", "#metrics {\"StartTime\": 1687302801.3679197, \"EndTime\": 1687302801.3679283, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 10}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8884547119140624, 
\"count\": 1, \"min\": 0.8884547119140624, \"max\": 0.8884547119140624}}}\n", "#metrics {\"StartTime\": 1687302801.3679574, \"EndTime\": 1687302801.367966, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 11}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8227200317382812, \"count\": 1, \"min\": 0.8227200317382812, \"max\": 0.8227200317382812}}}\n", "#metrics {\"StartTime\": 1687302801.367996, \"EndTime\": 1687302801.3680036, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 12}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7723599243164062, \"count\": 1, \"min\": 0.7723599243164062, \"max\": 0.7723599243164062}}}\n", "#metrics {\"StartTime\": 1687302801.368032, \"EndTime\": 1687302801.36804, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 13}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7706716918945312, \"count\": 1, \"min\": 0.7706716918945312, \"max\": 0.7706716918945312}}}\n", "#metrics {\"StartTime\": 1687302801.3680706, \"EndTime\": 1687302801.368079, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 14}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.679343505859375, \"count\": 1, \"min\": 0.679343505859375, \"max\": 0.679343505859375}}}\n", "#metrics {\"StartTime\": 1687302801.368109, \"EndTime\": 1687302801.368117, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 15}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7605070190429688, \"count\": 1, \"min\": 0.7605070190429688, \"max\": 0.7605070190429688}}}\n", "#metrics {\"StartTime\": 1687302801.3681479, \"EndTime\": 1687302801.368156, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", 
\"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 16}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8513489379882813, \"count\": 1, \"min\": 0.8513489379882813, \"max\": 0.8513489379882813}}}\n", "#metrics {\"StartTime\": 1687302801.3681848, \"EndTime\": 1687302801.3681924, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 17}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8751802368164062, \"count\": 1, \"min\": 0.8751802368164062, \"max\": 0.8751802368164062}}}\n", "#metrics {\"StartTime\": 1687302801.3682237, \"EndTime\": 1687302801.3682318, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 18}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8806326904296875, \"count\": 1, \"min\": 0.8806326904296875, \"max\": 0.8806326904296875}}}\n", "#metrics {\"StartTime\": 1687302801.3682606, \"EndTime\": 1687302801.3682683, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 19}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.8104238891601563, \"count\": 1, \"min\": 0.8104238891601563, \"max\": 0.8104238891601563}}}\n", "#metrics {\"StartTime\": 1687302801.3682964, \"EndTime\": 1687302801.3683045, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 20}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7004464721679687, \"count\": 1, \"min\": 0.7004464721679687, \"max\": 0.7004464721679687}}}\n", "#metrics {\"StartTime\": 1687302801.368336, \"EndTime\": 1687302801.368344, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 21}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.6703817749023437, \"count\": 1, \"min\": 0.6703817749023437, 
\"max\": 0.6703817749023437}}}\n", "#metrics {\"StartTime\": 1687302801.3683727, \"EndTime\": 1687302801.3683808, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 22}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.72801220703125, \"count\": 1, \"min\": 0.72801220703125, \"max\": 0.72801220703125}}}\n", "#metrics {\"StartTime\": 1687302801.3684297, \"EndTime\": 1687302801.3684382, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 23}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.7311100463867187, \"count\": 1, \"min\": 0.7311100463867187, \"max\": 0.7311100463867187}}}\n", "#metrics {\"StartTime\": 1687302801.3684683, \"EndTime\": 1687302801.368476, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 24}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9857627563476562, \"count\": 1, \"min\": 0.9857627563476562, \"max\": 0.9857627563476562}}}\n", "#metrics {\"StartTime\": 1687302801.3685048, \"EndTime\": 1687302801.368513, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 25}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.0030929565429687, \"count\": 1, \"min\": 1.0030929565429687, \"max\": 1.0030929565429687}}}\n", "#metrics {\"StartTime\": 1687302801.368544, \"EndTime\": 1687302801.3685522, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 26}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9982810668945312, \"count\": 1, \"min\": 0.9982810668945312, \"max\": 0.9982810668945312}}}\n", "#metrics {\"StartTime\": 1687302801.3685808, \"EndTime\": 1687302801.3685887, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": 
\"training\", \"epoch\": 14, \"model\": 27}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9782197265625, \"count\": 1, \"min\": 0.9782197265625, \"max\": 0.9782197265625}}}\n", "#metrics {\"StartTime\": 1687302801.368619, \"EndTime\": 1687302801.3686266, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 28}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.97561572265625, \"count\": 1, \"min\": 0.97561572265625, \"max\": 0.97561572265625}}}\n", "#metrics {\"StartTime\": 1687302801.368655, \"EndTime\": 1687302801.3686628, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 29}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.048558837890625, \"count\": 1, \"min\": 1.048558837890625, \"max\": 1.048558837890625}}}\n", "#metrics {\"StartTime\": 1687302801.3686924, \"EndTime\": 1687302801.3687005, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 30}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 1.025395263671875, \"count\": 1, \"min\": 1.025395263671875, \"max\": 1.025395263671875}}}\n", "#metrics {\"StartTime\": 1687302801.36873, \"EndTime\": 1687302801.368738, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"model\": 31}, \"Metrics\": {\"train_mse_objective\": {\"sum\": 0.9265108032226562, \"count\": 1, \"min\": 0.9265108032226562, \"max\": 0.9265108032226562}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, epoch=14, train mse_objective =0.8352506713867187\n", "[06/20/2023 23:13:21 INFO 140324763719488] #early_stopping_criteria_metric: host=algo-1, epoch=14, criteria=mse_objective, value=0.652086181640625\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saving model for epoch: 14\n", "[06/20/2023 23:13:21 INFO 
140324763719488] Saved checkpoint to \"/tmp/tmp1z0pi23r/mx-mod-0000.params\"\n", "[06/20/2023 23:13:21 INFO 140324763719488] #progress_metric: host=algo-1, completed 100.0 % of epochs\n", "#metrics {\"StartTime\": 1687302801.3528442, \"EndTime\": 1687302801.374522, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"epoch\": 14, \"Meta\": \"training_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 18136.0, \"count\": 1, \"min\": 18136, \"max\": 18136}, \"Total Batches Seen\": {\"sum\": 33.0, \"count\": 1, \"min\": 33, \"max\": 33}, \"Max Records Seen Between Resets\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Max Batches Seen Between Resets\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}, \"Reset Count\": {\"sum\": 17.0, \"count\": 1, \"min\": 17, \"max\": 17}, \"Number of Records Since Last Reset\": {\"sum\": 1071.0, \"count\": 1, \"min\": 1071, \"max\": 1071}, \"Number of Batches Since Last Reset\": {\"sum\": 2.0, \"count\": 1, \"min\": 2, \"max\": 2}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #throughput_metric: host=algo-1, train throughput=49180.52074141386 records/second\n", "[06/20/2023 23:13:21 WARNING 140324763719488] wait_for_all_workers will not sync workers since the kv store is not running distributed\n", "[06/20/2023 23:13:21 WARNING 140324763719488] wait_for_all_workers will not sync workers since the kv store is not running distributed\n", "[2023-06-20 23:13:21.376] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 34, \"duration\": 1, \"num_examples\": 1, \"num_bytes\": 64000}\n", "[2023-06-20 23:13:21.381] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/train\", \"epoch\": 36, \"duration\": 3, \"num_examples\": 2, \"num_bytes\": 68544}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #train_score (algo-1) : ('mse_objective', 1.4728637165229082e-05)\n", "[06/20/2023 23:13:21 INFO 
140324763719488] #train_score (algo-1) : ('mse', 1.4728637165229082e-05)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #train_score (algo-1) : ('absolute_loss', 0.002878888286684964)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #train_score (algo-1) : ('rmse', 0.003837790661986279)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #train_score (algo-1) : ('r2', 0.2434803334779041)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #train_score (algo-1) : ('mae', 0.0028788883294031914)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train mse_objective =1.4728637165229082e-05\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train mse =1.4728637165229082e-05\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train absolute_loss =0.002878888286684964\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train rmse =0.003837790661986279\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train r2 =0.2434803334779041\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, train mae =0.0028788883294031914\n", "[06/20/2023 23:13:21 INFO 140324763719488] Best model found for hyperparameters: {\"optimizer\": \"adam\", \"learning_rate\": 0.1, \"wd\": 0.01, \"l1\": 0.0, \"lr_scheduler_step\": 10, \"lr_scheduler_factor\": 0.99, \"lr_scheduler_minimum_lr\": 0.0001}\n", "[06/20/2023 23:13:21 INFO 140324763719488] Saved checkpoint to \"/tmp/tmp0ovn5j6l/mx-mod-0000.params\"\n", "[2023-06-20 23:13:21.384] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/test\", \"epoch\": 0, \"duration\": 472, \"num_examples\": 1, \"num_bytes\": 10880}\n", "[2023-06-20 23:13:21.385] [tensorio] [info] epoch_stats={\"data_pipeline\": \"/opt/ml/input/data/test\", \"epoch\": 1, \"duration\": 1, \"num_examples\": 1, \"num_bytes\": 10880}\n", "#metrics {\"StartTime\": 1687302801.3843052, \"EndTime\": 1687302801.386345, 
\"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\", \"Meta\": \"test_data_iter\"}, \"Metrics\": {\"Total Records Seen\": {\"sum\": 170.0, \"count\": 1, \"min\": 170, \"max\": 170}, \"Total Batches Seen\": {\"sum\": 1.0, \"count\": 1, \"min\": 1, \"max\": 1}, \"Max Records Seen Between Resets\": {\"sum\": 170.0, \"count\": 1, \"min\": 170, \"max\": 170}, \"Max Batches Seen Between Resets\": {\"sum\": 1.0, \"count\": 1, \"min\": 1, \"max\": 1}, \"Reset Count\": {\"sum\": 1.0, \"count\": 1, \"min\": 1, \"max\": 1}, \"Number of Records Since Last Reset\": {\"sum\": 170.0, \"count\": 1, \"min\": 170, \"max\": 170}, \"Number of Batches Since Last Reset\": {\"sum\": 1.0, \"count\": 1, \"min\": 1, \"max\": 1}}}\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('mse_objective', 1.9604377174640405e-05)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('mse', 1.9604377174640405e-05)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('absolute_loss', 0.0036581849350648767)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('rmse', 0.0044276830481235225)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('r2', -0.6555626788989293)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #test_score (algo-1) : ('mae', 0.0036581848042688387)\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, test mse_objective =1.9604377174640405e-05\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, test mse =1.9604377174640405e-05\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, test absolute_loss =0.0036581849350648767\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, test rmse =0.0044276830481235225\n", "[06/20/2023 23:13:21 INFO 140324763719488] #quality_metric: host=algo-1, test r2 =-0.6555626788989293\n", "[06/20/2023 23:13:21 INFO 
140324763719488] #quality_metric: host=algo-1, test mae =0.0036581848042688387\n", "#metrics {\"StartTime\": 1687302800.9104261, \"EndTime\": 1687302801.389445, \"Dimensions\": {\"Algorithm\": \"Linear Learner\", \"Host\": \"algo-1\", \"Operation\": \"training\"}, \"Metrics\": {\"initialize.time\": {\"sum\": 105.99493980407715, \"count\": 1, \"min\": 105.99493980407715, \"max\": 105.99493980407715}, \"epochs\": {\"sum\": 15.0, \"count\": 1, \"min\": 15, \"max\": 15}, \"check_early_stopping.time\": {\"sum\": 5.173683166503906, \"count\": 15, \"min\": 0.12278556823730469, \"max\": 0.7529258728027344}, \"update.time\": {\"sum\": 319.87881660461426, \"count\": 15, \"min\": 16.545772552490234, \"max\": 44.8908805847168}, \"finalize.time\": {\"sum\": 7.80940055847168, \"count\": 1, \"min\": 7.80940055847168, \"max\": 7.80940055847168}, \"setuptime\": {\"sum\": 1.8687248229980469, \"count\": 1, \"min\": 1.8687248229980469, \"max\": 1.8687248229980469}, \"totaltime\": {\"sum\": 569.9367523193359, \"count\": 1, \"min\": 569.9367523193359, \"max\": 569.9367523193359}}}\n", "\n", "2023-06-20 23:13:36 Uploading - Uploading generated training model\n", "2023-06-20 23:13:36 Completed - Training job completed\n", "Training seconds: 107\n", "Billable seconds: 49\n", "Managed Spot Training savings: 54.2%\n" ] } ], "source": [ "import boto3\n", "from sagemaker import get_execution_role, LinearLearner\n", "\n", "# Setting the bucket name\n", "bucket = 'fcc-spark-example'\n", "data_location = f's3://{bucket}/aq-linearlearner/data/train'\n", "output_location = f's3://{bucket}/aq-linearlearner/output'\n", "\n", "\n", "# This is used to run the LinearLearner training job\n", "role = get_execution_role()\n", "\n", "llearner = LinearLearner(role=role,\n", " predictor_type='regressor',\n", " normalize_data=True,\n", " normalize_label=True,\n", " instance_count=1,\n", " use_spot_instances = True,\n", " max_run= 1800,\n", " max_wait = 3600,\n", " instance_type='ml.c5.xlarge',\n", " 
output_path=output_location,\n", " data_location=data_location)\n", "\n", "# Start training: record_set converts each array to RecordIO-protobuf, uploads it to S3, and binds it to the named channel\n", "llearner.fit([\n", " llearner.record_set(x_train.values.astype('float32'), y_train.values[:, 0].astype('float32'), channel='train'),\n", " llearner.record_set(x_test.values.astype('float32'), y_test.values[:, 0].astype('float32'), channel='test')\n", "])" ] }, { "cell_type": "markdown", "id": "8a7a9012-717b-4e02-abe9-1dd18f81a95c", "metadata": {}, "source": [ "## Deploying the model (`endpoint`)" ] }, { "cell_type": "code", "execution_count": 14, "id": "68f22bf9-885e-451f-8585-81bef9646223", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "-------------------!\n" ] } ], "source": [ "# Deploy the trained model to a real-time inference endpoint\n", "llearner_predictor = llearner.deploy(initial_instance_count=1,\n", " instance_type='ml.t2.medium')" ] }, { "cell_type": "code", "execution_count": 15, "id": "9088f3a7-a367-4c48-8777-9fc69d3177e2", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "RMSE: 0.0044\n", "Variance score: -0.6556\n", "Explained variance score: 0.3762\n", "Forecast bias: 0.0035\n", "sMAPE: 1.4279\n" ] } ], "source": [ "# Score the test features against the endpoint\n", "result = llearner_predictor.predict(x_test.values.astype('float32'))\n", "# Each returned record carries its prediction under the 'score' label\n", "y_sm_pred = [r.label[\"score\"].float32_tensor.values[0] for r in result]\n", "y_sm_test = y_test.values[:, 0].astype('float32')\n", "# Compare the predictions with the held-out labels\n", "print_metrics(y_sm_test, y_sm_pred)" ] }, { "cell_type": "code", "execution_count": 16, "id": "379bbe1e-7474-4fa1-b1dd-7ce7eef1aaa8", "metadata": { "tags": [] }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "linear-learner-2023-06-20-23-15-00-813\n" ] } ], "source": [ "endpoint_name = llearner_predictor.endpoint_name\n", "print(endpoint_name)" ] }, { "cell_type": "markdown", "id": "45de4509-1e54-47a4-b677-4d9192f0f812", "metadata": {}, "source": [ "## Stop the session" ] }, { "cell_type": "code", "execution_count": null, "id": "53d444d8-9089-480d-8e93-3fafe9c0e1d9", 
"metadata": {}, "outputs": [], "source": [ "%stop_session" ] }, { "cell_type": "markdown", "id": "404748ee-5530-4e5a-9b12-0452976ec6a1", "metadata": { "tags": [] }, "source": [ "# Clean up" ] }, { "cell_type": "code", "execution_count": null, "id": "d77c110c-e93c-40ed-b528-df45b142139f", "metadata": {}, "outputs": [], "source": [ "# Delete the endpoint so it stops incurring charges.\n", "# Uncomment the next line to run it.\n", "# llearner_predictor.delete_endpoint()" ] }, { "cell_type": "markdown", "id": "ac243b68-8170-42a6-a337-9d9643cb484f", "metadata": {}, "source": [ "
Demo Code\n", "\n", "[Figure: Intro]
" ] } ], "metadata": { "availableInstances": [ { "_defaultOrder": 0, "_isFastLaunch": true, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 4, "name": "ml.t3.medium", "vcpuNum": 2 }, { "_defaultOrder": 1, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 8, "name": "ml.t3.large", "vcpuNum": 2 }, { "_defaultOrder": 2, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.t3.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 3, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.t3.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 4, "_isFastLaunch": true, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 8, "name": "ml.m5.large", "vcpuNum": 2 }, { "_defaultOrder": 5, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.m5.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 6, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.m5.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 7, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 64, "name": "ml.m5.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 8, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 128, "name": "ml.m5.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 9, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 192, "name": "ml.m5.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 10, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 256, "name": "ml.m5.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 
11, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 384, "name": "ml.m5.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 12, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 8, "name": "ml.m5d.large", "vcpuNum": 2 }, { "_defaultOrder": 13, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.m5d.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 14, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.m5d.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 15, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 64, "name": "ml.m5d.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 16, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 128, "name": "ml.m5d.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 17, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 192, "name": "ml.m5d.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 18, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 256, "name": "ml.m5d.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 19, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 384, "name": "ml.m5d.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 20, "_isFastLaunch": false, "category": "General purpose", "gpuNum": 0, "hideHardwareSpecs": true, "memoryGiB": 0, "name": "ml.geospatial.interactive", "supportedImageNames": [ "sagemaker-geospatial-v1-0" ], "vcpuNum": 0 }, { "_defaultOrder": 21, "_isFastLaunch": true, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 4, "name": "ml.c5.large", 
"vcpuNum": 2 }, { "_defaultOrder": 22, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 8, "name": "ml.c5.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 23, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.c5.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 24, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.c5.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 25, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 72, "name": "ml.c5.9xlarge", "vcpuNum": 36 }, { "_defaultOrder": 26, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 96, "name": "ml.c5.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 27, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 144, "name": "ml.c5.18xlarge", "vcpuNum": 72 }, { "_defaultOrder": 28, "_isFastLaunch": false, "category": "Compute optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 192, "name": "ml.c5.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 29, "_isFastLaunch": true, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.g4dn.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 30, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.g4dn.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 31, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 64, "name": "ml.g4dn.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 32, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 128, "name": 
"ml.g4dn.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 33, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 4, "hideHardwareSpecs": false, "memoryGiB": 192, "name": "ml.g4dn.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 34, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 256, "name": "ml.g4dn.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 35, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 61, "name": "ml.p3.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 36, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 4, "hideHardwareSpecs": false, "memoryGiB": 244, "name": "ml.p3.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 37, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 8, "hideHardwareSpecs": false, "memoryGiB": 488, "name": "ml.p3.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 38, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 8, "hideHardwareSpecs": false, "memoryGiB": 768, "name": "ml.p3dn.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 39, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.r5.large", "vcpuNum": 2 }, { "_defaultOrder": 40, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.r5.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 41, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 64, "name": "ml.r5.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 42, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 128, "name": "ml.r5.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 43, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 
256, "name": "ml.r5.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 44, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 384, "name": "ml.r5.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 45, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 512, "name": "ml.r5.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 46, "_isFastLaunch": false, "category": "Memory Optimized", "gpuNum": 0, "hideHardwareSpecs": false, "memoryGiB": 768, "name": "ml.r5.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 47, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 16, "name": "ml.g5.xlarge", "vcpuNum": 4 }, { "_defaultOrder": 48, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 32, "name": "ml.g5.2xlarge", "vcpuNum": 8 }, { "_defaultOrder": 49, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 64, "name": "ml.g5.4xlarge", "vcpuNum": 16 }, { "_defaultOrder": 50, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 128, "name": "ml.g5.8xlarge", "vcpuNum": 32 }, { "_defaultOrder": 51, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 1, "hideHardwareSpecs": false, "memoryGiB": 256, "name": "ml.g5.16xlarge", "vcpuNum": 64 }, { "_defaultOrder": 52, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 4, "hideHardwareSpecs": false, "memoryGiB": 192, "name": "ml.g5.12xlarge", "vcpuNum": 48 }, { "_defaultOrder": 53, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 4, "hideHardwareSpecs": false, "memoryGiB": 384, "name": "ml.g5.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 54, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 8, "hideHardwareSpecs": 
false, "memoryGiB": 768, "name": "ml.g5.48xlarge", "vcpuNum": 192 }, { "_defaultOrder": 55, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 8, "hideHardwareSpecs": false, "memoryGiB": 1152, "name": "ml.p4d.24xlarge", "vcpuNum": 96 }, { "_defaultOrder": 56, "_isFastLaunch": false, "category": "Accelerated computing", "gpuNum": 8, "hideHardwareSpecs": false, "memoryGiB": 1152, "name": "ml.p4de.24xlarge", "vcpuNum": 96 } ], "instance_type": "ml.m5.4xlarge", "kernelspec": { "display_name": "Glue Python [PySpark and Ray] (SparkAnalytics 2.0)", "language": "python", "name": "conda-env-sm_glue_is-glue_pyspark__SAGEMAKER_INTERNAL__arn:aws:sagemaker:us-east-2:429704687514:image/sagemaker-sparkanalytics-310-v1" }, "language_info": { "codemirror_mode": { "name": "python", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "Python_Glue_Session", "pygments_lexer": "python3" } }, "nbformat": 4, "nbformat_minor": 5 }