{ "cells": [ { "cell_type": "markdown", "id": "independent-thread", "metadata": {}, "source": [ "---\n", "title: \"Maggy distributed training ResNet-50 on ImageNet (Petastorm)\"\n", "date: 2021-05-03\n", "type: technical_note\n", "draft: false\n", "---" ] }, { "cell_type": "markdown", "id": "therapeutic-working", "metadata": {}, "source": [ "## Training ResNet-50 on ImageNet from a Petastorm dataset\n", "In this notebook, we train a ResNet-50 network on a 10-label subset of the original ImageNet dataset. To reduce I/O time compared to standard ImageNet training, we use the Petastorm version of the dataset created in `ImageNet_to_petastorm`." ] }, { "cell_type": "code", "execution_count": 1, "id": "parallel-craft", "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Starting Spark application\n" ] }, { "data": { "text/html": [ "
ID | YARN Application ID | Kind | State | Spark UI | Driver log |
---|---|---|---|---|---|
186 | application_1617699042861_0013 | pyspark | idle | Link | Link |