{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Image Classification with MNIST Using a Petastorm Dataset\n", "\n", "In this notebook we read a training dataset stored in the Petastorm format in the project's feature store and use it to train a deep CNN, defined in Keras/TensorFlow, to classify images of handwritten digits from the MNIST dataset.\n", "\n", "This notebook assumes that you have already created the training datasets in the feature store, which you can do by running the following notebook:\n", "\n", "[Create Petastorm MNIST Dataset Notebook](PetastormMNIST_CreateDataset.ipynb)\n", "\n", "" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Imports" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Starting Spark application\n" ] }, { "data": { "text/html": [ "
ID | YARN Application ID | Kind | State | Spark UI | Driver log | Current session? |
---|---|---|---|---|---|---|
4 | application_1559565096638_0006 | pyspark | idle | Link | Link | ✔ |