Project IA Deep Learning - Supervised

DataSet MNIST Fashion.

Philippe Jadoul - january 2020

Fashion MNIST - Network Dense

Imports

library.jpg

Import my_fonctions

Tests All

train-test-set.jpg

Load the dataset: Fashion MNIST

dataset-cover.png dataset which contains 70,000 grayscale images in 10 categories. The images show individual articles of clothing at low resolution (28 by 28 pixels), train_images = 60.000, train_test = 10000

Import and load the Fashion MNIST data from local folder data - (Test) files compressed *.gz

Import and load the Fashion MNIST data directly from TensorFlow:

validation-set.jpg

Create Validation Dataset

Distribution Data Before Normalization

Flatten Data images - (28x28) - > (784) Pixels

normalization.jpg

Normalization - Simple

Distribution Data After Simple Normalization

Type of class_names :

Label Class
0 T-shirt/top
1 Trouser
2 Pullover
3 Dress
4 Coat
5 Sandal
6 Shirt
7 Sneaker
8 Bag
9 Ankle boot

Normalization - StandardScater - $z = \frac{(X - \overline{U})}{\sigma}$

Distribution Data Aflter StandardScater Normalization

one-hot-encoding.jpg

One-hot encoding

List targets_names - Labels og Classes

To verify that the data is in the correct format and that you're ready to build and train the network, let's display the first 25 images from the training set and display the class name below each image.

Create the model

simple_nn.jpg

Create the Dense model

Compiling the sequentail model - Dense

compile-model.jpg

Function Compile Model

Compile Model

Train the model

train-set.jpg

(1) Training the model - History model fit

history = model.fit(X_train, Y_train, epochs=50,batch_size=None, validation_split=0.0, verbose=False )

Graph - fig-1.png

Model Summary

train-evaluate.jpg

Function Reset Model Dense (Re-Create & Re-Compile)

Reset Model Dense

Evaluate the Models's Performance & optimalisation metrics

(2) Training with split Data Train 20% - epochs 50 - history model fit

history = model.fit(X_train, Y_train, epochs=50,batch_size=None,validation_split=0.2, verbose=False)

Graph - fig-2.png

Reset Model Dense

(3) Training the model - Amelioration witn batch_size 256 - epochs 50 - validation_split 20%

history = model.fit(X_train, Y_train, epochs=50,batch_size=256, validation_split=0.2, verbose=False)

Graph - fig-3.png

Reset Model Dense

(4) Training with separate Validation Data & epochs 50

history = model.fit(X_train, Y_train, epochs=50,batch_size=None, validation_data=(X_train_validate,Y_train_validate), verbose=False )

Graph - fig-4.png

Reset Model Dense

(5) Training with Validation Data Set - epochs 50 batch size 256

history = model.fit(X_train, Y_train, epochs=50,batch_size=256, validation_data=(X_train_validate,Y_train_validate), verbose=True )

Graph - fig-5.png

Reset Model Dense

(6) Training with Validation Data Set - epochs 256 - batch_size 512

history = model.fit(X_train, Y_train, epochs=100,batch_size=256, validation_data=(X_train_validate,Y_train_validate), verbose=False )

Graph - fig-6.png

function plotting the metrics of train dataset accuracy & loss - from my library

The Best Model Seems to be the (5) - Re-Training - Metrics (5)

Reset Model Dense

(5) Re-Training with Validation Data Set - epochs 50 batch size 256

history = model.fit(X_train, Y_train, epochs=50,batch_size=256, validation_data=(X_train_validate,Y_train_validate), verbose=True )

Best Model (5)

Model Summary

evaluate-model.jpg

Evaluate the Models' Performance with the best current metrics

compare how the model performs on the test dataset:

save the model after training (5)

Delete the model

Load the Model (5)

Model Summary

Re-Evaluate the Models' Performance with the best current metrics (5)

compare how the model performs on the test dataset:

Make predictions

With the model trained, you can use it to make predictions about some images.

resulat prediction image index 12 - (probability - class ( 0 to 9 )

position (index) of the maximum probabability

one-hot encoding - class prediction image index 12

reverse one-hot encoding - images 12

name of the predicted class

image predicted

accuracy of the prediction in % - img 12

import fonction plotresults - my library

Let's plot several images with their predictions. Note that the model can be wrong even when very confident.

Where Does the Model Make Mistakes ? - Statistics -

Conclusion.

First, this neural network is a basic model. Despite obtaining an accuracy at 99.46% in training and validation at 99.70% without Overfitting. During real tests, this network only succeeds in detecting 83.64% of the 10,000 images injected in the test.

One of the problems comes from the "Flatten" operation which converts images of dimension (28x28) into a vector of (784) pixels. This leads to loss of information on the images.

In image processing in Deep Learning Supervised, we prefer to use a Convolution network.
A.S.A.P - Philippe Jadoul