Table of contents
1. What are Autoencoders?
2. Applications of Autoencoders
   2.1. Image Denoising
   2.2. Recommendation System
   2.3. Image Generation
3. Building a simple Autoencoder from scratch
   3.1. Step 1: Importing Necessary Libraries
   3.2. Step 2: Loading the MNIST dataset in the notebook
   3.3. Step 3: Data Preparation
   3.4. Step 4: Initializing the Autoencoder Model
   3.5. Step 5: The Encoder and the Decoder Model
   3.6. Step 6: Training the model on the MNIST digits dataset
   3.7. Output
   3.8. Step 7: Generating Predictions
   3.9. Step 8: Visualizing the difference between original and reconstructed images
   3.10. Output
   3.11. Result
4. Frequently Asked Questions
5. Key Takeaways
Last Updated: Mar 27, 2024

Autoencoders - Introduction & Implementation


What are Autoencoders?

An autoencoder is a feed-forward neural network whose input and output are the same. It encodes the input (for example, an image) into a compressed representation and then decodes it to reconstruct the original. The core idea of autoencoders is that the middle (bottleneck) layer must contain enough information to represent the input.

 

There are three important properties of autoencoders:

1. Data Specific: We can only use an autoencoder on the kind of data it was trained on. For instance, to encode an MNIST digits image, we have to use an autoencoder that was previously trained on the MNIST digits dataset.

2. Lossy: Information is lost while encoding and decoding the images using autoencoders, which means that the reconstructed image will have some details missing as compared to the original image.

3. Unsupervised: Autoencoders belong to the unsupervised machine learning category because we do not require explicit labels corresponding to the data; the data itself acts as input and output.

 

Caption: Architecture of an Autoencoder

Applications of Autoencoders

We primarily use autoencoders for data compression or dimensionality reduction. Once we have a more condensed (low-dimensional) representation of multidimensional data, we can easily visualize it.

Image Denoising

Noise in an image signifies corrupted or bad pixels. To recover a clean image, we have to denoise it, and autoencoders are well suited to this task.

 

To build a denoising autoencoder, we first add noise to our original images and then train the feed-forward neural network to reconstruct the clean originals from the noisy inputs.
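The noise-adding step above can be sketched in a few lines. This is a minimal, hypothetical example assuming images are already scaled to [0, 1]; the `noise_factor` of 0.4 and the random stand-in batch are illustrative choices, not values from the article.

```python
import numpy as np

def add_gaussian_noise(images, noise_factor=0.4, seed=0):
    """Corrupt [0, 1]-scaled images with Gaussian noise, clipping back to [0, 1]."""
    rng = np.random.default_rng(seed)
    noisy = images + noise_factor * rng.standard_normal(images.shape)
    return np.clip(noisy, 0.0, 1.0)

# Stand-in batch of four 28x28 "images" (random values in [0, 1])
clean = np.random.default_rng(1).random((4, 28, 28))
noisy = add_gaussian_noise(clean)

# A denoising autoencoder would then be trained on (noisy, clean) pairs,
# e.g. autoencoder.fit(noisy_train, clean_train, ...)
```

The key point is that the noisy images are the inputs and the clean images are the targets, so the network learns to strip the noise away.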

 

Original image

 

Adding noise to the above image will yield the following image.

Noisy image

 

After passing the noisy image through our trained autoencoder, it outputs the denoised image below.

 

Denoised Output

Recommendation System

We can use autoencoders to give users personalized recommendations based on their history. Deep autoencoders are used for Spotify music recommendations, YouTube video recommendations, or Netflix movie recommendations.

 

The input data is a user's history of songs listened to or videos watched. When we feed this history to the autoencoder, the encoder captures the user's interests, and the decoder then generates scores for videos or songs similar to that history.
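The recommendation step can be sketched as follows. This is a hypothetical, numpy-only illustration: the `reconstruction` vector stands in for what a trained autoencoder's output (e.g. `autoencoder.predict(history)`) would look like, and the item counts are made up.

```python
import numpy as np

def recommend(history, reconstruction, k=3):
    """Rank items the user has NOT interacted with by the autoencoder's
    reconstructed scores and return the indices of the top k."""
    scores = np.where(history > 0, -np.inf, reconstruction)  # mask already-seen items
    return np.argsort(scores)[::-1][:k]

# Hypothetical catalog of 8 items; the user has watched items 0 and 3
history = np.array([1, 0, 0, 1, 0, 0, 0, 0], dtype=float)
# Stand-in for the autoencoder's reconstruction: predicted preference scores
reconstruction = np.array([0.9, 0.1, 0.7, 0.8, 0.2, 0.6, 0.05, 0.3])

print(recommend(history, reconstruction))  # [2 5 7]
```

Because the autoencoder assigns high reconstructed scores even to items the user never interacted with, masking out the watched items and ranking the rest yields the recommendations.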

Image Generation

With the autoencoders, we can also generate similar images. Variational Autoencoder (VAE) is a type of generative model, which we use to generate images.

 

For instance, if we input a human face to the autoencoder, we will get similar face instances with slight tweaks.
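Those "slight tweaks" come from sampling: a VAE encodes each input to a mean and a (log-)variance rather than a single code, and draws latent vectors via the reparameterization trick. A minimal sketch of that sampling step, with made-up two-dimensional latent values for illustration:

```python
import numpy as np

def reparameterize(mu, log_var, rng=None):
    """Sample z = mu + sigma * eps with eps ~ N(0, I): the reparameterization
    trick, which keeps sampling differentiable w.r.t. mu and log_var."""
    rng = rng or np.random.default_rng(0)
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

# Hypothetical 2-dimensional latent code for one encoded face
mu = np.array([0.5, -1.0])
log_var = np.zeros(2)  # sigma = 1 in both dimensions

z = reparameterize(mu, log_var)
# Feeding different samples z to the decoder yields slightly different faces
```

Each call draws a different `z` near `mu`, which is why decoding repeated samples of the same input produces similar-but-varied faces.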

 

Caption: Using autoencoder to generate anime faces

Source: https://iq.opengenus.org/

 

Beyond human faces, we can also use the variational autoencoder (VAE) to generate nature scenery, pictures of historical monuments, aesthetic images, and more.

Building a simple Autoencoder from scratch

To implement an autoencoder, we have to set some hyper-parameters:

 

  1. Code Size: The size of the compressed representation. A smaller code size gives a more condensed representation of the input, and vice versa.
  2. Layers: The number of layers in the encoder and decoder; we can specify any number, and more layers let the network learn more complex features.
  3. Loss Function: To measure reconstruction loss, we use binary cross-entropy if the input values range from 0 to 1, and mean squared error otherwise.
  4. Nodes: The number of nodes/neurons per layer; we can choose any number of neurons for each hidden layer (the input and output sizes are fixed by the data).
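The first and third choices above can be made concrete with a quick sketch. The numbers match the MNIST setup used later (784 flattened pixels, code size 32); the loss-selection rule is the text's own rule of thumb expressed as code.

```python
import numpy as np

input_dim = 784   # a flattened 28x28 MNIST image
code_size = 32    # size of the compressed representation

# Compression factor: how many input values each code value must summarize
compression_factor = input_dim / code_size
print(compression_factor)  # 24.5

# Loss choice rule of thumb: binary cross-entropy for [0, 1]-scaled inputs
x = np.random.default_rng(0).random((10, input_dim))  # already in [0, 1]
loss = 'binary_crossentropy' if (x.min() >= 0 and x.max() <= 1) else 'mse'
print(loss)  # binary_crossentropy
```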

 

It is pretty simple to build a one-layered autoencoder. Let’s see the stepwise demonstration.

Step 1: Importing Necessary Libraries

import numpy as np

import matplotlib.pyplot as plt

import tensorflow as tf
import keras
from keras import layers
from keras.datasets import mnist    # We will be working with MNIST Digits Images dataset

 

Step 2: Loading the MNIST dataset in the notebook

# Loading the dataset in the notebook

(x_train, _), (x_test, _) = mnist.load_data()

print(x_train.shape)
print(x_test.shape)

 

Step 3: Data Preparation

# Normalizing the dataset (setting pixel values between 0 and 1)
x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.

# Flattening the 28x28 images into a vector of size 784
x_train = x_train.reshape((len(x_train), np.prod(x_train.shape[1:])))
x_test = x_test.reshape((len(x_test), np.prod(x_test.shape[1:])))

print(x_train.shape)
print(x_test.shape)

 

Step 4: Initializing the Autoencoder Model

# The autoencoder will have only one input layer, only one hidden layer, and one output layer

encoded_dimensions = 32    # We are compressing 784 values into 32, a dimensionality reduction factor of 784/32 = 24.5

input_image = keras.Input(shape=(x_train.shape[1],))    # The input layer takes all 784 pixels (note the trailing comma: shape must be a tuple)

encoded = layers.Dense(encoded_dimensions, activation='relu')(input_image)    # Encoded input image with 32 pixels

decoded = layers.Dense(x_train.shape[1], activation='sigmoid')(encoded)    # Decoded encoded image with 784 pixels

autoencoder = keras.Model(inputs = input_image, outputs = decoded)

 

Step 5: The Encoder and the Decoder Model

# Encoder Model

encoder = keras.Model(inputs = input_image, outputs = encoded)

# Decoder Model

encoded_input = keras.Input(shape=(encoded_dimensions,))
decoder_layer = autoencoder.layers[-1]
decoder = keras.Model(encoded_input, decoder_layer(encoded_input))

 

Step 6: Training the model on the MNIST digits dataset

# Training our model on MNIST digits images dataset

autoencoder.compile(optimizer='adam', loss='binary_crossentropy')

autoencoder.fit(x_train, x_train, epochs=50, batch_size=256, shuffle=True, validation_data = (x_test, x_test))

Output

 

Step 7: Generating Predictions

encoded_imgs = encoder.predict(x_test)
decoded_imgs = decoder.predict(encoded_imgs)

 

Step 8: Visualizing the difference between original and reconstructed images

n = 6    # To display six digits

plt.figure(figsize=(20, 4))

for i in range(0, n):
   
   # Original Images
   ax = plt.subplot(2, n, i + 1)
   plt.imshow(x_test[i].reshape(28, 28))
   plt.gray()
   ax.get_xaxis().set_visible(False)
   ax.get_yaxis().set_visible(False)

   # Reconstructed Images
   ax = plt.subplot(2, n, i + 1 + n)
   plt.imshow(decoded_imgs[i].reshape(28, 28))
   plt.gray()
   ax.get_xaxis().set_visible(False)
   ax.get_yaxis().set_visible(False)
 
plt.show()

Output

 

Result

The images in the first row are the originals, and the images in the second row are the reconstructions. Because we built a very elementary one-layered network, some detail is lost in the output images; training a deeper model and tuning the hyper-parameters would produce more detailed reconstructions.

 

Frequently Asked Questions

Q1. What are the different types of Autoencoders?

Ans. There are seven types of Autoencoders:

  1. Sparse Autoencoder
  2. Deep Autoencoder
  3. Convolutional Autoencoder
  4. Contractive Autoencoder
  5. Variational Autoencoder
  6. Denoising Autoencoder
  7. Undercomplete Autoencoder

 

Q2. What are the essential components of an autoencoder?

Ans. Every autoencoder has three components:

  1. Encoder
  2. Code
  3. Decoder

 

Q3. Autoencoders belong to which category of Machine Learning?

Ans. Autoencoders belong to the unsupervised machine learning category; they do not need explicit labels for training because input and output are the same.

 

Q4. What are the three properties of Autoencoders?

Ans. The three properties of autoencoders are:

  1. Data Specific,
  2. Lossy (the reconstructed images lose detail compared to the originals),
  3. Unsupervised (they learn automatically from the data examples).

 

Q5. What is Denoising Autoencoder?

Ans. The idea of the denoising autoencoder is that we add random noise to the input images and then ask the autoencoder to recover the original image from the noisy one. The autoencoder has to subtract the noise and output only the meaningful features.

Key Takeaways

Congratulations on finishing the blog!! Below, I have some blog suggestions for you. Go ahead and take a look at these informative articles.

In today’s scenario, more and more industries are adopting AutoML in their products; with this rise, it has become clear that AutoML could be the next boon in technology. Check this article to learn more about AutoML applications.

Check out this link if you are a Machine Learning enthusiast or want to brush up your knowledge with ML blogs.
