This document describes the training of a neural network using gradient descent. It shows the weights and outputs over 15 epochs of training on a single data point. The squared error and mean squared error are calculated after each epoch, with the weights being adjusted based on the learning rate of 0.2. The network consists of 2 input nodes, 2 hidden nodes, and 1 output node.