Basic Steps in Neural-Network-Based Algorithms: A Mini-Guide for Absolute Beginners
This article walks beginners through the flow of an artificial neural network (ANN), from the input data to the training step that corrects the network's errors.
Here’s a summary of the main points covered in the article:
- Input Layer: The input to an ANN consists of numeric values represented as a vector. The input data, such as text, images, or speech, needs to be pre-processed and transformed into appropriate numeric representations.
- Hidden Layers: Hidden layers sit between the input and output layers and are where the network learns intermediate representations of the data. An ANN can have one or more hidden layers, with each layer containing one or more nodes. The architecture of the network is defined by the number of layers and the number of nodes in each layer.
- Weights and Biases: Weights and biases are the trainable parameters of a neural network. Each node outside the input layer has one weight per incoming connection and a single bias. These values are adjusted during training to minimize the prediction error.
- Activation Functions: An activation function is applied to a node's weighted sum of inputs plus bias, and the result is what the node passes to the next layer. Non-linear activation functions let the network learn patterns that a purely linear model cannot. Common choices such as sigmoid, tanh, and ReLU each have specific advantages and applications.
- Output Layer: The output layer is the final layer in the neural network, where predictions are obtained. The activation function used in the output layer depends on the type of problem: for example, sigmoid or softmax for classification and a linear (identity) output for regression.
- Setup and Initialization: Before training the model, the input data is preprocessed and split into training, validation, and test sets, and hyperparameters such as the learning rate and the number of training epochs are selected. The weights and biases are then initialized, often with random values.
- Forward Propagation: During forward propagation, input data is passed through the neural network to generate predictions. The outputs are compared with the actual values to compute the error.
- Measuring Accuracy and Error: The error is measured using loss and cost functions. Loss functions measure the error for individual samples, while cost functions provide an average error over a set of samples.
- Back Propagation: Back propagation works out how much each weight and bias contributed to the prediction error. It starts from the output layer and propagates the error backward through the network, layer by layer, so that each parameter can be adjusted in the direction that reduces the error (typically with gradient descent).
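
The steps summarized above can be sketched end to end in a few dozen lines. What follows is a minimal illustration, not the article's own implementation. It assumes a network with one hidden layer, sigmoid activations throughout, squared error as the loss, and plain per-sample gradient descent. The function names (`init_params`, `forward`, `loss`, `cost`, `backward_step`, `train`), the toy logical-OR dataset, and the hyperparameter values are all hypothetical choices made for this example. Only the Python standard library is used.

```python
import math
import random


def sigmoid(z):
    """Activation function: squashes any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def init_params(n_in, n_hidden, seed=0):
    """Setup and initialization: small random weights, zero biases."""
    rng = random.Random(seed)
    return {
        # W1[j][i] is the weight from input i to hidden node j.
        "W1": [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        # W2[j] is the weight from hidden node j to the single output node.
        "W2": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "b2": 0.0,
    }


def forward(p, x):
    """Forward propagation: input vector -> hidden activations -> prediction."""
    h = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
         for row, b in zip(p["W1"], p["b1"])]
    y_hat = sigmoid(sum(w * hj for w, hj in zip(p["W2"], h)) + p["b2"])
    return h, y_hat


def loss(y_hat, y):
    """Loss: squared error for a single sample."""
    return (y_hat - y) ** 2


def cost(p, X, Y):
    """Cost: the loss averaged over a set of samples."""
    return sum(loss(forward(p, x)[1], y) for x, y in zip(X, Y)) / len(X)


def backward_step(p, x, y, lr):
    """Back propagation for one sample, followed by a gradient descent update."""
    h, y_hat = forward(p, x)
    # Error signal at the output node (derivative of sigmoid is s * (1 - s)).
    d_out = 2.0 * (y_hat - y) * y_hat * (1.0 - y_hat)
    # Propagate the error back to each hidden node, using the old W2 values.
    d_hidden = [d_out * w * hj * (1.0 - hj) for w, hj in zip(p["W2"], h)]
    # Update the output layer, then the hidden layer.
    p["W2"] = [w - lr * d_out * hj for w, hj in zip(p["W2"], h)]
    p["b2"] -= lr * d_out
    p["W1"] = [[w - lr * dj * xi for w, xi in zip(row, x)]
               for row, dj in zip(p["W1"], d_hidden)]
    p["b1"] = [b - lr * dj for b, dj in zip(p["b1"], d_hidden)]


def train(p, X, Y, lr=0.5, epochs=2000):
    """Repeat forward and backward passes over the training set."""
    for _ in range(epochs):
        for x, y in zip(X, Y):
            backward_step(p, x, y, lr)
    return p


# Toy dataset (an assumption for this sketch): logical OR of two inputs.
X = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
Y = [0.0, 1.0, 1.0, 1.0]

params = init_params(n_in=2, n_hidden=3)
cost_before = cost(params, X, Y)
train(params, X, Y)
cost_after = cost(params, X, Y)
print(f"cost before training: {cost_before:.4f}, after: {cost_after:.4f}")
```

The sketch skips the train/validation/test split and input preprocessing because the toy dataset is already numeric and has only four samples. With real data, those setup steps would come before `init_params`.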