Neural networks, in the tech field, are useful for statistic regression, data classification, product recommentation, computer vision, natural language understanding and synthesis, speech to text, text to speech, and many other complex tasks. Neural networks are used in machine learning and in deep learning, they are related to artificial intelligence.
A neural network learns by example, it is meant to be trained with data in, data out, to later be able to predict the output given an input similar to what it was trained on.
Many framework exists for programming, training and using artificial neural network. Here are some of the most known machine-learning frameworks:
The typical workflow of training and using neural networks, regardless of the library used, goes like this:
Training Data
X
variable is the input, and the Y
variable is the output. The simplest thing to do is to learn a logic gate, where X
is a vector or two numbers and Y
is a vector of one number. Typically, the input and output values are floats, so if it is words, you might associate each word to a different neuron. You could also directly use characters, then it would use less neurons than to keep a whole dictionary.Architecture
Evaluation
X
part of the data to the neural network, then comparing the Y
it predicts to the real Y
. many metrics exists to assess the quality of the learning performed.Improvement
Real-World Use
import numpy as np #There is a lot of math in neurons, so use numpy to speed things up in python; in other languages, use an efficient array type for that language
import random #Initial neuron weights should be random
class Neuron:
def __init__(self, nbr_inputs, weight_array = None):
if (weight_array != None): #you might already have a trained neuron, and wish to recreate it by passing in a weight array h ere
self.weight_array = weight_array
else: #...but more often, you generate random, small numbers for the input weights. DO NOT USE ALL ZEROES, or you increase the odds of getting stuck when learning
self.weight_array = np.zeros(nbr_inputs+1)
for el in range(nbr_inputs+1): #+1 to account for bias weight
self.weight_array[el] = random.uniform((-2.4/nbr_inputs),(2.4/nbr_inputs))
self.nbr_inputs = nbr_inputs
def neuron_output(self,input_array):
input_array_with_bias = np.insert(input_array,0,-1)
weighted_sum = np.dot(input_array_with_bias,self.weight_array)
#Here we are using a hyperbolic tangent output; there are several output functions which could be used, with different max and min values and shapes
self.output = 1.716 * np.tanh(0.67*weighted_sum)
return self.output
Encog is an easy to use java neural network engine
public static double XOR_INPUT[][] = { { 0.0, 0.0 }, { 1.0, 0.0 },
{ 0.0, 1.0 }, { 1.0, 1.0 } };
public static double XOR_IDEAL[][] = { { 0.0 }, { 1.0 }, { 1.0 }, { 0.0 } };
public static void main(final String args[]) {
// create a neural network, without using a factory
BasicNetwork network = new BasicNetwork();
network.addLayer(new BasicLayer(null,true,2));
network.addLayer(new BasicLayer(new ActivationSigmoid(),true,3));
network.addLayer(new BasicLayer(new ActivationSigmoid(),false,1));
network.getStructure().finalizeStructure();
network.reset();
// create training data
MLDataSet trainingSet = new BasicMLDataSet(XOR_INPUT, XOR_IDEAL);
// train the neural network
final ResilientPropagation train = new ResilientPropagation(network, trainingSet);
int epoch = 1;
do {
train.iteration();
System.out.println("Epoch #" + epoch + " Error:" + train.getError());
epoch++;
} while(train.getError() > 0.01);
train.finishTraining();
// test the neural network
System.out.println("Neural Network Results:");
for(MLDataPair pair: trainingSet ) {
final MLData output = network.compute(pair.getInput());
System.out.println(pair.getInput().getData(0) + "," + pair.getInput().getData(1)
+ ", actual=" + output.getData(0) + ",ideal=" + pair.getIdeal().getData(0));
}
Encog.getInstance().shutdown();
}
This is the 'Hello World' equivalent of neural networks.