# imports
import theano
from theano import tensor as T
import numpy as np
# generate the training data: y = 2x plus Gaussian noise
trX = np.linspace(-1, 1, 101)
trY = 2 * trX + np.random.randn(*trX.shape) * 0.33
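# sketch: as a sanity check we can compare against an ordinary least-squares
# fit from numpy (using np.polyfit here is an assumption, not part of the original)
ref_slope, ref_intercept = np.polyfit(trX, trY, 1)  # slope should come out near 2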
# symbolic variable initialization
X = T.scalar()
Y = T.scalar()
# our model
def model(X, w):
    return X * w
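# note: model(X, w) builds a symbolic expression, not a number; something
# like theano.pp(model(X, w)) pretty-prints the underlying graph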
# model parameter initialization: a shared variable holding the real-valued weight
w = theano.shared(np.asarray(0., dtype=theano.config.floatX))
y = model(X, w)
# metric to be minimized by the model: mean squared error
cost = T.mean(T.sqr(y - Y))
# learning signal for the parameter: compute the gradient of the cost with respect to w symbolically
gradient = T.grad(cost=cost, wrt=w)
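# sketch: for this scalar model the gradient can also be derived by hand,
# d/dw (X*w - Y)^2 = 2*X*(X*w - Y), so we can check T.grad against it
# (the names manual_grad and check_grad are illustrative, not from the original)
manual_grad = 2 * X * (X * w - Y)
check_grad = theano.function([X, Y], [gradient, manual_grad], allow_input_downcast=True)
# check_grad(1.0, 0.5) should return two (nearly) identical values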
# how to update w at each step; 0.01 is the learning rate
updates = [[w, w - gradient * 0.01]]
# compile to a Python function. allow_input_downcast=True is set to silence
# typing issues with Theano on different platforms (GPUs handle only 32-bit floats)
train = theano.function(inputs=[X, Y], outputs=cost, updates=updates, allow_input_downcast=True)
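# each call to train performs one stochastic-gradient step and returns the
# cost for that sample, computed before the update is applied, e.g.:
# train(trX[0], trY[0])  # -> squared error for the first sample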
# iterate 100 times over the entire dataset (100 epochs); within each epoch, iterate over all the data samples
for i in range(100):
    for x_i, y_i in zip(trX, trY):  # x_i, y_i avoid shadowing the symbolic y above
        train(x_i, y_i)
# we should get something close to the true weight of the data: 2
print(w.get_value())  # e.g. array(1.9382810308787022)
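# sketch: the same graph can be reused for inference by compiling a predict
# function from the symbolic output y (no updates argument, so no side effects)
predict = theano.function(inputs=[X], outputs=y, allow_input_downcast=True)
print(predict(0.5))  # should be close to 2 * 0.5 = 1.0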