Array Indexing in NumPy¶

This notebook presents how to manipulate matrices in NumPy. NumPy is the fundamental package for scientific computing with Python.

In [1]:

import numpy as np

NumPy provides the type N-dimensional array.

In [2]:

a = np.array([[1, 2], [3, 4]])

In [3]:

type(a)

Out[3]:

numpy.ndarray

In [4]:

Out[4]:

array([[1, 2],
       [3, 4]])

You can multiply two numpy.ndarrays:

In [5]:

a * a

Out[5]:

array([[ 1,  4],
       [ 9, 16]])

But you can see that arrays are not exactly mathematical matrices. The expected result for matrix multiplication would be np.array([[7, 10], [15, 22]]). Operators act element-wise on arrays. Since this is the case in matrix addition too, you can use arrays as matrices when you do:

In [6]:

a + a

Out[6]:

array([[2, 4],
       [6, 8]])

Adding an array and a scalar follows the expected (sensible) behaviour.

In [7]:

a + 1

Out[7]:

array([[2, 3],
       [4, 5]])

A Useful Toolbox¶

NumPy provides the usual functions to apply on arrays.

In [8]:

np.sum(a)  # Sums all elements

Out[8]:

Partial sum along the first (index 0) axis:

In [9]:

np.sum(a, 0)  # | (axis=0)

Out[9]:

array([4, 6])

Partial sum along the second (index 1) axis:

In [10]:

np.sum(a, 1)  # --> (axis=1)

Out[10]:

array([3, 7])

In [11]:

np.sum(a, axis=1)

Out[11]:

array([3, 7])

Exercise

What does np.sum(a, 2) do?

Likewise, we can call the mean function:

In [12]:

np.mean(a)

Out[12]:

2.5

Over the entire array or partially:

In [13]:

np.mean(a, axis=0)

Out[13]:

array([ 2.,  3.])

You can look for values in your array:

In [14]:

np.where(a == 2)

Out[14]:

(array([0]), array([1]))

returns the coordinates of element 2. Note that coordinates are consistent with the mathematical matrix convention. Coordinates in this format can be readily used to access elements; trivially:

In [15]:

coord = np.where(a == 1)
a[coord]

Out[15]:

array([1])

Note that everything is stored in arrays. If you have several instances, it looks like the following:

In [16]:

a1 = np.ones((2, 2))
print "Array is", a1
coord1 = np.where(a1 == 1)
print "Ones are found at (row, column) =", coord1
print "Values at these coordinates", a1[coord1]
print "Values of former array at these coordinates", a[coord1]

Array is [[ 1.  1.]
 [ 1.  1.]]
Ones are found at (row, column) = (array([0, 0, 1, 1]), array([0, 1, 0, 1]))
Values at these coordinates [ 1.  1.  1.  1.]
Values of former array at these coordinates [1 2 3 4]

If you have no instances, say:

In [17]:

np.where(a == 0)

Out[17]:

(array([], dtype=int64), array([], dtype=int64))

There is no element equal to 0 in a, so we get an empty array.

Exercise

Access the element of a which lies on the second row and first column.

To select the entire second row, use a slice as follows:

In [18]:

a[1, :]

Out[18]:

array([3, 4])

Slices illustrate the power of high-level programming. Do not write for loops. Let the computer worry about how to do the element-by-element operations!

Matrix multiplication¶

Matrix multiplication can be expressed by the dot product of 2D arrays:

In [19]:

np.dot(a, a)

Out[19]:

array([[ 7, 10],
       [15, 22]])

The dot product of 1D arrays expresses inner product of vectors:

In [20]:

b = np.array([1, 2, 3])

In [21]:

np.dot(b, b)  # 1*1 + 2*2 + 3*3

Out[21]:

Of course, dot() only works on arrays with compatible shapes. For example,

In [22]:

np.dot(a, b)

---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-22-579c274cec9b> in <module>()
----> 1 np.dot(a, b)

ValueError: objects are not aligned

In [23]:

a.shape

Out[23]:

(2, 2)

In [24]:

b.shape

Out[24]:

(3,)

You cannot multiply a 2-by-2 matrix with a vector of size 3.

Exercise

Create an object named c such that you can compute np.dot(a, c). Try np.dot(c, a). What do you notice?

In [25]:

# NumPy does not distinguish between row and colum vectors.

If you want to use * as the operator for matrix multiplication, you need to create matrix objects.

In [26]:

m = np.matrix([[1, 2], [3, 4]])

In [27]:

m * m

Out[27]:

matrix([[ 7, 10],
        [15, 22]])

You can convert an array into a matrix and vice versa.

In [28]:

am = np.matrix(a)

In [29]:

ma = np.array(m)

Prefer Array or Matrix?¶

Matrix has all the features of array. You want to use the matrix type if your problem is linear algebra. Indeed, vectors are then 1-by-N matrices.

In [30]:

bm = np.matrix(b)
bm.shape

Out[30]:

(1, 3)

Transpose it to get a column vector:

In [31]:

np.transpose(bm)

Out[31]:

matrix([[1],
        [2],
        [3]])

Otherwise, typically if you are representing multi-dimensional grids, you should use array.

Always remember to look at the docs: http://docs.scipy.org/doc/numpy/

You will find implementations for conjugate, convolve, correlate, diagonal, fft, gradient, ... These functions are faster than anything you could easily write and ... someone else has tested and debugged them! :)