Notebook

Here, you build a classifier with Scikit-Learn for use in 6b. The You'll want to read in Oscar's original data as an array of bitwise note vectors, and from there build the RBM to predict chords (the y's, perhaps build the chord bank and assign a unique number to each). After that, given a note vector (maybe plural?), you should be able to predict the chords for a note (notes?).

This is for Oscar's musical data. The next step is to do the classification for your n-gram model.

In [1]:

from collections import defaultdict
import pandas as pd
import numpy as np
import scipy.sparse
import random, cPickle

In [2]:

# Extract chords into unique ids, e.g. 1, 2, 3, 4, 5
allchords = defaultdict() # remember that it's a hash table
with open("oscar2chords_extract.txt", 'rb') as f:
    for ix, line in enumerate(f):
        items = line.split()
        allchords[ix] = items
assert len(allchords) == len(set(allchords)) # ensure no duplicate chords

In [3]:

# Read in Oscar's data.
vectors = []
notedata = pd.read_csv(open("oscar2notes.txt", 'rb'), skiprows=2)
allnotes = []
for note, octave in zip(notedata["Note/Rest"], notedata["Octave"]):
    allnotes.append("%s%s" % (note, octave))

print "Number of notes (# of samples for RBM): ", len(notedata)
notedata.head()

Number of notes (# of samples for RBM):  1344

Out[3]:

	Note/Rest	Octave	Len	Offset
0	B	3	0.500000	12.625
1	A	5	0.250000	15.000
2	F	4	3.125000	16.000
3	G	4	0.666667	20.625
4	F	4	1.250000	23.875

5 rows × 4 columns

Pull the function from 9. to generate an altered scale for a given note. This is important for updating the bitwise vectors for the BernoulliRBM input. The altered scalee s are hard-coded, which means that they're immutable. Also, they only go from octave 3 through octave six, wrapping around so all possible notes for a given altered scale will be included.

In [23]:

# Generates the altered scale from octaves 3 to 6 for a pitch (e.g. G-)
# for a given note (e.g. G-3) in music21 style.
# Returns altered scale as list of music21 notes.
def genAltered(note='C3'):
    # In case you have to convert a note (e.g. F#) into form below
    def convertSharps(note):
        pitch = ''.join([i for i in note if i.isdigit() is False])
        enharmonic = {"C#" : "D-", "D#" : "E-", "E#" : "F", "F#" : "G-", "G#" : "A-", "A#" : "B-", "B#" : "C"}
        if '#' in pitch: return enharmonic[pitch]
        return pitch
    
    # Get scale with dictionary. For example: allscales[note[:-1]]
    allscales = {
        "C"  : ["C3", "E-3", "F3", "G3", "B-3",
                "C4", "E-4", "F4", "G4", "B-4",
                "C5", "E-5", "F5", "G5", "B-5",
                "C6", "E-6", "F6", "G6", "B-6"],
        "D-" : ["D-3", "E3", "G-3", "A-3", "B3",
                "D-4", "E4", "G-4", "A-4", "B4",
                "D-5", "E5", "G-5", "A-5", "B5",
                "D-6", "E6", "G-6", "A-6", "B6"],
        "D"  : ["C3", "D3", "F3", "G3", "A3", 
                "C4", "D4", "F4", "G4", "A4", 
                "C5", "D5", "F5", "G5", "A5", 
                "C6", "D6", "F6", "G6", "A6"],
        "E-" : ["D-3", "E-3", "G-3", "A-3", "B-3",
                "D-4", "E-4", "G-4", "A-4", "B-4",
                "D-5", "E-5", "G-5", "A-5", "B-5",
                "D-6", "E-6", "G-6", "A-6", "B-6"],
        "E"  : ["D3", "E3", "G3", "A3", "B3",
                "D4", "E4", "G4", "A4", "B4",
                "D5", "E5", "G5", "A5", "B5",
                "D6", "E6", "G6", "A6", "B6"],
        "F"  : ["C3", "E-3", "F3", "A-3", "B-3",
                "C4", "E-4", "F4", "A-4", "B-4",
                "C5", "E-5", "F5", "A-5", "B-5",
                "C6", "E-6", "F6", "A-6", "B-6"],
        "G-" : ["D-3", "E3", "G-3", "A3", "B3",
                "D-4", "E4", "G-4", "A4", "B4",
                "D-5", "E5", "G-5", "A5", "B5",
                "D-6", "E6", "G-6", "A6", "B6"],
        "G"  : ["C3", "D3", "F3", "G3", "B-3",
                "C4", "D4", "F4", "G4", "B-4",
                "C5", "D5", "F5", "G5", "B-5",
                "C6", "D6", "F6", "G6", "B-6"],
        "A-" : ["D-3", "E-3", "G-3", "A-3", "B3",
                "D-4", "E-4", "G-4", "A-4", "B4",
                "D-5", "E-5", "G-5", "A-5", "B5",
                "D-6", "E-6", "G-6", "A-6", "B6"],
        "A"  : ["C3", "D3", "E3", "G3", "A3",
                "C4", "D4", "E4", "G4", "A4",
                "C5", "D5", "E5", "G5", "A5",
                "C6", "D6", "E6", "G6", "A6"],
        "B-" : ["D-3", "E-3", "F3", "A-3", "B-3",
                "D-4", "E-4", "F4", "A-4", "B-4",
                "D-5", "E-5", "F5", "A-5", "B-5",
                "D-6", "E-6", "F6", "A-6", "B-6"],
        "B"  : ["D3", "E3", "G-3", "A3", "B3",
                "D4", "E4", "G-4", "A4", "B4",
                "D5", "E5", "G-5", "A5", "B5",
                "D6", "E6", "G-6", "A6", "B6"]}
    pitch = ''.join([i for i in note if i.isdigit() is False])
    pitch = convertSharps(note) # Rm. octaveinfo, eg. G-5 --> G-, G5->G
    return allscales[pitch]

Now, let's do the bitwise arrays! For updating and using the bitwise arrays:

First, you should populate the bitwise array (i.e. "turn on" the notes) with the notes that are actually in the given chord, with all octaves. This simply means setting those notes to equal 1 or 0.
Second, for a given chord, you want to predict which altered scale it belongs best to. To do this (performance doesn't matter since you'll be cPickling this anyway), generate the altered scales for each of the notes in the chord, and find which notes "overlap" between those multiple generated scales the most, and turn those on. For example, you may find for C major you get the C, E, and G altered scales. Find, say, up to 8 notes that appear in >1 of these altered scales (i.e. "overlapping" notes), and flip the bits in the vector accordingly. If you're like to create more training data, you can randomize this by picking k random notes from the overlapping notes.

In [24]:

# Given a MUSIC21 note, such as C5 or D#7, convert it
# into a note on the keyboard between 0 and 87 inclusive.
# Don't convert it for mingus; try to use music21 note style
# as much as possible for all this stuff.
def quantify(note):
    notevals = {
        'C' : 0,
        'D' : 2,
        'E' : 4,
        'F' : 5,
        'G' : 7,
        'A' : 9,
        'B' : 11
    }
    quantized = 0
    octave = int(note[-1]) - 1
    for i in note[:-1]:
        if i in notevals: quantized += notevals[i]
        if i == '-': quantized -= 1
        if i == '#': quantized += 1
    quantized += 12 * octave
    return quantized

# Create bitwise note vectors for use with Restricted Boltzmann Machine.
vectors = np.zeros((1, 88))
for ix, note in enumerate(allnotes):
    vect = np.zeros((1, 88))
    vect[0, quantify(note)] = 1
    if ix == 0:
        vectors = vect
    else:
        vectors = np.vstack((vectors, vect))
print vectors.shape

(1344, 88)

See notes on what you should actually do.

Annotate Oscar's chord data so you have a notes vector for each chord listing the notes that go well with it.
Move onto each cluster; for each cluster, build a vector covering all of its notes.
You need a training and a test set, so create those from Oscar's data somehow.
Find a good classifier to use with this. It might be stacked RBMs, or it might not! Use whatever tool is best for the job.

Step 1: build the vocabulary of possible notes (e.g. note vectors with # of notes >= 1) for the class labels (each chord's unique id).

In [42]:

""" Hard-code altered scales right below for genChordNotes(). """

# Convert mingus note back to music21 note. WORKS
def unmingify(note):
    return note.replace('-','').replace('b','-')
    
# Given a list of mingus notes (i.e. a chord), say ['A-2', 'A-3', 'E-3'],
# Takes a chord (i.e. a list of notes) and returns a bitwise notevector with possible notes to go along with it.
# Idea: what if just generate notewise vector with exact same pitches? Indepedence assumption?
def genChordNotes(chord):
    chord = [unmingify(note) for note in chord] # really important to unmingify notes.
    notevect = np.zeros((1, 88))
    
    # populate with initial pitches
    for note in chord:
        notevect[0, quantify(note)] = 1
        
    # add initial pitches transposed to other octaves
    otheroctaves = range(3, 6)
    for note in chord:
        notebase = note[:-1]
        for octv in otheroctaves:
            put = bool(random.getrandbits(1)) # randomize other pitches
            if put is True:
                translated = "%s%s" % (notebase, octv)
                notevect[0, quantify(translated)] = 1
    
    # Add altered scale that contains most # of notes from chord notes
    # e.g. if chord = [e5, g5, b5] then want altered scale with as many of
    # those notes as possible. This lets you expand past simply
    # the notes already in that chord. Encode the notes of the altered
    # scale into the bitwise vector as with the initial pitches.
    # Maybe it works better w/o the altered scales; or maybe instead with pentatonics? try that.
    # Toggle below to include alternative notes (e.g. pentatonic/altered scales) or not
    altfreqs = defaultdict(int)
    for note in chord:
        for i in genAltered(note):
            altfreqs[i] += 1
    topnotes = [k for k, v in altfreqs.items() if v > 2] # get notes that overlap > 2 times
    for note in topnotes: # flip bits randomly from this list
        if bool(random.getrandbits(1)):
            notevect[0, quantify(note)] = 1
    
    # return the vector
    return notevect

# Create initial arrays (1-40, one for each thing)
xdata = np.zeros((1, 88))
for chordID, chord in allchords.items():
    if chordID == 0:
        xdata = genChordNotes(chord)
    else:
        xdata = np.vstack((xdata, genChordNotes(chord)))
ydata = allchords.keys()

print "Before adding random data: ", xdata.shape, len(ydata)

# create more randomized data
for chordID, chord in allchords.items():
    for j in xrange(50): 
        xdata = np.vstack((xdata, genChordNotes(chord)))
        ydata.append(chordID)
ydata = np.array(ydata).reshape(-1, )

print "After adding random data: ", xdata.shape, ydata.shape
# make sure you have the right # of chords. check with # of items in "oscarchords" back in (5).

Before adding random data:  (40, 88) 40
After adding random data:  (2040, 88) (2040,)

Now, it's time for some learning! Create a classifier to get a feel of what the training data is (no test) -- you want to get a deep understanding of what note vectors are associated with which class labels. Remember, the only reason you would need to use train/test sets is to test the effectiveness of the classifier - for the actual prediction in The N-Gram Pipeline, you can fit the classifier to the entire dataset.

In [43]:

from sklearn.svm import SVC
from sklearn.grid_search import GridSearchCV
from sklearn.cross_validation import train_test_split
from sklearn import metrics

In [44]:

# Create train, test sets
xtrain, xtest, ytrain, ytest = train_test_split(xdata, ydata, test_size=0.2, random_state=50)

# Use gridsearch to build the classifier. Change verbose GridSearchCV param to True if want progress on the processing.
grid_search = GridSearchCV(estimator=SVC(), param_grid={'kernel' : ('linear', 'rbf'), 'C' : np.linspace(0.1, 5.1, 10)}, n_jobs=-2)

# Train the classifier
grid_search.fit(xtrain, ytrain)

# Evaluate the classifier's effectiveness.
print "\nPredictions for sample of n=10: "
print "Real values: ", ytest[:20] # verifies you get the class labels, not the problem earlier (only 1-2 of labels)
print "Predicted:   ", grid_search.predict(xtest[:20])
print metrics.classification_report(ytest, grid_search.predict(xtest))
print "Best parameters: ", grid_search.best_params_

Predictions for sample of n=10: 
Real values:  [37 13 33  3  0 34  6 14 23  8 28 21 19 22 34 38  9  1 34 37]
Predicted:    [37 13 33  3  0 34 13 14 23  8 28 21 19 22 34 38 10  1 34 37]
             precision    recall  f1-score   support

          0       1.00      1.00      1.00        12
          1       0.86      1.00      0.92         6
          2       1.00      1.00      1.00         9
          3       1.00      1.00      1.00         8
          4       1.00      1.00      1.00        17
          5       1.00      1.00      1.00         9
          6       0.50      0.30      0.37        10
          7       1.00      1.00      1.00         7
          8       1.00      1.00      1.00         9
          9       0.43      0.60      0.50        10
         10       0.56      0.38      0.45        13
         11       0.46      1.00      0.63         6
         12       1.00      0.36      0.53        11
         13       0.40      0.57      0.47         7
         14       1.00      1.00      1.00         8
         15       1.00      1.00      1.00         7
         16       1.00      1.00      1.00        13
         17       1.00      1.00      1.00        10
         18       1.00      1.00      1.00        10
         19       1.00      1.00      1.00         6
         20       1.00      1.00      1.00        11
         21       0.64      1.00      0.78         9
         22       1.00      0.62      0.76        13
         23       1.00      1.00      1.00         9
         24       1.00      1.00      1.00        10
         25       1.00      1.00      1.00         6
         26       1.00      1.00      1.00        13
         27       1.00      1.00      1.00        16
         28       1.00      1.00      1.00        11
         29       1.00      1.00      1.00        13
         30       1.00      1.00      1.00        14
         31       1.00      1.00      1.00        13
         32       1.00      1.00      1.00        10
         33       1.00      1.00      1.00        10
         34       1.00      1.00      1.00        19
         35       1.00      1.00      1.00         8
         36       1.00      1.00      1.00         9
         37       1.00      1.00      1.00        10
         38       1.00      1.00      1.00        11
         39       1.00      1.00      1.00         5

avg / total       0.93      0.92      0.91       408

Best parameters:  {'kernel': 'rbf', 'C': 5.0999999999999996}

The final step is to write the classifier to disk, having already trained it on the chord data, so you can use it in the official notebook (6b).

In [45]:

# save the classifier to disk for use with 6b. The N-Gram Pipeline, Part II.
with open('part7clf.pkl', 'wb') as fid:
    cPickle.dump(grid_search, fid)

In [46]:

# save the defaultdict (intID : chord) to disk for use with 6b. The N-Gram Pipeline, Part II.
with open('part7cdict.pkl', 'wb') as fid:
    cPickle.dump(allchords, fid)

In [ ]: