You are asked to implement a data-fitting classifier that predicts the label of an input vector. In the context of hand-written digit classification, the label is an integer between 0 and 9, and the input vector is a vectorized 28 × 28 image whose elements are floating-point numbers between 0 and 1 (the grayscale value of each pixel). Your task is to solve this classification problem with the familiar mathematical tool of least squares: you will use a least-squares fit to classify a subset of the MNIST hand-written digit dataset.
The training dataset for this project has 1000 images, stored as a 1000 × 784 matrix in train_data.txt (you may refer to the constructor of the class Classifier to see how to read a matrix from a text file). The testing dataset has 200 images, stored as a 200 × 784 matrix in test_data.txt. The labels for the training set are stored as a 1D array (or a 1000 × 1 matrix) in train_labels.txt, and the labels for the testing set are in test_labels.txt.
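If you prefer to load the matrices directly with NumPy rather than through the starter code, the following is a minimal loading sketch; it assumes the .txt files contain plain whitespace-separated numbers (the Classifier constructor in the starter code defines the intended reading routine).

import numpy as np

# Minimal loading sketch; assumes plain whitespace-separated numeric text files.
train_data = np.loadtxt("train_data.txt")       # expected shape: (1000, 784)
train_labels = np.loadtxt("train_labels.txt")   # expected shape: (1000,)
test_data = np.loadtxt("test_data.txt")         # expected shape: (200, 784)
test_labels = np.loadtxt("test_labels.txt")     # expected shape: (200,)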
Here is a brief, high-level description of the mathematical model you may consider implementing. You will solve a least-squares problem 𝑋𝑊 = 𝑌 to fit the training data, where 𝑋 is a 1000 × 785 data matrix (note the extra bias column!), 𝑊 is a 785 × 10 model matrix, and 𝑌 is a 1000 × 10 right-hand-side matrix of target values composed of 1 and −1 (or any pair of values that distinguishes membership in a class from non-membership). For ease of implementation, you may solve the problem as ten separate least-squares problems, one for each column 𝜃𝑖 (a 785 × 1 vector) of 𝑊. After computing 𝑊, you can predict the class of a new image 𝑥 by a single vector-matrix multiplication 𝑣 = 𝑥𝑊 and then picking the index of the maximum entry, 𝑝 = argmax𝑖{𝑣𝑖}. Test your least-squares classifier on the test dataset and report its accuracy (by counting how many test images are predicted correctly).
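For illustration, here is a minimal NumPy sketch of the fitting and prediction steps just described. It assumes 𝑋 already includes the bias column (shape 1000 × 785) and that the labels are integers 0–9; the function names fit_least_squares and predict are only illustrative, not part of the starter code.

import numpy as np

def fit_least_squares(X, labels, classes=10):
    # Build the target matrix Y with Y[n, c] = +1 if image n has label c, else -1.
    Y = -np.ones((X.shape[0], classes))
    Y[np.arange(X.shape[0]), labels.astype(int)] = 1.0
    # Solve the ten least-squares problems X theta_i = Y[:, i]; passing the full
    # matrix Y solves all ten columns in a single call.
    W, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return W                                    # shape: (785, classes)

def predict(X, W):
    # v = x W for each row x; the predicted class is the index of the largest entry.
    return np.argmax(X @ W, axis=1)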
[Hint: you may suffer from a singular 𝑋 due to the large blank regions in each image. To make the problem solvable by your least-squares solver, you can add a small random number to each element of 𝑋. For example, adding noise drawn from the range [0, 0.001] to each element of 𝑋 perturbs its features just enough to remove the singularity.]
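A perturbation along the lines of the hint might look like the short sketch below (the range [0, 0.001] is the value suggested above).

# Add small uniform noise in [0, 0.001] to every element of X so the
# least-squares system is no longer singular.
X = X + np.random.uniform(0.0, 0.001, size=X.shape)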
import numpy as np

def classifier(x, y, classes=10):
    # x: train_data, shape (num_samples, 784)
    # y: train_labels, shape (num_samples,)
    # classes: number of classes (10 digits)
    x = add_bias_column(x)                     # append the bias column -> (num_samples, 785)
    theta = np.zeros([x.shape[1], classes])    # the model matrix W, shape (785, classes)
    ### your code starts here ###
    ### your code ends here ###
    return theta
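As a usage sketch only, assuming you have filled in the body of classifier and loaded the data as above: the add_bias_column helper shown here is a hypothetical version that appends a column of ones (the actual helper in the starter code may place the bias column differently), followed by the accuracy count on the test set.

def add_bias_column(x):
    # Hypothetical helper: append a column of ones as the bias feature.
    return np.hstack([x, np.ones((x.shape[0], 1))])

theta = classifier(train_data, train_labels)
# Predict by v = x W on the bias-augmented test data, then take the argmax per row.
predictions = np.argmax(add_bias_column(test_data) @ theta, axis=1)
# Report accuracy as the fraction of correctly predicted test images.
accuracy = np.mean(predictions == test_labels)
print(f"Least-squares test accuracy: {accuracy:.3f}")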