
Perceptron Learning Calculator

Calculates updated perceptron weights and bias after a single learning step using the perceptron learning rule.

Formula

w_i(new) = w_i(old) + η × (y − ŷ) × x_i
b(new) = b(old) + η × (y − ŷ)

Here w_i is the weight for input i; η (eta) is the learning rate; y is the true label (target output); ŷ is the predicted output from the step activation function; x_i is the i-th input feature; b is the bias term. The error term (y − ŷ) is 0 when the prediction is correct, +1 when the true label is 1 but the prediction was 0, and −1 when the true label is 0 but the prediction was 1.

Source: Rosenblatt, F. (1958). The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychological Review, 65(6), 386–408.

How it works

The perceptron is the simplest form of an artificial neural network — a binary linear classifier that maps a vector of inputs to a single binary output. Given a set of inputs and their corresponding weights, the perceptron computes a weighted sum (the net input), compares it to a threshold, and fires a 1 if the sum meets or exceeds the threshold, or a 0 otherwise. This step-function activation is the hallmark of the classical perceptron model introduced by Rosenblatt.

The core learning rule updates each weight according to the formula: w_i(new) = w_i(old) + η × (y − ŷ) × x_i, where η is the learning rate (a small positive value controlling step size), y is the true class label, ŷ is the perceptron's prediction, and x_i is the corresponding input feature. The bias is updated analogously: b(new) = b(old) + η × (y − ŷ). When the prediction is correct, the error term (y − ŷ) equals zero and no update occurs. When the prediction is wrong, weights and bias shift in the direction that would have produced the correct output.
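The update rule can be sketched in a few lines of Python. This is an illustrative sketch, not the calculator's own code; `step` and `perceptron_update` are hypothetical names:

```python
def step(net, threshold=0.0):
    """Step (Heaviside) activation: fire 1 if the net input meets the threshold."""
    return 1 if net >= threshold else 0

def perceptron_update(weights, bias, x, y, eta=0.1):
    """One learning step: w_i += eta*(y - y_hat)*x_i and b += eta*(y - y_hat)."""
    net = sum(w * xi for w, xi in zip(weights, x)) + bias
    y_hat = step(net)
    error = y - y_hat                      # 0 when correct, so no update occurs
    new_weights = [w + eta * error * xi for w, xi in zip(weights, x)]
    new_bias = bias + eta * error
    return new_weights, new_bias, y_hat

# A correctly classified sample leaves the weights and bias untouched.
w, b, y_hat = perceptron_update([0.5, -0.3], 0.1, [1, 0], y=1)
print(w, b, y_hat)  # [0.5, -0.3] 0.1 1
```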

This calculator supports up to three input features (x₁, x₂, x₃) along with their respective weights and a bias term. It is used in machine learning coursework to trace through single-epoch updates, in AI research to prototype simple classifiers, and by practitioners who want to build intuition about gradient-descent-based learning before tackling multi-layer networks and backpropagation. Set x₃ and w₃ to 0 for a standard two-feature perceptron.

Worked example

Suppose a perceptron is learning a simple binary classification task. The current state is: w₁ = 0.5, w₂ = −0.3, b = 0.1, η = 0.1, and the activation threshold is 0.

A training sample arrives with inputs x₁ = 1, x₂ = 0, and true label y = 1.

Step 1 — Compute the weighted sum:
Net = (0.5 × 1) + (−0.3 × 0) + 0.1 = 0.5 + 0 + 0.1 = 0.6

Step 2 — Apply the step activation:
0.6 ≥ 0 (threshold), so ŷ = 1

Step 3 — Compute the error:
error = y − ŷ = 1 − 1 = 0

Step 4 — Update weights and bias:
Since error = 0, no update is needed.
w₁(new) = 0.5 + 0.1 × 0 × 1 = 0.5
w₂(new) = −0.3 + 0.1 × 0 × 0 = −0.3
b(new) = 0.1 + 0.1 × 0 = 0.1

Now consider a misclassified sample: x₁ = 1, x₂ = 1, true label y = 1, but with weights w₁ = −0.4, w₂ = −0.3, b = 0.1.
Net = (−0.4 × 1) + (−0.3 × 1) + 0.1 = −0.6 → ŷ = 0
error = 1 − 0 = 1
w₁(new) = −0.4 + 0.1 × 1 × 1 = −0.3
w₂(new) = −0.3 + 0.1 × 1 × 1 = −0.2
b(new) = 0.1 + 0.1 × 1 = 0.2
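The misclassified case can be checked with a short standalone snippet (results are rounded to absorb floating-point noise; the variable names are illustrative):

```python
eta = 0.1
w1, w2, b = -0.4, -0.3, 0.1          # current state from the example above
x1, x2, y = 1, 1, 1                  # the misclassified training sample

net = w1 * x1 + w2 * x2 + b          # -0.6
y_hat = 1 if net >= 0 else 0         # 0: prediction is wrong
err = y - y_hat                      # +1
w1, w2, b = (round(w1 + eta * err * x1, 10),
             round(w2 + eta * err * x2, 10),
             round(b + eta * err, 10))
print(w1, w2, b)  # -0.3 -0.2 0.2
```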

Limitations & notes

The classical perceptron learning rule is guaranteed to converge only when the training data is linearly separable, that is, when a hyperplane can perfectly divide the two classes. For non-linearly separable problems (such as XOR), the perceptron never converges and the weights oscillate indefinitely. This limitation motivated the development of multi-layer perceptrons (MLPs) and the backpropagation algorithm.

Additionally, this calculator computes a single learning step for up to three features; real-world perceptron training iterates over an entire dataset, potentially for many epochs. The step (Heaviside) activation used here is non-differentiable at zero and has zero gradient everywhere else, which rules out gradient-based optimisation and is why modern networks use smooth activations such as sigmoid or ReLU. The learning rate η must be chosen carefully: too large causes oscillation, too small causes slow convergence. Finally, the standard perceptron outputs only binary classes (0 or 1), making it unsuitable for multi-class or regression problems without extension.
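The full-dataset, multi-epoch training the calculator does not perform can be sketched as follows, here on the AND gate (a linearly separable toy dataset chosen for illustration, with an arbitrary η):

```python
# Multi-epoch perceptron training on the AND gate. Because the data is
# linearly separable, the loop is guaranteed to reach an error-free pass.
data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w, b, eta = [0.0, 0.0], 0.0, 0.1

for epoch in range(100):
    errors = 0
    for x, y in data:
        net = sum(wi * xi for wi, xi in zip(w, x)) + b
        err = y - (1 if net >= 0 else 0)      # step activation, threshold 0
        if err != 0:
            w = [wi + eta * err * xi for wi, xi in zip(w, x)]
            b += eta * err
            errors += 1
    if errors == 0:                           # a full pass with no updates: converged
        break

preds = [1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else 0 for x, _ in data]
print(preds)  # [0, 0, 0, 1]: the learned boundary implements AND
```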

Frequently asked questions

What is the perceptron learning rule?

The perceptron learning rule updates each weight by adding the product of the learning rate, the prediction error (true label minus predicted output), and the corresponding input value. Weights are adjusted only when the perceptron makes an incorrect prediction, nudging the decision boundary toward classifying the sample correctly.

What does the learning rate η do in the perceptron algorithm?

The learning rate η controls how large each weight update step is. A high learning rate makes large updates that can cause instability or overshooting, while a low learning rate makes small, cautious updates that may converge more slowly. Typical values range from 0.01 to 0.5 for simple perceptron tasks.

Why does the perceptron fail on the XOR problem?

The XOR function is not linearly separable — no single straight line can divide its outputs into two classes. The perceptron can only learn linear decision boundaries, so it cannot represent XOR. This fundamental limitation was famously highlighted by Minsky and Papert in 1969 and spurred research into multi-layer networks.
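The non-convergence is easy to demonstrate empirically: running the same update rule over the XOR truth table never produces an error-free pass, no matter how many epochs are allowed (a small sketch under the same step-activation assumptions as above):

```python
# Single-layer perceptron training on XOR: since no line separates the
# classes, no weight vector classifies all four samples, so an error-free
# pass is impossible and the weights oscillate forever.
xor_data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 0)]
w, b, eta = [0.0, 0.0], 0.0, 0.1

converged = False
for epoch in range(1000):
    errors = 0
    for x, y in xor_data:
        net = sum(wi * xi for wi, xi in zip(w, x)) + b
        err = y - (1 if net >= 0 else 0)
        w = [wi + eta * err * xi for wi, xi in zip(w, x)]
        b += eta * err
        errors += abs(err)
    if errors == 0:
        converged = True
        break

print(converged)  # False
```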

What is the difference between the perceptron and logistic regression?

Both are linear binary classifiers, but they differ in activation and loss. The perceptron uses a hard step function and updates only on misclassified points, while logistic regression uses a sigmoid activation and minimises a probabilistic log-loss across all data points. Logistic regression produces probability estimates; the perceptron does not.

How is the bias term updated in a perceptron?

The bias is treated as a special weight connected to a constant input of 1. It is updated with the same rule as other weights: b(new) = b(old) + η × (y − ŷ). The bias allows the decision boundary to shift away from the origin, giving the perceptron greater flexibility in separating classes.
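The bias-as-weight view can be shown with the worked-example values from above, prepending a constant input of 1 so the bias is updated by exactly the same rule (illustrative variable names):

```python
# Fold the bias into the weight vector as w_0, paired with a constant input 1.
eta = 0.1
w_aug = [0.1, 0.5, -0.3]   # [b, w1, w2] from the worked example
x_aug = [1, 1, 0]          # constant 1, then x1 = 1, x2 = 0
y = 1

net = sum(wi * xi for wi, xi in zip(w_aug, x_aug))   # 0.1 + 0.5 + 0 = 0.6
y_hat = 1 if net >= 0 else 0
err = y - y_hat                                      # 0: prediction correct
w_aug = [wi + eta * err * xi for wi, xi in zip(w_aug, x_aug)]
print(y_hat, w_aug)  # 1 [0.1, 0.5, -0.3]
```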

Last updated: 2025-01-15 · Formula verified against primary sources.