The Parity Problem

  • Perceptrons - AI Winter
XOR01
001
110

Solution: If you add one new quadratic feature , XOR is linearly separable in 3D.

There is another more powerful way to solve this XOR problem : Stacking Linear Combos

This is close, but it won’t work yet. A linear combo of a linear combo is a linear combo … only works for linearly separable points.

We need some non-linearity.

  • Traditional Choice: Logistic Function (Sigmoid) : Smooth, well defined gradients and Hassian.
  • RELU


Network with 1 Hidden Layer

Input Layer: Hidden Layer: Output Layer:

Layer 1 weights: matrix V; is row i; weights into Layer 2 weights: matrix W; is row i; weights into

The weights are in row. (since we want to do ). So, each in the row is gonna be the weight for .