Stochastic Gradient Descent

Next topic
y' = a x + b
x
y
a: b: alpha:
batch size: with Momentum
\(x\)\(y\)\(ŷ\)\(ŷ-y\)L\(\partial L \over \partial a\)\(\partial L \over \partial b\)ab
Epoch: Batch: L (loss):
velocity a: velocity b:
a: b:
chart: