What the trainer shows

A neural network adjusts weights to reduce a loss function. The update rule uses the gradient ablaLabla L and a learning rate etaeta to decide how large each step should be.

More hidden layers and neurons increase model capacity, but they also raise the chance of overfitting if the training run is long and regularization stays weak.

The preview here is a teaching simulation, so it focuses on training tradeoffs instead of full numerical backpropagation.

How to read the results

  • Use the top summary row to compare score, loss, parameter count, and overfit risk.
  • A wider gap between training and validation lines usually means the model is memorizing rather than generalizing.
  • Classification presets report accuracy-style metrics, while regression presets report error metrics such as RMSE and MAE.