The learning rate is a tuning parameter in machine learning and statistics that determines the step size at each iteration while moving towards a minimum of a loss function. It influences how much newly acquired information overrides old information, and must be set carefully to achieve faster convergence and prevent oscillations or getting stuck in local minima. The learning rate can be varied during training with a learning rate schedule or an adaptive learning rate.
Carnegie Mellon University
Spring 2020
This course provides a comprehensive introduction to deep learning, starting from foundational concepts and moving towards complex topics such as sequence-to-sequence models. Students gain hands-on experience with PyTorch and can fine-tune models through practical assignments. A basic understanding of calculus, linear algebra, and Python programming is required.
No concepts data
+ 40 more concepts