wu am i
1 min readJan 11, 2019

--

Thanks exactly my question was, also why learning rate?

I also found another answer

“the gradient ∇f(a) points in the direction of the greatest increase of f, that is, the direction of steepest ascent. Of course, the opposite direction, −∇f(a), is the direction of steepest descent”

Use small learning rate so that we don’t jump off the slope. need to go down the slope to reach slope~0

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

wu am i
wu am i

Written by wu am i

truth serum - pythonist, perpetual learner, ai/ml enthusiast, hope to build a personal robotic assistant (pra?) to take care me in my old age!

No responses yet

Write a response