Softmax - Pushforward/Jvp rule

The softmax is a common function in machine learning to map logit values to discrete probabilities. It is often used as the final layer in a neural network applied to multinomial regression problems. Here, we derive its rule for forward-mode AD. Here are the notes: github.com/Ceyron/machine-lea...
------
👉 This educational series is supported by the world-leaders in integrating machine learning and artificial intelligence with simulation and scientific computing, Pasteur Labs and Institute for Simulation Intelligence. Check out simulation.science/ for more on their pursuit of 'Nobel-Turing' technologies (arxiv.org/abs/2112.03235 ), and for partnership or career opportunities.
-------
📝 : Check out the GitHub Repository of the channel, where I upload all the handwritten notes and source-code files (contributions are very welcome): github.com/Ceyron/machine-lea...
📢 : Follow me on LinkedIn or Twitter for updates on the channel and other cool Machine Learning & Simulation stuff: / felix-koehler and / felix_m_koehler
💸 : If you want to support my work on the channel, you can become a Patreon here: / mlsim
🪙: Or you can make a one-time donation via PayPal: www.paypal.com/paypalme/Felix...
-------
⚙️ My Gear:
(Below are affiliate links to Amazon. If you decide to purchase the product or something else on Amazon through this link, I earn a small commission.)
- 🎙️ Microphone: Blue Yeti: amzn.to/3NU7OAs
- ⌨️ Logitech TKL Mechanical Keyboard: amzn.to/3JhEtwp
- 🎨 Gaomon Drawing Tablet (similar to a WACOM Tablet, but cheaper, works flawlessly under Linux): amzn.to/37katmf
- 🔌 Laptop Charger: amzn.to/3ja0imP
- 💻 My Laptop (generally I like the Dell XPS series): amzn.to/38xrABL
- 📱 My Phone: Fairphone 4 (I love the sustainability and repairability aspect of it): amzn.to/3Jr4ZmV
If I had to purchase these items again, I would probably change the following:
- 🎙️ Rode NT: amzn.to/3NUIGtw
- 💻 Framework Laptop (I do not get a commission here, but I love the vision of Framework. It will definitely be my next Ultrabook): frame.work
As an Amazon Associate I earn from qualifying purchases.
-------
Timestamps:
00:00 What is the softmax?
01:20 Intro
01:35 In general: Jacobian-vector product
02:53 Deriving the Jacobian in index notation
09:07 Evaluating the Jvp in index notation
10:30 The Jvp in symbolic notation
11:40 Full Pushforward rule
12:50 Outro

Пікірлер: 4

  • @cangozpinar
    @cangozpinar4 ай бұрын

    Thank you for deriving the math and not just skipping over it !

  • @MachineLearningSimulation

    @MachineLearningSimulation

    4 ай бұрын

    You're welcome 😊 Not skipping over the detailed math is very important for me. Thanks for appreciating it 👍

  • @theneuralmancer
    @theneuralmancer5 ай бұрын

    Thank you so much for this!

  • @MachineLearningSimulation

    @MachineLearningSimulation

    5 ай бұрын

    You're welcome 😊 Btw: I didn't forget about the other comments on the older autodiff videos. Wanted to give you a thorough reply but had a longer backlog of comments. I hope I can come back to it soon. 😊