NN03 - Logistische Regression

Videos

Folien

Kurze Übersicht

Ausgabe $y$ ist reelle Zahl aus dem stetigen Bereich $(0, 1)$
Die Hypothesenfunktion ist: $\begin{matrix} (1) & h (x) = σ (w^{T} x) = σ (w_{0} + w_{1} x_{1} + w_{2} x_{2} + \dots + w_{n} x_{n}) \end{matrix}$
Der Kreuzentropie Verlust (engl. Cross-Entropy) für einen Datenpunkt $x$ : $\begin{matrix} (2) & L (a, y) = - y \log (a) - (1 - y) \log (1 - a) \end{matrix}$ wobei hier $a := \hat{y}$ die Vorhersage ist.
Die Kosten als durchschnittlicher Verlust über alle Datenpunkte $x^{(1)}, \dots, x^{(m)}$ : $\begin{matrix} (3) & J = \frac{1}{m} \sum_{i = 1}^{m} L (a^{(i)}, y^{(i)}) \end{matrix}$

Der Gradient für einen Datenpunkt $x$ : $\begin{matrix} (4) & \frac{\partial L}{\partial w} = (a - y) x \end{matrix}$
Der Gradient für alle Datenpunkte $X$ in Matrix-Notation: $\begin{matrix} (5) & \nabla J = \frac{\partial J}{\partial w} = \frac{1}{m} X (A - Y)^{T} \end{matrix}$

Übungsblätter/Aufgaben

Lernziele

(K2) Logistische Regression aus Sicht neuronaler Netze: Graphische Darstellung, Vergleich mit Perzeptron und linearer Regression
(K2) Formalisierung
(K2) Sigmoid-Aktivierungsfunktion
(K2) Verlust- und Kosten (Cross-Entropy Loss)
(K3) Gradientenabstieg für logistische Regression

Quizzes