How to compute the inner product between two networks' parameters
Consider a neural network $f(x) = w_2^T \sigma(w_1^T x)$, where $\sigma(\cdot)$ is an activation function such as ReLU, and $w_1 \in R^{d \times k}$, $w_2 \in R^{k \times o}$ are two weight matrices (so $x \in R^d$ and $f(x) \in R^o$). I would like to compute the inner product between two initializations of the model's parameters, $\theta = (w_1, w_2)$ and $\theta' = (w'_1, w'_2)$. Should we stack all entries of the network's parameters into a single vector, i.e. $\theta$ and $\theta'$ each become one big vector with $d \times k + k \times o$ entries, and then just compute the inner product between the two vectors? Is that a good way to compute the inner product between two models' parameters to measure how similar they are?
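For concreteness, here is a minimal NumPy sketch of the flatten-and-dot approach I have in mind (the sizes `d`, `k`, `o` and the random initializations are just hypothetical examples); it also computes cosine similarity, since the raw inner product depends on the parameter scale:

```python
import numpy as np

rng = np.random.default_rng(0)
d, k, o = 8, 16, 4  # example dimensions (assumed, not fixed by the question)

# Two independent initializations theta = (w1, w2) and theta' = (w1p, w2p)
w1, w2 = rng.standard_normal((d, k)), rng.standard_normal((k, o))
w1p, w2p = rng.standard_normal((d, k)), rng.standard_normal((k, o))

def flatten(*mats):
    """Stack all parameter matrices into one vector of length d*k + k*o."""
    return np.concatenate([m.ravel() for m in mats])

theta = flatten(w1, w2)
theta_p = flatten(w1p, w2p)

inner = theta @ theta_p  # plain Euclidean inner product of the two flattened vectors
cosine = inner / (np.linalg.norm(theta) * np.linalg.norm(theta_p))  # scale-invariant version

print(f"inner product: {inner:.4f}, cosine similarity: {cosine:.4f}")
```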
Topic data-product neural-network
Category Data Science