I finally got around to reading the Involution #CVPR2021 paper (arxiv.org/abs/2103.06255). Here is a summary and some thoughts: 🧵👇 (1/n)
The method replaces traditional spatial convolution layers in a CNN with a type of dynamic convolution with a different convolution kernel for each (i,j) spatial location. The kernels are spatially varying, and data dependent, i.e., predicted. (2/n)