openreview.net/forum?id=TVHS5…
A very ... interesting 4 page paper at ICLR. I'm curious to see the reviewers' reactions.
Also presented a tweet length implementation of their module haha
def ConvMixr(h,d,k,p,n):
S,C,A=Sequential,Conv2d,lambda x:S(x,GELU(),BatchNorm2d(h))
R=type('',(S,),{'forward':lambda s,x:s[0](x)+x})
return S(A(C(3,h,p,p)),*[S(R(A(C(h,h,k,groups=h,padding=k//2))),A(C(h,h,1))) for i in range(d)],AdaptiveAvgPool2d((1,1)),Flatten(),Linear(h,n))
• • •
Missing some Tweet in this thread? You can try to
force a refresh
Some metadata for those curious about their #ICLR2020 reviews.
1. Histogram of the average reviews. 2. Top x% deciles
Seems like reviews this year at @iclr_conf are substantially lower than previous years. Probably an artifact of the new [1,3,6,8] reviewing system. (1/n)
@iclr_conf For experience:
Out of 7583 total #ICLR2020 reviews:
1078 "do not know much about this area"
2484 "have read many papers in this area"
2604 "have published 1 or 2 papers"
1417 "have published in this field for many years"
47% of reviews haven't published in this area!
@iclr_conf For thoroughness:
601 "made a quick assessment of the paper"
4099 "read the paper at least twice and used their best judgement"
2698 "read the paper thoroughly"