[🧠 GAN paper summary 🧠] "Eyes Tell All" <- An interesting short paper on how to detect fake images generated by GANs (at least the current GAN methods like StyleGAN v2!).
They use a simple heuristic: 1/
1. Crop the face and then eyes (they used dlib for this) 2. Segment out the pupils -> you get the predicted mask 3. Fit an ellipse -> you get the "GT" mask 4. Find BIoU between the 2) and 3)
2/
(BIoU is just a generalization of IoU where we focus only on the boundary pixels of the mask instead of the whole mask)
5. If BIoU is "too low" that means that the predicted pupil is deviating from the elliptical shape and thus the eyes i.e. the face is fake/generated!
3/
Obviously, there are some cons here as well - some real people do have irregular iris shape (i.e. those irises deviate from elliptical shape) and thus the method would produce false positives for real humans that have certain eye conditions.
It's an interesting topic in general that will become increasingly important - how can we make sure that the media we're presented with is genuine?
IMHO, in the long run, these CV-based approaches are futile and we'll have to resort to some kind of digital signatures.
5/
But for the time being, if you feel paranoid about an image, you can check whether the iris is irregular to determine whether the image is fake or not (at least until StyleGAN v3 comes out haha) #gans
6/
• • •
Missing some Tweet in this thread? You can try to
force a refresh
[big update 🥳] I'll be joining @Google@DeepMind later this year as a Research Engineer!!! 🤖😅❤
It feels surreal, I don't even know where to start. 1/
I didn't come from an "Ivy League" university. Not because I couldn't get in - but because when you're 19, from a less developed country, and you didn't have tech-savvy people around you growing up - you are not even aware that @Stanford/@MIT are a thing. 2/
I can deeply empathize with many because of that. Not all of us were lucky enough to get exposed to tech when we were 9. I "heard" about programming when I was 19! We had to compensate by working harder than everyone else. 3/
I've tried to give you all of the tips and tricks I could think of both for more productive learning in general and stuff specific to the RL field.
The structure of the blog: 1) Intro 2) RL 101 (getting you exposed to the terminology) 3) Cool things about RL (awesome RL apps!) 4) RL is not just roses 5) Getting started with RL 6) Going deeper - reading papers 7) Implementing an RL project from scratch 8) Related subfields
1) This time the project is still not completely ready**. I'm yet to achieve the published results - so I encourage you to contribute!
Many of you have been asking me whether you can work on a project with me and I'll finally start doing it that way - from now onwards. ❤
2) This repo has the ambition to grow and become the go-to resource for learning RL. So collaborators are definitely welcome as I won't always have the time myself.
** main reasons are:
a) I was very busy over the last 2 weeks
b) It currently takes ~5 days to fully train DQN