
GANs in Action: Deep learning with Generative Adversarial Networks

Summary
GANs in Action teaches you how to build and train your own Generative Adversarial Networks, one of the most important innovations in deep learning. In this book, you'll learn how to start building your own simple adversarial system as you explore the foundation of GAN architecture: the generator and discriminator networks.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Technology
Generative Adversarial Networks (GANs) are an incredible AI technology capable of creating images, sound, and video that are indistinguishable from the "real thing." By pitting two neural networks against each other (one to generate fakes and one to spot them), GANs rapidly learn to produce photo-realistic faces and other media objects. With the potential to produce stunningly realistic animations or shocking deepfakes, GANs are a huge step forward in deep learning systems.
About the Book
GANs in Action teaches you to build and train your own Generative Adversarial Networks. You'll start by creating simple generator and discriminator networks that are the foundation of GAN architecture. Then, following numerous hands-on examples, you'll train GANs to generate high-resolution images, perform image-to-image translation, and synthesize targeted data. Along the way, you'll find pro tips for making your system smart, effective, and fast.
What's inside

About the Reader
For data professionals with intermediate Python skills and a basic understanding of deep learning-based image processing.
About the Author
Jakub Langr is working on ML tooling and was a Computer Vision Lead at Founders Factory. Vladimir Bok is a Senior Product Manager overseeing machine learning infrastructure and research teams at a New York-based startup.
Table of Contents

PART 1 - INTRODUCTION TO GANS AND GENERATIVE MODELING
PART 2 - ADVANCED TOPICS IN GANS

276 pages, Paperback

Published October 8, 2019


About the author

Jakub Langr

3 books

Ratings & Reviews


Community Reviews

5 stars: 21 (43%)
4 stars: 20 (41%)
3 stars: 5 (10%)
2 stars: 2 (4%)
1 star: 0 (0%)
mahesh, 270 reviews, 24 followers
August 9, 2022
I was having trouble understanding GANs. This book, with its artistic analogies, helped me understand them and gave me an outlook on practical implementation.
Interesting read!
It takes time to pick up GANs, but if you want to understand how basic GANs work before diving into advanced ones, this book could be a great friend.
Dana Robinson, 234 reviews, 8 followers
May 27, 2019
A timely and well-written book on GANs. It does a great job of being practical while linking to the current literature. Absolutely worth reading.
Rob Hocking, 248 reviews, 12 followers
October 27, 2020
I think I first heard (read) the term "Generative Adversarial Network" in this article on Deep Fakes that my friend Cameron sent me a few months back.

https://www.forbes.com/sites/robtoews...

At the time I was beginning to learn basic deep learning on Coursera.com, and I hoped to have a chance to learn about GANs - but the topic was beyond the scope of what Coursera had available at the time. Then over the summer, I decided to try simultaneously going to Chinese school five days a week while also working, which exhausted me and made me stop learning AI.

But in late September, Coursera released a new series of courses on exactly this topic. Chinese school was over, so after getting some paper revisions out of the way, I devoured the first two courses in a week. The third course goes live tomorrow (October 28, 2020) - and while I was waiting for it to come out, I had this book priority-shipped from the States and read it cover to cover. I have a second, more advanced book arriving from England on Friday.

I've learnt a lot of cool things in my life. Here is a non-exhaustive list of what I consider to be the most interesting things I have ever learnt.

1. It is possible to break a sphere into five pieces and rearrange the pieces via rigid motions into two spheres, each with the same volume as the original. Moreover, this can also be done for hyperspheres in dimension four or higher, but cannot be done with circles in dimension two or line segments in dimension one.
2. Assuming an n-dimensional universe with physics of the form F = ma, and where gravity exists, is radially symmetric and obeys a 1/r^{n-1} law, solar systems can only form if n = 3. If n >= 4, the planets fly off into outer space, while for n <= 2, they fall into the sun. Although we know physics is *not* of the form F = ma, an analogous statement can be proven for Einstein's geodesic equations, which replaced F = ma.
3. There exist events for which it is not possible to assign a probability.
4. It is always possible to "add" points in n-dimensional space by adding their individual components. However, it is only possible to *multiply* points in n-dimensional space (in such a way that multiplication has most of the properties we expect and interacts with addition in the way we expect) if n is a power of 2. Moreover, while for n=2 we keep *all* the usual properties of multiplication and addition, for n=4 we lose the property a*b=b*a, while for n=8 we additionally lose a*(b*c) = (a*b)*c.
5. It is possible to define, by way of simple mathematical formulas, shapes that have finite area but infinite perimeter, and which contain within themselves infinitely many copies of themselves. Moreover, we can easily calculate the appearance of these shapes using a computer program, and zoom in on them until they are larger than the universe. We can keep zooming in forever, and never stop finding new details. Moreover, this can also be done in three dimensions (infinite surface area but finite volume). (A minimal rendering sketch follows this list.)
6. There exist different sizes of infinity. In particular, while the infinities of the integers, natural numbers, and rational numbers are all the same, the infinity of the real numbers is strictly bigger. Moreover, there are infinitely many infinities - so many that the number of infinities is a bigger infinity than any infinity in the set of all infinities, creating a paradox.
7. While time travel into the past appears to be impossible, it is possible to travel into the distant future by putting yourself into a situation where time flows at a faster rate. Moreover, by tuning things appropriately you can travel forward as many years as you like - ten years, a hundred years, a thousand years, 10^10 years, you name it. The simplest way of doing this is to sit near the event horizon of a black hole - because gravity speeds up the flow of time - and then leave after an hour or so. You can tune how far you go into the future by tuning how long you hang out near the event horizon and how far above it you are.
8. The notion of determinism - i.e. that the entire future of the universe is uniquely determined by the position and velocity of every particle in the universe - actually depends on which specific differential equations govern physics. In particular, there exist second-order differential equations for which a unique solution is NOT specified by the position and velocity at a given instant. Moreover, this type of equation has appeared in fluid mechanics - but as an approximate equation, rather than a true law of the universe. An experiment was done, and the experimenters observed that the real behaviour of the system seemed to "hop around" between the different solutions of the differential equations.
9. Special relativity - which says that time flows at different rates depending on your situation - can be seen as inevitable once we decide that the laws of physics should obey certain symmetries which seem reasonable (i.e. that they don't vary from place to place, don't depend on your orientation or your speed) and that Maxwell's equations of electromagnetism exist. Therefore, if special-relativity seems counter-intuitive, one must ponder what a universe without the above symmetries and without Maxwell's equations would be like, and whether or not it would be more strange than special relativity.
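For item 5, the claim that a simple formula generates an infinitely detailed shape is easy to verify yourself. Here is a minimal sketch that renders the Mandelbrot set as ASCII art (the window coordinates and iteration cap are illustrative choices, not canonical values):

```python
# Render the Mandelbrot set: iterate z -> z*z + c for each point c on a
# grid over the complex plane; points whose orbit stays bounded are "in".
def mandelbrot_rows(width=72, height=28, max_iter=40):
    for row in range(height):
        y = -1.2 + 2.4 * row / (height - 1)
        line = ""
        for col in range(width):
            x = -2.1 + 2.8 * col / (width - 1)
            z, c = 0j, complex(x, y)
            for _ in range(max_iter):
                z = z * z + c        # the entire fractal comes from this line
                if abs(z) > 2:       # escaped: c is outside the set
                    break
            line += "#" if abs(z) <= 2 else " "
        yield line

for line in mandelbrot_rows():
    print(line)
```

Shrinking the coordinate window around any boundary point "zooms in", and the detail never runs out.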

I think that GANs are as interesting as anything in the list above. Allow me to briefly explain what they are:

To be honest, the genius of the idea astonishes me. We have this concept of an arms race - the most obvious examples (to me at least) being geopolitical: the USA/Soviet nuclear arms race, but also the naval arms race between Britain and Germany leading up to World War I - and then you realize the concept can also be applied to biology. It's been a while since I read Dawkins so I forget the specific examples, but it doesn't matter - you have prey that develops some defence, so the predator has to evolve a way around the defence, so the prey has to make the defence stronger, and so on until both are investing an absurd amount of resources just to keep up with one another.

But the common component of all these situations is that we're using arms races as a theoretical tool to explain what we observe in nature. The genius of GANs is to turn this around and instead treat arms races as a mathematical phenomenon whose power we want to harness. Effectively you're saying "we want ability X of agent A to develop very quickly. What should we do?" and answering "let's introduce a second agent B with ability Y that counterbalances X, and induce an arms race. X and Y will both develop extremely quickly, and when we're done, we throw away agent B".
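In fact, this is exactly how the original GAN paper (Goodfellow et al., 2014) formalizes the arms race: a two-player minimax game in which the discriminator D and the generator G fight over a single value function. As a sketch of that standard objective:

```latex
\min_G \max_D \; V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\bigl[\log D(x)\bigr]
  + \mathbb{E}_{z \sim p_z(z)}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]
```

Here D tries to push its score toward 1 on real samples x and toward 0 on fakes G(z), while G tries to drag D's score on its fakes toward 1.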

In more concrete terms, we're doing this. We have two neural networks - a generator network and a discriminator network. The job of the generator is to make fake pictures of something. In an example I have coded myself, they are handwritten digits from a giant database of handwritten digits.

The job of the discriminator is to take in an image and say whether it's a real image from the database or a fake created by the generator. Every round, we feed the discriminator a bunch of fake images made by the generator, as well as real images from the database, but we don't tell it which is which. It makes its judgement, and then we tell it the answer. It looks at what it did wrong, and improves itself in order to do better next time. Next, we get the generator to send a bunch of fake images to the discriminator, who judges them as real or fake, and we send the judgements back to the generator (this time NOT telling the discriminator whether or not it was correct). The generator looks at which images fooled the discriminator and which ones didn't, tries to learn from its mistakes, and does better in the next round. An arms race ensues.
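A minimal sketch of that two-round loop, assuming PyTorch and the MNIST handwritten-digit dataset the reviewer mentions (the network sizes and hyperparameters are illustrative, not the book's code):

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

device = "cuda" if torch.cuda.is_available() else "cpu"

# Generator: 100-dim noise vector -> flattened 28x28 image in [-1, 1].
G = nn.Sequential(nn.Linear(100, 256), nn.ReLU(),
                  nn.Linear(256, 784), nn.Tanh()).to(device)

# Discriminator: flattened image -> probability the image is real.
D = nn.Sequential(nn.Linear(784, 256), nn.LeakyReLU(0.2),
                  nn.Linear(256, 1), nn.Sigmoid()).to(device)

opt_G = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_D = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCELoss()

data = DataLoader(
    datasets.MNIST(".", download=True, transform=transforms.Compose(
        [transforms.ToTensor(), transforms.Normalize((0.5,), (0.5,))])),
    batch_size=128, shuffle=True)

for real, _ in data:
    real = real.view(real.size(0), -1).to(device)
    batch = real.size(0)
    ones = torch.ones(batch, 1, device=device)
    zeros = torch.zeros(batch, 1, device=device)

    # Round 1: the discriminator judges real images and the generator's
    # fakes, is told the answers, and updates itself.
    fake = G(torch.randn(batch, 100, device=device)).detach()
    loss_D = bce(D(real), ones) + bce(D(fake), zeros)
    opt_D.zero_grad(); loss_D.backward(); opt_D.step()

    # Round 2: the generator sends fakes and learns from the judgements;
    # the discriminator is NOT updated in this round.
    fake = G(torch.randn(batch, 100, device=device))
    loss_G = bce(D(fake), ones)  # reward fakes the discriminator calls real
    opt_G.zero_grad(); loss_G.backward(); opt_G.step()
```

One pass over the loader is one epoch of the arms race; repeating it for a few dozen epochs is what produces the step-by-step progression described next.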

To me, looking at the output is positively magical. At the start, the generator is just making noise, but by step 2000 we already have something vaguely numeral-like. By step 5000 they are definitely starting to look like numbers, and by step 23000 they are practically indistinguishable from the original data set (it's unfortunate I can't include pictures in my review).

But here's where things get interesting. It turns out that while GANs have incredible potential, they are notoriously difficult to make work, because they are only useful if you can maintain - throughout the entire arms race - a balance of power between the discriminator and the generator. The biggest problem turns out to be the discriminator learning too quickly: it dismisses the generator's early output as obviously fake random noise, thereby depriving the generator of clues it can use to improve. So you have to deliberately start off with a discriminator that is as bad at spotting fakes as the untrained generator is at making convincing ones. You have to make sure the two improve together at roughly the same rate, without either ever gaining a decisive edge. The moment one side pulls ahead, the process breaks down and both stop improving.
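Two widely used fixes for exactly this failure mode are one-sided label smoothing and asymmetric learning rates. A sketch, reusing the names (D, G, bce, real, fake, zeros, batch, device) from the training loop above; the specific values are illustrative:

```python
import torch

# 1) One-sided label smoothing: train the discriminator toward 0.9
#    rather than 1.0 on real images, so it never becomes perfectly
#    confident and keeps leaking usable clues to the generator.
smooth_real = torch.full((batch, 1), 0.9, device=device)
loss_D = bce(D(real), smooth_real) + bce(D(fake), zeros)

# 2) Asymmetric learning rates: if the discriminator keeps winning,
#    slow it down relative to the generator so neither side gains a
#    decisive edge.
opt_D = torch.optim.Adam(D.parameters(), lr=1e-4)  # slower
opt_G = torch.optim.Adam(G.parameters(), lr=4e-4)  # faster
```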

This suggests that arms races have an inherent instability to them, and that while we deliberately create an artificial arms race in order to extract its power, this inherent instability makes the situation difficult to control. I find this interesting because another book I am reading - "The Narrow Corridor" - essentially argues that "liberty" is the result of an arms race between society and government and only exists when there is a balance of power between the two. If arms races are inherently unstable, this suggests that making society persist in a state of "liberty" should be difficult - societies should naturally "want to" diverge towards authoritarianism or anarchy (I'm not saying this book is correct - I don't know - but it's an interesting idea).

GANs are even more interesting in the context of "image to image translation". I haven't had a chance to code this up myself yet, but you can see a few examples here: https://phillipi.github.io/pix2pix/
Ivo Fernandes, 102 reviews, 10 followers
August 15, 2022
The subject is quite new but super interesting. GANs are a bit like other machine learning models, but they are trained to distinguish real data from generated data; that's how I would summarize it.

So the model has a discriminator and a generator. The discriminator tries to distinguish which data is real, and the generator tries to fool the discriminator by generating data that is statistically as close as possible to the real data. It's quite crazy, but this can actually create things like new images and new songs, and it resembles some creativity. Super interesting topic.
Alvin, 1 review, 1 follower
June 19, 2025
A nice book for a brief, unified summary of what's going on with generative AI. It's balanced between practical and theoretical aspects, with a slight lean toward applications. I found it a nice starting point toward understanding other models like diffusion and flow matching.
The structure makes it easy to follow and to compare the improvements each architecture brings. I'm especially fond of the introduction to adversarial examples, which extends beyond testing GANs to other ML models.
Niekoniecznie, 43 reviews
October 12, 2023
So much out-of-date code! Even though you can check the corrected code on the author's GitHub, it was last corrected four years ago and plenty of errors remain unresolved. If you want to learn about GANs in 2023, better to find a newer book.
