Hello, world!

Which policy gradient algorithm are you using?

An in-depth review of some of the most famous policy gradient algorithms, starting from vanilla policy gradient, then discussing actor-critic algorithms, and finally visiting PPO. Reference implementations are shown and step-by-step improvements are discussed.
Read more →

An even more annotated Transformer

A step-by-step implementation of the Transformer model with even more annotations. Specific design choices are discussed and hidden implementation details are highlighted. The post ends with an example training run on the Multi30k machine translation dataset.
Read more →

ResNet, ResNeXt, RegNet,… what else?

A brief history of the most famous CNN architecture and how it was further improved. The evolution of the residual block is discussed and a procedure for designing the full residual network model is given.
Read more →

Hello World

My first blog post, where I share my ideas for this blog and my motivation for starting it.
Read more →