A set of tasks that aim to gain a high level understanding of images or video. Typical tasks include image recognition, object detection, pose estimation and much more.
See moreIn this article you will learn how the vision transformer works for image classification problems. We distill all the important details you need to grasp along with reasons it can work very well given enough data for pretraining.
Learn all there is to know about transformer architectures in computer vision, aka ViT.
Explaining the mathematics behind generative learning and latent variable models and how Variational Autoencoders (VAE) were formulated (code included)
A deep dive into the mathematics and the intuition of diffusion models. Learn how the diffusion process is formulated, how we can guide the diffusion, the main principle behind stable diffusion, and their connections to score-based models.
GANs are constructed by two neural networks that compete against each other in a adversarial game, and are proven to be ideal for generating novel data.
See moreThe first article of the GANs in computer vision series - an introduction to generative learning, adversarial learning, gan training algorithm, conditional image generation, mode collapse, mutual information
The third article-series of GAN in computer vision - we encounter some of the most advanced training concepts such as Wasserstein distance, adopt a game theory aspect in the training of GAN, and study the incremental/progressive generative training to reach a megapixel resolution.
GNNs are able to extract features from graphs and produce invaluable insights
See moreStart with Graph Neural Networks from zero and implement a graph convolutional layer in Pytorch
Explore the most popular gnn architectures such as gcn, gat, mpnn, graphsage and temporal graph networks
How can we efficiently train very deep neural network architectures? What are the best in-layer normalization options? We gathered all you need about normalization in transformers, recurrent neural nets, convolutional neural networks.
What is Explainable Artificial Intelligence (XAI), what are the most popular methods, where and how can it be applied
Deep Learning can also be applied in healthcare and medical applications to solve problems such as diagnosis, prognosis and cure. Understanding medical images is a big part of that endeavour
See moreMultiple introductory concepts regarding deep learning in medical imaging, such as coordinate system and dicom data extraction from the machine learning perspective.
Learn how to apply 3D transformations for medical image preprocessing and augmentation, to setup your awesome deep learning pipeline
An area of Computer Science that focuses on processing and modeling Language. The most popular examples are language translation, voice recognition and text generation.
See moreLearn everything there is to know about the attention mechanisms of the infamous transformer, through 10+1 hidden insights and observations
Understand how positional embeddings emerged and how we use the inside self-attention to model highly structured data such as images
Reinforcement learning is an area of Machine Learning that is about taking suitable action to maximize reward in a particular situation. It has been widely used in solving games but has also numerous applications in real problems.
See moreActor critics, A2C, A3C
Explore Policy-based methods and dive into policy gradients
Unsupervised Learning is a research field where model are trained without labeled data
See moreA general perspective on understanding self-supervised representation learning methods.
Learn how to implement the infamous contrastive self-supervised learning method called SimCLR. Step by step implementation in PyTorch and PyTorch-lightning