Deep Learning Revision

By Jean de Dieu Nyandwi

Trends, ideas, and the latest news in deep learning and computer vision.

Deep Learning Revision - Issue #5

UViM - A Unified Modeling Approach for Vision with Learned Guiding Codes

UViM presents a single unified framework for handling different computer vision tasks such as panoptic segmentation (a task of labeling every pixel with a semantic label and segmenting/del…


Deep Learning Revision - Issue #4

Imagen - Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Imagen is a new state-of-the-art text-to-image model that combines extreme photorealism with deep language understanding. Imagen can generate images of super-high resolution from …


Flamingo: A Visual Language Model for Few-Shot Learning - Issue #3

Transfer learning and fine-tuning are the norm in the deep learning community: rather than training a vision or language model from scratch, you adapt a pre-trained model to the new task. But there is a challenge with fi…


Neural Networks Should be Able to Read Objects in an Image - Issue #2

Modern R-CNN object detectors are typically two-stage networks whose stages compute the object locations and the object information (object class and bounding box), respectively. Let’s take the example of Faster R-CNN. Its first stage is the Region Proposal Network (RPN)…


A Revised History of Deep Learning - Issue #1

In 1958, Frank Rosenblatt invented the perceptron, a very simple machine that would later become the core and origin of today's intelligent machines. The perceptron was a very simple binary classifier that could determine whether or not a given input image belonged to a…