[Paper Review] Sequence to Sequence Learning with Neural Networks
Briefly introduce what Seq2seq is
There are several large cloze-style context-question-answer datasets that were introduced 4~5 years ago: the CNN and Daily Mail news data, and the Children's Book Test. Thanks to the introduction of these large datasets, it became easier to apply deep-learning techniques, which seem to outperform all alternative approaches, to the text comprehension task.
Intro to Few-Shot Learning
[AlphaPose](http://www.mvig.org/research/alphapose.html) is an accurate multi-person pose estimator, and the first open-source system to achieve 70+ mAP (75 mAP) on the COCO dataset and 80+ mAP (82.1 mAP) on the MPII dataset.
In this post, I aim to address potential problems with RNN-based _neural machine translation (NMT)_ models and introduce the solution this paper proposes.
Improving the vector representation quality of Skip-gram (one of the Word2Vec methods): there are four ways to improve representation quality and computational efficiency.
How do we turn text into input for deep learning models?
Google Duplex is the name of the technology supporting Google Assistant. This service was first introduced in 2018 and has mainly been used for _booking_ via human-like phone calls.
In the biomedical field, instance segmentation is frequently used for tasks such as detecting tumors in radiographs and lesion segmentation. What is important with biomedical data is that the output should include localization. Let's look at what U-Net is and how it works.
Today, we are going to learn how to crawl iHerb using Python, focusing on information about supplements.
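As a taste of the approach, here is a minimal crawling sketch with `requests` and `BeautifulSoup`. The listing URL and CSS selectors below are illustrative assumptions, not iHerb's actual page structure; inspect the real markup (and the site's robots.txt) before running anything like this.

```python
import requests
from bs4 import BeautifulSoup

url = "https://www.iherb.com/c/supplements"  # hypothetical listing page
resp = requests.get(url, headers={"User-Agent": "Mozilla/5.0"})
soup = BeautifulSoup(resp.text, "html.parser")

# Assumed selectors for product name and price; adjust to the real markup.
for item in soup.select(".product-cell-container"):
    name = item.select_one(".product-title")
    price = item.select_one(".price")
    if name and price:
        print(name.get_text(strip=True), price.get_text(strip=True))
```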
See what YOLO is and how it works
Today we will look at different activation functions, especially the **ReLU (Rectified Linear Unit)** family. The role of an activation function in a neural network is to take a layer's input and map it to the output that goes into the next layer. Which activation function to use heavily depends on the task.
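For a quick reference, here is a minimal NumPy sketch of three members of the ReLU family; the `alpha` values shown are the commonly used defaults.

```python
import numpy as np

def relu(x):
    return np.maximum(0.0, x)             # zero for negative inputs

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)  # small slope for negative inputs

def elu(x, alpha=1.0):
    return np.where(x > 0, x, alpha * (np.exp(x) - 1))  # smooth negative part

x = np.array([-2.0, -0.5, 0.0, 1.5])
print(relu(x), leaky_relu(x), elu(x), sep="\n")
```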
Fast R-CNN starts from the idea: what if we use convolutional feature maps to generate region proposals?
Briefly introduce one of the most important object detection papers
In this posting, we will deal with the NLP model named Big Bird. As the title suggests, it is a model for _longer_ sequences. Let's take a brief look at the concept of Big Bird.
In this posting, I will talk about **Multi-Task Learning (MTL)** briefly and then deal with the NLP model named MT-DNN.
I will focus on the core part of this paper (factorized self-attention) and only briefly mention the rest.
What is One-Shot Learning? Examples with face recognition
Today, we will look at the basic approaches to anomaly detection.
Take baby steps towards becoming a Computer Vision master
Today we are going to cover some important papers on object detection using deep learning architectures.
In this posting, we will take a quick look at the NLP tasks that I picked for explanation.
Text Clustering - BERT + Dimension reduction + KMeans
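A rough sketch of that pipeline, assuming the `sentence-transformers` package as a convenient source of BERT-style embeddings; the model name, sample texts, and cluster count are placeholders.

```python
from sentence_transformers import SentenceTransformer
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

texts = ["good product", "fast shipping", "terrible quality", "arrived quickly"]

embeddings = SentenceTransformer("all-MiniLM-L6-v2").encode(texts)  # BERT-style vectors
reduced = PCA(n_components=2).fit_transform(embeddings)             # dimension reduction
labels = KMeans(n_clusters=2, n_init=10).fit_predict(reduced)       # clustering
print(labels)
```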
From SWA to TTA
BERT is a pre-trained model released by Google in 2018 that has been widely used ever since, achieving the highest performance on many NLP tasks. In this post, let's take a closer look at the detailed structure of BERT.
Rectified Adaptive Learning Rate (RAdam)
What is learning rate scheduler?
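For a concrete picture, a minimal PyTorch example using `StepLR`, one of the built-in schedulers; the model and hyperparameters here are placeholders.

```python
# StepLR multiplies the learning rate by `gamma` every `step_size` epochs.
import torch

model = torch.nn.Linear(10, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=30, gamma=0.1)

for epoch in range(60):
    # ... one epoch of training would go here ...
    optimizer.step()
    scheduler.step()

print(optimizer.param_groups[0]["lr"])  # 0.1 -> 0.01 -> 0.001 after 60 epochs
```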
In this post, we will dive into negative sampling for train/test instances in recommender systems.
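As a taste of the topic, a minimal sketch of uniform random negative sampling for implicit feedback; the interaction data and the sample size `k` are illustrative assumptions.

```python
# For each user, draw items the user never interacted with as negatives.
import random

n_items = 100
interactions = {0: {3, 7, 42}, 1: {5, 7}}  # user -> set of positive item ids

def sample_negatives(user, k=4):
    negatives = []
    while len(negatives) < k:
        item = random.randrange(n_items)
        if item not in interactions[user]:  # skip observed positives
            negatives.append(item)
    return negatives

print(sample_negatives(0))
```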
In this posting, we will review the paper titled "Attention Is All You Need," which introduces the attention mechanism and the Transformer architecture that are still widely used in NLP and other fields. BERT, which was covered in the last posting, is a typical NLP model built on this attention mechanism and the Transformer. Although attention and the Transformer are most active in NLP, they are also used in many areas where recurrent methods were previously used. From now on, let's take a closer look at what attention and the Transformer are.
Although most NLP models already offer pre-trained weights for multilingual data, it is still difficult to apply them directly to Korean. Korean is a complex language, so there are many aspects that the tokenizers used in these models do not handle well. No matter how well the pre-training was done, performance will be terrible if you don't build the subword vocab well.
Can we replace maxpooling with convolutional layers?
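As a quick sanity check of the idea, a stride-2 convolution downsamples exactly like 2x2 max pooling while adding learnable weights; here is a small PyTorch sketch.

```python
import torch
import torch.nn as nn

x = torch.randn(1, 16, 32, 32)

pool = nn.MaxPool2d(kernel_size=2, stride=2)
conv_pool = nn.Conv2d(16, 16, kernel_size=2, stride=2)  # learned "pooling"

print(pool(x).shape)       # torch.Size([1, 16, 16, 16])
print(conv_pool(x).shape)  # torch.Size([1, 16, 16, 16])
```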
In many cases, including image segmentation, a model consists of a downsampling part and an upsampling part, and the latter restores the feature map to an input-sized image. There are two ways to upsample in PyTorch: `nn.Upsample` and `nn.ConvTranspose2d`.
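A minimal comparison of the two routes: `nn.Upsample` interpolates with no parameters, while `nn.ConvTranspose2d` learns its upsampling kernel.

```python
import torch
import torch.nn as nn

x = torch.randn(1, 8, 16, 16)

up = nn.Upsample(scale_factor=2, mode="nearest")          # parameter-free
deconv = nn.ConvTranspose2d(8, 8, kernel_size=2, stride=2)  # learnable

print(up(x).shape)      # torch.Size([1, 8, 32, 32])
print(deconv(x).shape)  # torch.Size([1, 8, 32, 32])
```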
Today we are going to build a simple autoencoder model using PyTorch. We'll flatten the CIFAR-10 images into vectors and then train the autoencoder on this flattened data.
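A minimal sketch of such an autoencoder, assuming CIFAR-10 images flattened to 3\*32\*32 = 3072-dimensional vectors; the hidden size and the random stand-in batch are placeholders.

```python
import torch
import torch.nn as nn

class AutoEncoder(nn.Module):
    def __init__(self, in_dim=3 * 32 * 32, hidden=128):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU())
        self.decoder = nn.Sequential(nn.Linear(hidden, in_dim), nn.Sigmoid())

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = AutoEncoder()
x = torch.rand(4, 3 * 32 * 32)    # stand-in for a flattened CIFAR-10 batch
loss = nn.MSELoss()(model(x), x)  # reconstruction loss
loss.backward()
```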
Cross-validation is one of the most popular methods to evaluate model performance and tune model hyperparameters. Like the bootstrap, it belongs to the family of resampling methods. Today, we will go over several types of CV methods.
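As a preview, plain k-fold CV is a one-liner in scikit-learn:

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = load_iris(return_X_y=True)
scores = cross_val_score(LogisticRegression(max_iter=1000), X, y, cv=5)
print(scores.mean())  # mean accuracy across the 5 folds
```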
There are countless ways to perform NLP, and the methodologies are changing very quickly, so it is important to understand the latest trends. Since it is difficult to handle every method in a single posting, this posting covers only an overview of NLP; individual models will be covered in more detail in subsequent postings.
What is transfer learning and why?
Let's get familiar with list comprehensions as an alternative to for loops
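The same computation written both ways:

```python
# Collect the squares of even numbers, first with a for loop...
squares_loop = []
for n in range(10):
    if n % 2 == 0:
        squares_loop.append(n ** 2)

# ...then as a single list comprehension.
squares_comp = [n ** 2 for n in range(10) if n % 2 == 0]
assert squares_loop == squares_comp
```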
Regularization methods: early stopping and weight decay
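A brief PyTorch sketch of both: weight decay is a single optimizer argument, and early stopping is a small bookkeeping loop. The validation curve and patience value here are illustrative stand-ins.

```python
import torch

model = torch.nn.Linear(10, 1)
# Weight decay: an L2 penalty applied through the optimizer.
optimizer = torch.optim.SGD(model.parameters(), lr=0.01, weight_decay=1e-4)

# Early stopping: stop when validation loss hasn't improved for `patience` epochs.
val_losses = [0.9, 0.7, 0.6, 0.61, 0.62, 0.63, 0.64]  # stand-in validation curve
best, patience, bad = float("inf"), 3, 0
for epoch, val in enumerate(val_losses):
    if val < best:
        best, bad = val, 0
    else:
        bad += 1
        if bad >= patience:
            print(f"early stop at epoch {epoch}")
            break
```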
What are object localization, object detection, semantic segmentation, and instance segmentation?
Alternatives to the Fully Connected Layer (FC layer)
Fundamental recommender system models: content-based, collaborative filtering, and knowledge-based
The hybrid method, as the name suggests, is a mixture of methodologies such as content-based filtering (CBF) and collaborative filtering (CF).
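A minimal weighted-hybrid sketch, assuming precomputed CBF and CF scores and an illustrative mixing weight `alpha`:

```python
cbf_scores = {"item_a": 0.8, "item_b": 0.3}  # content-based scores
cf_scores = {"item_a": 0.5, "item_b": 0.9}   # collaborative-filtering scores
alpha = 0.6                                  # weight given to the CBF side

# Blend the two score lists into a single hybrid ranking.
hybrid = {item: alpha * cbf_scores[item] + (1 - alpha) * cf_scores[item]
          for item in cbf_scores}
print(max(hybrid, key=hybrid.get))  # recommend the top-scoring item
```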
Today, I will review the paper that first integrated the idea of latent variables with deep learning.
Let me tell you who we are