Improved Baselines with Momentum Contrastive Learning (CVPR2020)

Balin

4 min readSep 6, 2021

Introduction

MoCo 的第二版，結合了 SimCLR，達到更好的效果。
下圖(a)的 end-to-end 為 SimCLR 的訓練方式，透過很大的 batch size 得到很多相同 batch 的 negative samples，並不會在下次訓練時用到，而(b)為 MoCo 的訓練方式，會把 negative samples 儲存到 queue，可於之後訓練時參考到。

Method

同樣是 InfoNCE loss，q 是 query (當前的影像)，k_+ 是 positive sample，k_- 是 negative sample，要讓 q 和 k_+ 越像且和 k_- 越不像。

SimCLR 透過以下方式強化 end-to-end 的訓練方式，並透過各種實驗證明其有效性。

很大的 batch size 得到更多的 negative samples
利用額外的 MLP head 去 minimize loss
用更強的 data augmentation

而在 MoCo 的框架中可以透過 queue 的方式得到很大的 negative samples，另外也可以無痛升級成類似 SimCLR 的方式，新增 MLP head 和運用其他 data augmentation 的技巧，因此將其結合在一起並提出 MoCov2。

stantiated. Next westudy these improvements in MoCo.

實做上是把 MoCo 原本 encoder (f_c) 的 head 換成 SimCLR 提出的 2-layer MLP head (hidden layer 2048-d, withReLU)
τ 即使用原本 default 的 0.07，加上 MLP head 也會有比較好的結果，但實驗結果來說加上 MLP 的 τ 在 0.2 可以有更好的結果。

這邊有做一些 ablation study，即使只有新增其他 augmentation (blur and stronger color distortion) 也可以提升 performance。

與 v1 和 SimCLR 的比較，MoCo v1 和 v2 就是差在有沒有額外的 MLP、新增其他 augmentation 和 Cosine learning rate。

batch size 會提高記憶體用量，但除此之外因為 end-to-end 的方法需要 back-propagate 兩個 encoder，因此相對 MoCo 來說也會比較耗記憶體資源。

Experiment

Reference

[arxiv]

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

Written by Balin

20 Followers

20 Following

NTUST CSIE

No responses yet

Write a response

What are your thoughts?

Also publish to my profile

Recommended from Medium

The 5 paid subscriptions I actually use in 2025 as a Staff Software Engineer

Level Up Coding

Jacob Bennett

The 5 paid subscriptions I actually use in 2025 as a Staff Software Engineer

Tools I use that are cheaper than Netflix

Jan 7

260

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jessica Stillman

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Oct 30, 2024

731

Lists

Natural Language Processing

1977 stories1619 saves

data science and AI

40 stories340 saves

Practical Guides to Machine Learning

10 stories2225 saves

Medium's Huge List of Publications Accepting Submissions

414 stories4678 saves

YOLOv12: Redefining Real-Time Object Detection 🚀

Henry Navarro

YOLOv12: Redefining Real-Time Object Detection 🚀

Introducing the Pioneering Features and Performance of YOLOv12 from the Latest Research

Feb 19

How To Generate Synthetic Images For Object Detection Tasks

TDS Archive

Dr. Leon Eversberg

How To Generate Synthetic Images For Object Detection Tasks

A step-by-step tutorial using Blender, Python, and 3D Assets

Mar 8, 2024

How I Am Using a Lifetime 100% Free Server

Harendra

How I Am Using a Lifetime 100% Free Server

Get a server with 24 GB RAM + 4 CPU + 200 GB Storage + Always Free

Oct 26, 2024

170

Google just confirmed the AI reality many programmers are desperately trying to deny

Coding Beauty

Tari Ibaba

Google just confirmed the AI reality many programmers are desperately trying to deny

AI is slowly taking over coding but many programmers are still sticking their head in the sand about what’s coming…

Feb 20

189

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams

Improved Baselines with Momentum Contrastive Learning (CVPR2020)

Introduction

Method

Experiment

Reference

Sign up to discover human stories that deepen your understanding of the world.

Free

Membership

Written by Balin

No responses yet

More from Balin

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation

Method

TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up

Introduction

Fine-Tuning StyleGAN2 For Cartoon Face Generation

Introduction

EfficientNetV2: Smaller Models and Faster Training

前言

Recommended from Medium

The 5 paid subscriptions I actually use in 2025 as a Staff Software Engineer

Tools I use that are cheaper than Netflix

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Lists

Natural Language Processing

data science and AI

Practical Guides to Machine Learning

Medium's Huge List of Publications Accepting Submissions

YOLOv12: Redefining Real-Time Object Detection 🚀

Introducing the Pioneering Features and Performance of YOLOv12 from the Latest Research

How To Generate Synthetic Images For Object Detection Tasks

A step-by-step tutorial using Blender, Python, and 3D Assets

How I Am Using a Lifetime 100% Free Server

Get a server with 24 GB RAM + 4 CPU + 200 GB Storage + Always Free

Google just confirmed the AI reality many programmers are desperately trying to deny

AI is slowly taking over coding but many programmers are still sticking their head in the sand about what’s coming…