
Scaling SGD Batch Size to 32K for ImageNet Training #15

@nocotan

Description


In a word

Proposes Layer-wise Adaptive Rate Scaling (LARS) for large-batch training.

Paper link

https://digitalassets.lib.berkeley.edu/techreports/ucb/text/EECS-2017-156.pdf

Authors / Affiliations

Yang You, Igor Gitman, Boris Ginsburg (UC Berkeley)

Submission date (yyyy/MM/dd)

2017/09/16

Overview

(Overview figure from the paper; screenshot omitted.)

Novelty / Differences

Presented as the first method to apply a different learning rate to each layer.
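For reference, the per-layer ("local") learning rate that LARS computes can be written as follows (my transcription of the paper's formula; here η is the trust coefficient and β the weight-decay term):

```latex
% Local learning rate for layer l:
\lambda^{l} = \eta \, \frac{\lVert w^{l} \rVert}{\lVert \nabla L(w^{l}) \rVert + \beta \, \lVert w^{l} \rVert}
```

The update magnitude is thus tied to the ratio of the weight norm to the gradient norm, so layers whose gradients are large relative to their weights are automatically damped.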

Method

(Method figures from the paper; screenshots omitted.)
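A minimal NumPy sketch of one LARS step for a single layer, assuming momentum SGD with weight decay; the function and variable names are my own, not the authors' code, and the small epsilon is added only for numerical safety:

```python
import numpy as np

def lars_update(w, grad, velocity, global_lr=0.1, trust_coef=0.001,
                weight_decay=5e-4, momentum=0.9, eps=1e-9):
    """One LARS step for a single layer's weights (illustrative sketch)."""
    w_norm = np.linalg.norm(w)
    g_norm = np.linalg.norm(grad)
    # Layer-wise "local" learning rate: ratio of weight norm to gradient norm,
    # with weight decay folded into the denominator.
    local_lr = trust_coef * w_norm / (g_norm + weight_decay * w_norm + eps)
    # Standard momentum SGD update, scaled by the per-layer local LR.
    velocity = momentum * velocity + global_lr * local_lr * (grad + weight_decay * w)
    return w - velocity, velocity
```

Each layer calls this with its own `w`/`grad`/`velocity`, so layers with small weight-to-gradient ratios take proportionally smaller steps, which is what stabilizes very large batch sizes.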

Results

(Result figures and tables from the paper; screenshots omitted.)

Comments
