Skip to content

Add DCA paper to Very Deep Learning Theory#5

Open
WhymustIhaveaname wants to merge 1 commit intodaviddao:masterfrom
WhymustIhaveaname:add-dca-shortcuts-paper
Open

Add DCA paper to Very Deep Learning Theory#5
WhymustIhaveaname wants to merge 1 commit intodaviddao:masterfrom
WhymustIhaveaname:add-dca-shortcuts-paper

Conversation

@WhymustIhaveaname
Copy link
Copy Markdown

Connects Difference-of-Convex optimization with residual learning. Shows DCA on a vanilla net is equivalent to SGD on a ResNet, offering a second-order explanation for why shortcuts work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant