LBRY Block Explorer

LBRY Claims • fast-and-slow-learning-of-recurrent

d9e063bf9326967c2f74c7e70515f92c70241a45

Published By:
Created On: 29 May 2021 16:43:39 UTC
Transaction ID:
Cost: Free
Safe for Work: Yes
Fast and Slow Learning of Recurrent Independent Mechanisms (Machine Learning Paper Explained)
#metarim #deeprl #catastrophicforgetting

Reinforcement Learning is tricky in environments where the objective shifts over time. This paper studies agents in multi-task environments, which are usually subject to catastrophic forgetting. Building on the concept of Recurrent Independent Mechanisms (RIM), the authors propose to separate the learning procedures for the mechanism parameters (fast) and the attention parameters (slow), achieving better results, more stability, and even improved zero-shot transfer performance.
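For intuition, here is a minimal PyTorch sketch of the two-speed training idea: the recurrent mechanism parameters go into a fast optimizer group, while the attention parameters go into a slow one. This is not the authors' code; the class and method names (ToyModularAgent, mechanism_params, attention_params) and the learning rates are illustrative assumptions.

```python
import torch
from torch import nn

class ToyModularAgent(nn.Module):
    """Toy stand-in for a RIM-style agent: a set of recurrent modules
    ("mechanisms") plus an attention mechanism that routes between them."""
    def __init__(self, hidden=64, n_modules=4):
        super().__init__()
        # Mechanisms: independently updated recurrent modules (fast parameters)
        self.mechanisms = nn.ModuleList(nn.GRUCell(hidden, hidden) for _ in range(n_modules))
        # Attention: decides which modules are active (slow meta-parameters)
        self.attn_query = nn.Linear(hidden, hidden)
        self.attn_key = nn.Linear(hidden, hidden)

    def mechanism_params(self):
        return self.mechanisms.parameters()

    def attention_params(self):
        return list(self.attn_query.parameters()) + list(self.attn_key.parameters())

agent = ToyModularAgent()
# Fast learning rate for the mechanisms, slow learning rate for the attention.
optimizer = torch.optim.Adam([
    {"params": agent.mechanism_params(), "lr": 1e-3},   # fast-adapting modules
    {"params": agent.attention_params(), "lr": 1e-5},   # slowly changing meta-parameters
])
```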

OUTLINE:
0:00 - Intro & Overview
3:30 - Recombining pieces of knowledge
11:30 - Controllers as recurrent neural networks
14:20 - Recurrent Independent Mechanisms
21:20 - Learning at different time scales
28:40 - Experimental Results & My Criticism
44:20 - Conclusion & Comments

Paper: https://arxiv.org/abs/2105.08710
RIM Paper: https://arxiv.org/abs/1909.10893

Abstract:
Decomposing knowledge into interchangeable pieces promises a generalization advantage when there are changes in distribution. A learning agent interacting with its environment is likely to be faced with situations requiring novel combinations of existing pieces of knowledge. We hypothesize that such a decomposition of knowledge is particularly relevant for being able to generalize in a systematic manner to out-of-distribution changes. To study these ideas, we propose a particular training framework in which we assume that the pieces of knowledge an agent needs and its reward function are stationary and can be re-used across tasks. An attention mechanism dynamically selects which modules can be adapted to the current task, and the parameters of the selected modules are allowed to change quickly as the learner is confronted with variations in what it experiences, while the parameters of the attention mechanisms act as stable, slowly changing, meta-parameters. We focus on pieces of knowledge captured by an ensemble of modules sparsely communicating with each other via a bottleneck of attention. We find that meta-learning the modular aspects of the proposed system greatly helps in achieving faster adaptation in a reinforcement learning setup involving navigation in a partially observed grid world with image-level input. We also find that reversing the role of parameters and meta-parameters does not work nearly as well, suggesting a particular role for fast adaptation of the dynamically selected modules.
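As a rough illustration of the attention bottleneck described in the abstract, the toy function below scores each module's attention to the current input and activates only the top-k modules, in the spirit of RIMs. This is an assumed sketch, not the paper's implementation; all shapes, projections, and the choice k=2 are made up for the example.

```python
import torch
from torch import nn

def select_active_modules(module_states, inputs, query_proj, key_proj, k=2):
    """Toy RIM-style selection: modules emit queries, input tokens emit keys,
    and only the k modules that attend most strongly to the input are updated."""
    queries = query_proj(module_states)                   # (n_modules, d)
    keys = key_proj(inputs)                               # (n_inputs, d)
    scores = queries @ keys.t() / keys.shape[-1] ** 0.5   # scaled attention logits
    relevance = scores.max(dim=1).values                  # best input match per module
    return torch.topk(relevance, k).indices               # modules active this step

# Usage with made-up shapes: 4 module states, 3 input tokens, 2 active modules.
d = 16
q_proj, k_proj = nn.Linear(d, d), nn.Linear(d, d)
states = torch.randn(4, d)
obs = torch.randn(3, d)
active = select_active_modules(states, obs, q_proj, k_proj, k=2)
```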

Authors: Kanika Madan, Nan Rosemary Ke, Anirudh Goyal, Bernhard Schölkopf, Yoshua Bengio

Links:
TabNine Code Completion (Referral): http://bit.ly/tabnine-yannick
YouTube: https://www.youtube.com/c/yannickilcher
Twitter: https://twitter.com/ykilcher
Discord: https://discord.gg/4H8xxDF
BitChute: https://www.bitchute.com/channel/yannic-kilcher
Minds: https://www.mi
...
https://www.youtube.com/watch?v=kU-tWy_wr78
Author: Unspecified
Content Type: video/mp4
Language: English
