https://www.asapp.com/blog/reducing-the-high-cost-of-training-nlp-models-with-sru/
Reducing the High Cost of Training NLP Models With SRU++
By Tao Lei, PhD, Research Leader and Scientist at ASAPP
"The Transformer architecture was proposed to accelerate model training in NLP…"
What I Read: Neural Nets, How Brains Learn
https://www.quantamagazine.org/artificial-neural-nets-finally-yield-clues-to-how-brains-learn-20210218/
Artificial Neural Nets Finally Yield Clues to How Brains Learn
By Anil Ananthaswamy, Contributing Writer, February 18, 2021
"The learning algorithm that enables the runaway success of deep neural networks doesn't work in…"
What I Read: Continual Learning, Amnesia, Neural Networks
https://medium.com/dataseries/ibm-uses-continual-learning-to-avoid-the-amnesia-problem-in-neural-networks-ae8241e1f3a3
IBM Uses Continual Learning to Avoid The Amnesia Problem in Neural Networks
Using continual learning might avoid the famous catastrophic forgetting problem in neural networks.
By Jesus Rodriguez, Jan 25
"Building neural networks…"
What I Read: Ensemble, knowledge distillation, and self-distillation
https://www.microsoft.com/en-us/research/blog/three-mysteries-in-deep-learning-ensemble-knowledge-distillation-and-self-distillation/
Three mysteries in deep learning: Ensemble, knowledge distillation, and self-distillation
Published January 19, 2021
By Zeyuan Allen-Zhu, Senior Researcher, and Yuanzhi Li, Assistant Professor, Carnegie Mellon University
"…besides this small…"
What I Read: Transformer Networks to Answer Questions About Images
https://medium.com/dataseries/microsoft-uses-transformer-networks-to-answer-questions-about-images-with-minimum-training-f978c018bb72
Microsoft Uses Transformer Networks to Answer Questions About Images With Minimum Training
Unified VLP can understand concepts about scenic images by using pretrained models.
By Jesus Rodriguez, Jan 12
"Can we build deep…"