Finetuning LLMs with LoRA and QLoRA: Insights from Hundreds of Experiments, by Sebastian Raschka, October 12, 2023 “LoRA is one of the …”
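The core idea the article benchmarks, adding a trainable low-rank update to a frozen weight matrix, fits in a few lines. A minimal sketch (illustrative only, not Raschka's experimental code; the layer size and the r/alpha values here are arbitrary):

```python
# Minimal LoRA-style layer: frozen base weight plus a trainable low-rank update
# scaled by alpha / r. Illustrative sketch, not the article's code.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features, out_features, r=8, alpha=16):
        super().__init__()
        self.base = nn.Linear(in_features, out_features)
        for p in self.base.parameters():
            p.requires_grad_(False)                      # freeze the pretrained weight and bias
        self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, r))  # zero init: update starts at 0
        self.scaling = alpha / r

    def forward(self, x):
        # base output plus the scaled low-rank update (B @ A) applied to x
        return self.base(x) + self.scaling * (x @ self.lora_A.T @ self.lora_B.T)

layer = LoRALinear(768, 768)
print(layer(torch.randn(2, 768)).shape)  # torch.Size([2, 768])
```

Only lora_A and lora_B receive gradients, which is why LoRA finetuning is so much cheaper than updating the full weight matrix.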
What I Read: Distributed Training, Finetuning
https://sumanthrh.com/post/distributed-and-efficient-finetuning/ Everything about Distributed Training and Efficient Finetuning, by Sumanth R Hegde, last updated Oct 13, 2023 “practical guidelines and gotchas with multi-GPU and multi-node training”
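For context on what those guidelines apply to, here is a bare-bones multi-GPU data-parallel training loop with PyTorch DDP (a generic skeleton, not code from the post; the dummy model and random batches stand in for a real finetuning job):

```python
# Minimal DDP skeleton; launch with: torchrun --nproc_per_node=NUM_GPUS script.py
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="nccl")      # torchrun sets RANK / WORLD_SIZE / MASTER_ADDR
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(512, 512).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])  # gradients are all-reduced across ranks
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for _ in range(10):                          # stand-in for a real dataloader loop
        x = torch.randn(32, 512, device=f"cuda:{local_rank}")
        loss = model(x).pow(2).mean()
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```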
What I Read: Retrieval Augmented Generation at scale
https://medium.com/@neum_ai/retrieval-augmented-generation-at-scale-building-a-distributed-system-for-synchronizing-and-eaa29162521 Retrieval Augmented Generation at scale — Building a distributed system for synchronizing and ingesting billions of text embeddings, by Neum AI, Sep 28 “…getting a Retrieval Augmented Generation (RAG) application started is…”
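The retrieval step that such a system scales up to billions of embeddings can be sketched in a few lines. This toy version (nothing Neum AI-specific; the hash-based embed() is a stand-in for a real embedding model, and the in-memory array for a vector database) just ranks documents by cosine similarity:

```python
# Toy illustration of the retrieval step in a RAG pipeline.
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    # Placeholder: a pseudo-random unit vector seeded from the text's hash.
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(dim)
    return v / np.linalg.norm(v)

documents = [
    "LoRA adds low-rank adapters to frozen weights.",
    "Vector databases store embeddings for similarity search.",
    "Distributed training shards work across many GPUs.",
]
index = np.stack([embed(d) for d in documents])   # (n_docs, dim), built once at ingest time

def retrieve(query: str, k: int = 2) -> list[str]:
    scores = index @ embed(query)                 # cosine similarity (vectors are unit-norm)
    top = np.argsort(scores)[::-1][:k]
    return [documents[i] for i in top]

print(retrieve("How do I search embeddings?"))
```

The article's focus is everything around this core loop: keeping the index synchronized with changing source data and ingesting it at scale.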
What I Read: Tiny Language Models
https://www.quantamagazine.org/tiny-language-models-thrive-with-gpt-4-as-a-teacher-20231005/ Tiny Language Models Come of Age, by Ben Brubaker, October 5, 2023 “To better understand how neural networks learn to simulate writing, researchers trained simpler versions on synthetic children’s stories.”
What I Read: To Understand Transformers, Focus on Attention
https://drscotthawley.github.io/blog/posts/Transformers1-Attention.html To Understand Transformers, Focus on Attention, by Scott H. Hawley, August 21, 2023
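The attention operation the post centers on is compact enough to write out directly. A minimal scaled dot-product attention sketch (illustrative, not code from the post; the shapes are arbitrary):

```python
# Scaled dot-product attention: softmax(Q K^T / sqrt(d_k)) V
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    d_k = Q.shape[-1]
    weights = softmax(Q @ K.T / np.sqrt(d_k))  # how much each query attends to each key
    return weights @ V                         # weighted sum of the values

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((4, 8)) for _ in range(3))
print(attention(Q, K, V).shape)  # (4, 8)
```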