large language model – Page 9 – Andrew Fairless, Ph.D.

What I Read: 3D human pose estimation

By Andrew Fairless on February 14, 2024 / December 19, 2023

https://perceiving-systems.blog/en/news/third-wave-3d-human-pose-and-shape-estimation
Third wave 3D human pose and shape estimation
Perceiving Systems Blog
04 December 2023
“…PoseGPT looks at the whole image and can reason about people in context…. we can now relate…”

What I Read: Limits of Transformers on Compositionality

By Andrew Fairless on February 13, 2024 / December 19, 2023

https://arxiv.org/abs/2305.18654
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri, Ximing Lu, Melanie Sclar, Xiang Lorraine Li, Liwei Jiang, Bill Yuchen Lin, Peter West, Chandra Bhagavatula, Ronan Le Bras, Jena…

What I Read: survey LLM tooling

By Andrew Fairless on February 8, 2024 / February 5, 2024

https://ericmjl.github.io/blog/2024/2/1/an-incomplete-and-opinionated-survey-of-llm-tooling/
An (incomplete and opinionated) survey of LLM tooling
Eric J. Ma
1/31/24 7:00 PM
“More critical than specifying specific tools, my goal with this blog post is to outline a framework…”

What I Read: Multi-Modal Retrieval-Augmented Generation

By Andrew Fairless on February 7, 2024 / December 19, 2023

https://blog.llamaindex.ai/evaluating-multi-modal-retrieval-augmented-generation-db3ca824d428?gi=45acebfc0a3a
Evaluating Multi-Modal Retrieval-Augmented Generation
LlamaIndex
Nov 16
“A natural starting point is to consider how evaluation was done in traditional, text-only RAG and then ask ourselves how this ought to be…”

What I Read: Adversarial Attacks on LLMs

By Andrew Fairless on February 6, 2024 / December 19, 2023

https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/
Adversarial Attacks on LLMs
Lilian Weng
October 25, 2023
“Adversarial attacks are inputs that trigger the model to output something undesired.”

What I Read: Finetuning LLMs Using LoRA

By Andrew Fairless on February 5, 2024 / December 18, 2023

https://magazine.sebastianraschka.com/p/practical-tips-for-finetuning-llms
Practical Tips for Finetuning LLMs Using LoRA (Low-Rank Adaptation)
Sebastian Raschka, PhD
Nov 19, 2023
“Low-rank adaptation (LoRA) is among the most widely used and effective techniques for efficiently training custom…”

What I Read: Nvidia, GPU gold rush

By Andrew Fairless on January 30, 2024 / November 20, 2023

https://blog.johnluttig.com/p/nvidia-envy-understanding-the-gpu
Nvidia Envy: understanding the GPU gold rush
John Luttig
Nov 10, 2023
“In 2023, thousands of companies and countries begged Nvidia to purchase more GPUs. Can the exponential demand endure?”

What I Read: Helping AI See

By Andrew Fairless on January 23, 2024 / November 20, 2023

https://www.quantamagazine.org/the-computing-pioneer-helping-ai-see-20231024/
The Computing Pioneer Helping AI See
Susan D’Agostino
October 24, 2023
““Patches of color and brightness require us to connect what we’re seeing now to our memory…”… machines see what is…”

What I Read: Enterprise AI, RAG + Fine Tuning

By Andrew Fairless on January 11, 2024 / November 20, 2023

https://www.montecarlodata.com/blog-the-moat-for-enterprise-ai-is-rag-fine-tuning/
The Moat for Enterprise AI is RAG + Fine Tuning – Here’s Why
Lior Gavish
Updated Nov 09 2023
“Data needs to be secure and private; LLM deployment needs to be…”

What I Read: Multimodality

By Andrew Fairless on January 4, 2024 / November 7, 2023

https://huyenchip.com/2023/10/10/multimodal.html
Multimodality and Large Multimodal Models (LMMs)
Chip Huyen
Oct 10, 2023
“For a long time, each ML model operated in one data mode… However, natural intelligence is not limited to just…”

Tag: large language model