https://nlpnewsletter.substack.com/p/nlp-research-in-the-era-of-llms NLP Research in the Era of LLMs5 Key Research Directions Without Much ComputeSebastian RuderDec 19, 2023 “In an era where running state-of-the-art models requires a garrison of expensive GPUs,
What I Read: Multi-Modal Retrieval-Augmented Generation
https://blog.llamaindex.ai/evaluating-multi-modal-retrieval-augmented-generation-db3ca824d428?gi=45acebfc0a3a Evaluating Multi-Modal Retrieval-Augmented GenerationLlamaIndexNov 16 “A natural starting point is to consider how evaluation was done in traditional, text-only RAG and then ask ourselves how this ought to be
What I Read: Adversarial Attacks on LLMs
https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/ Adversarial Attacks on LLMsLilian WengOctober 25, 2023 “Adversarial attacks are inputs that trigger the model to output something undesired.”
What I Read: Distributed Training, Finetuning
https://sumanthrh.com/post/distributed-and-efficient-finetuning/ Everything about Distributed Training and Efficient FinetuningSumanth R HegdeLast updated on Oct 13, 2023 “practical guidelines and gotchas with multi-GPU and multi-node training”