https://jameschen.io/jekyll/update/2024/02/12/mamba.html Mamba No. 5 (A Little Bit Of…)James ChenFeb 12, 2024 “…I attempt to provide a walkthrough of the essence of the Mamba state space model architecture, occasionally sacrificing some
What I Read: Adversarial Attacks on LLMs
https://lilianweng.github.io/posts/2023-10-25-adv-attack-llm/ Adversarial Attacks on LLMsLilian WengOctober 25, 2023 “Adversarial attacks are inputs that trigger the model to output something undesired.”
What I Read: Deep Generative Models
https://medium.com/@jrodthoughts/microsoft-research-unveils-three-efforts-to-advance-deep-generative-models-b1d2fe3395e8 Microsoft Research Unveils Three Efforts to Advance Deep Generative ModelsOptimus, FQ-GAN and Prevalent bring new ideas to apply generative models at large scale.Jesus RodriguezApr 27 “With the emergence of