https://www.kdnuggets.com/2023/03/multimodal-models-explained.html Multimodal Models ExplainedBy Nate RosidiMarch 27, 2023 “…multimodal learning is an exciting new field of AI that seeks to replicate this ability by combining information from multiple models. By
What I Read: Textless NLP
https://ai.facebook.com/blog/textless-nlp-generating-expressive-speech-from-raw-audio Textless NLP: Generating expressive speech from raw audioSeptember 9, 2021 “GSLM leverages recent breakthroughs in representation learning, allowing it to work directly from only raw audio signals… for potentially