https://huyenchip.com//2023/05/02/rlhf.html RLHF: Reinforcement Learning from Human FeedbackChip HuyenMay 2, 2023 “…making models like ChatGPT work. One such cool idea is RLHF (Reinforcement Learning from Human Feedback)…. So, how exactly does
What I Read: Computation, Artificial Intelligence
https://www.quantamagazine.org/a-new-approach-to-computation-reimagines-artificial-intelligence-20230413/ A New Approach to Computation Reimagines Artificial IntelligenceAnil AnanthaswamyApril 13, 2023 “By imbuing enormous vectors with semantic meaning, we can get machines to reason more abstractly — and efficiently
What I Read: Multi-label NLP
https://www.kdnuggets.com/2023/03/multilabel-nlp-analysis-class-imbalance-loss-function-approaches.html Multi-label NLP: An Analysis of Class Imbalance and Loss Function ApproachesOleksii Babych, Machine Learning Engineer at ProvectusMarch 17, 2023 “Multi-label NLP refers to the task of assigning multiple labels
What I Read: Unsupervised Learning Metrics
https://www.kdnuggets.com/2023/04/exploring-unsupervised-learning-metrics.html Exploring Unsupervised Learning MetricsCornellius Yudha WijayaApril 13, 2023 “This article will discuss the metrics used to evaluate unsupervised machine learning algorithms…”
What I Read: LLM applications, production
https://huyenchip.com//2023/04/11/llm-engineering.html Building LLM applications for productionChip HuyenApr 11, 2023 “It’s easy to make something cool with LLMs, but very hard to make something production-ready with them.”