Live with Jay Alammar, Josh Starmer, and Luis Serrano
Direct Preference Optimization (DPO) - How to fine-tune LLMs directly without reinforcement learning
Can quantum computers break the speed of information? 6d | Louis Serano
GRPO - Group Relative Policy Optimization - How DeepSeek trains reasoning models 20d | Louis Serano
Josh Starmer and Luis Serrano Live Q/A from Uphill at Bern! 1mo | Louis Serano
Discrete Dynamical Systems - Eigenvalues and Eigenvectors 2mo | Louis Serano
Mean, Variance, Skewness, and Kurtosis - Math for ML with Deeplearning.ai 3mo | Louis Serano
The three steps to make a reliable chatbot: Preamble, Fine-tuning, and RAG 3mo | Louis Serano
Newton's method for approximating zeros of polynomials - Math for ML with Deeplearning.ai 3mo | Louis Serano
The Stone-Weierstrass Theorem - How to approximate functions 3mo | Louis Serano
Keys, Queries, and Values: The celestial mechanics of attention 3mo | Louis Serano
Why is ChatGPT so bad at telling jokes (yet so good at writing poems?) 3mo | Louis Serano