History
Liked
Trending
Hot Dangdut
Hot Koplo
Indonesia Dance Hotlist
Indonesia Heavy Rock Hotlist
Rap Indo
Indo Indie
Lagu POPuler
Raja Rock
Fresh Indonesian Pop
All Time Indonesian Rock Hits
Dangdut '00-an
Dangdut '10-an
Pop Indonesia '00-an
Dangdut '70-an
Dangdut '80-an
Pop Indonesia '80-an
Dangdut '90-an
Pop Indonesia '10-an
Pop Indonesia '90-an
Classic Dangdut
Best of Indonesian Pop
In Love
Akustikan
Heartbroken
Modern Indonesian Pop Hits
Pop Play Dangdut
EDutM
Hot Campursari
Indonesian Divas
International Indo
2000 Indonesia pop
Love I
campursari
Dewa 19
long ride - indo
time to cryy
pop kenangan
Lagu favoritku
golden indo
Dangdut Romantis
perjuangan dan doa
lagu dangdut
dangdut top
Menari radio
lagu lagu
Aku dan Cinta
Chill n Listen
favorit
song Indonesia
indonesia's old vocals
Chill indo
campur
Dangdut
lagu santai
rock alternatif
dangdut
Lagu Duniawi
Indonesia
dangdut
Pop Nostalgia 80an
Mood
Indo
Indo Hits
lagu kenanan
Indonesia playlist
Indonesia
olah raga
Indonesia Jadul
semua
Dangdut
Wedding Songs 💍
Bintang di Langit Senja
Mood Booster
Lagu 80an
Indonesia old
dangdut
Indonesia
Dangdut
Indonesia 2000
Indonesia Hits
ballad.
lagu lama
accoustik
indonesia songs
Indo goodies
menenangkan
indonesia 80s
Indonesia Enak
favorit
indonesia
lagu lagu indonesia
Nangis versi indo
nostalgia 90
Indonesia
lagu Indonesia
Indo
Norra Indonesia
karaokean asik
POP klasik
My Indo Song Jam
2000's soul
lagu kenangan
Indonesia Contemporary
Rizky's Playlist
【生成式AI導論 2024】第8講:大型語言模型修練史 — 第三階段: 參與實戰,打磨技巧 (Reinforcement Learning from Human Feedback, RLHF)
Length 36:58 • 39.9K Views • 6 months ago
Hung-yi Lee
📃 My History
Like
Share
Share:
Video Terkait
24:46
【生成式AI導論 2024】第9講:以大型語言模型打造的AI Agent (14:50 教你怎麼打造芙莉蓮一級魔法使考試中出現的泥人哥列姆)
42.2K
6 months ago
38:16
【生成式AI導論 2024】第10講:今日的語言模型是如何做文字接龍的 — 淺談Transformer (已經熟悉 Transformer 的同學可略過本講)
38.8K
6 months ago
1:16:15
Stanford CS224N | 2023 | Lecture 10 - Prompting, Reinforcement Learning from Human Feedback
56.1K
1 year ago
44:14
DPO V.S. RLHF 模型微调
2.6K
9 months ago
1:27:05
Transformer论文逐段精读
416.9K
3 years ago
11:29
Reinforcement Learning from Human Feedback (RLHF) Explained
11.7K
3 months ago
1:32:10
80分鐘快速了解大型語言模型 (5:30 有咒術迴戰雷)
151.7K
11 months ago
29:29
【生成式AI導論 2024】第1講:生成式AI是什麼?
222.8K
8 months ago
38:19
【生成式AI導論 2024】第7講:大型語言模型修練史 — 第二階段: 名師指點,發揮潛力 (兼談對 ChatGPT 做逆向工程與 LLaMA 時代的開始)
51.7K
7 months ago
34:26
【生成式AI導論 2024】第6講:大型語言模型修練史 — 第一階段: 自我學習,累積實力 (熟悉機器學習的同學從 15:00 開始看起即可)
53.6K
7 months ago
1:00:38
Reinforcement Learning from Human Feedback: From Zero to chatGPT
172K
Streamed 1 year ago
45:16
【生成式AI導論 2024】第11講:大型語言模型在「想」什麼呢? — 淺談大型語言模型的可解釋性
36K
6 months ago
8:55
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained
24.3K
10 months ago
51:39
Stanford Webinar - The Frontier of Deep Learning for Robotics, Chelsea Finn
11.2K
1 year ago
1:44:31
Stanford CS229 I Machine Learning I Building Large Language Models (LLMs)
456.5K
2 months ago
23:34
【生成式AI】讓 AI 村民組成虛擬村莊會發生甚麼事?
216.2K
1 year ago
37:02
台大資訊 深度學習之應用 | ADL 8.1: LLM Adaptation 如何改變(洗腦?)語言模型
4.1K
3 weeks ago
46:12
【生成式AI導論 2024】第12講:淺談檢定大型語言模型能力的各種方式
28.6K
5 months ago
20:13
从零开始学习大语言模型(一)
225.1K
8 months ago