History
Liked
Trending
Hot Dangdut
Hot Koplo
Indonesia Dance Hotlist
Indonesia Heavy Rock Hotlist
Rap Indo
Indo Indie
Lagu POPuler
Raja Rock
Fresh Indonesian Pop
All Time Indonesian Rock Hits
Dangdut '00-an
Dangdut '10-an
Pop Indonesia '00-an
Dangdut '70-an
Dangdut '80-an
Pop Indonesia '80-an
Dangdut '90-an
Pop Indonesia '10-an
Pop Indonesia '90-an
Classic Dangdut
Best of Indonesian Pop
In Love
Akustikan
Heartbroken
Modern Indonesian Pop Hits
Pop Play Dangdut
EDutM
Hot Campursari
Indonesian Divas
International Indo
dangdut
Lagu 80an
Lullaby
Dangdut Romantis
Indo goodies
Lagu favoritku
rock alternatif
Dewa 19
long ride - indo
Mood
Indonesia old
lagu lagu
Dangdut
dangdut
Aku dan Cinta
favorit
Indonesia Enak
nostalgia 90
campur
Indonesia Hits
Indo Hits
Indonesia
Wedding Songs 💍
campursari
indonesia
lagu lagu indonesia
Wedding
Indo
Pop Nostalgia 80an
lagu lama
pop kenangan
Indonesia Jadul
Nangis versi indo
Lagu Duniawi
Rizky's Playlist
indonesia 80s
loving day
accoustik
Manusia Indie
Dangdut
time to cryy
Menari radio
buat di motor
favorit
dangdut
Dangdut
2000's soul
karaokean asik
nostalgia
Indonesia playlist
Bintang di Langit Senja
ballad.
Indonesia Ok
menenangkan
Old Indonesian Songs
Chill indo
Indonesia
indonesia
2000 Indonesia pop
olah raga
dangdut
Dangdut
90s
Indonesia 2000
POP klasik
lagu dangdut
lagu kenanan
Indonesia Contemporary
indonesia songs
golden indo
Nostalgia Loop
lagu santai
Dangdut Azeek
【生成式AI導論 2024】第8講:大型語言模型修練史 — 第三階段: 參與實戰,打磨技巧 (Reinforcement Learning from Human Feedback, RLHF)
Length 36:58 • 39.9K Views • 6 months ago
Hung-yi Lee
📃 My History
Like
Share
Share:
Video Terkait
24:46
【生成式AI導論 2024】第9講:以大型語言模型打造的AI Agent (14:50 教你怎麼打造芙莉蓮一級魔法使考試中出現的泥人哥列姆)
42.2K
6 months ago
38:16
【生成式AI導論 2024】第10講:今日的語言模型是如何做文字接龍的 — 淺談Transformer (已經熟悉 Transformer 的同學可略過本講)
38.8K
6 months ago
1:16:15
Stanford CS224N | 2023 | Lecture 10 - Prompting, Reinforcement Learning from Human Feedback
56.1K
1 year ago
44:14
DPO V.S. RLHF 模型微调
2.6K
9 months ago
1:27:05
Transformer论文逐段精读
416.9K
3 years ago
11:29
Reinforcement Learning from Human Feedback (RLHF) Explained
11.7K
3 months ago
1:32:10
80分鐘快速了解大型語言模型 (5:30 有咒術迴戰雷)
151.7K
11 months ago
29:29
【生成式AI導論 2024】第1講:生成式AI是什麼?
222.8K
8 months ago
38:19
【生成式AI導論 2024】第7講:大型語言模型修練史 — 第二階段: 名師指點,發揮潛力 (兼談對 ChatGPT 做逆向工程與 LLaMA 時代的開始)
51.7K
7 months ago
1:00:38
Reinforcement Learning from Human Feedback: From Zero to chatGPT
172K
Streamed 1 year ago
45:16
【生成式AI導論 2024】第11講:大型語言模型在「想」什麼呢? — 淺談大型語言模型的可解釋性
36K
6 months ago
51:39
Stanford Webinar - The Frontier of Deep Learning for Robotics, Chelsea Finn
11.2K
1 year ago
21:51
【美国大选】特朗普 vs 哈里斯 经济政策有什么不一样?
1.7M
7 days ago
34:26
【生成式AI導論 2024】第6講:大型語言模型修練史 — 第一階段: 自我學習,累積實力 (熟悉機器學習的同學從 15:00 開始看起即可)
53.6K
7 months ago
8:55
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained
24.3K
10 months ago
23:34
【生成式AI】讓 AI 村民組成虛擬村莊會發生甚麼事?
216.3K
1 year ago
37:02
台大資訊 深度學習之應用 | ADL 8.1: LLM Adaptation 如何改變(洗腦?)語言模型
4.1K
3 weeks ago
27:14
How large language models work, a visual intro to transformers | Chapter 5, Deep Learning
3.5M
7 months ago
46:12
【生成式AI導論 2024】第12講:淺談檢定大型語言模型能力的各種方式
28.6K
5 months ago