Reinforcement Learning – dr.Pep

Skip to content

knows better

Single Sentence Summaries
Baking
AI & Data
Recent posts

Copyright © 2026 Pepijn van der Laan.
All rights reserved.

Determining the value function is a difficult problem that is nonetheless key to safely and effectively using reinforcement learning

Determining the value function is a difficult problem that is nonetheless key to safely and effectively using reinforcement learning

Brian Christian – The Alignment Problem

The analogies between human and machine learning strategies are skillfully narrated, but rather drawn out.

Tags

AI (21) Analytics (6) Art (14) Autobiography (7) Biology (7) Brain science (6) Business (12) China (9) Culture (53) Data science (8) Design (6) Economics (36) Economy (17) Entrepreneurship (28) Ethics (18) Evolution (9) Hacking (6) History (98) Innovation (75) Intelligence (7) Investing (8) IT (14) Japan (7) Journalism (13) Leadership (34) Linux (8) Management (9) Marketing (12) Mathematics (8) Philosophy (19) Physics (14) Politics (73) Psychology (13) Retail (6) Science (50) Silicon Valley (13) Sociology (14) Start-ups (7) Startup (15) Statistics (18) Strategy (12) Technology (69) Ubuntu 20 (8) USA (48) War (9)

Archives

Archives

Copyright © 2026 Pepijn van der Laan.
All rights reserved.