<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>PPO on Tech Savvy Sailor</title><link>https://profile.captv.ovh/tags/ppo/</link><description>Recent content in PPO on Tech Savvy Sailor</description><generator>Hugo -- gohugo.io</generator><language>en</language><lastBuildDate>Wed, 18 Jun 2025 12:00:00 +0200</lastBuildDate><atom:link href="https://profile.captv.ovh/tags/ppo/index.xml" rel="self" type="application/rss+xml"/><item><title>Teaching a Ship (model) to Learn: From Classical Rules to Reinforcement Learning</title><link>https://profile.captv.ovh/posts/hands-on-autonomous-navigation-rl/</link><pubDate>Wed, 18 Jun 2025 12:00:00 +0200</pubDate><guid>https://profile.captv.ovh/posts/hands-on-autonomous-navigation-rl/</guid><description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; Phase 1 gave us a classical navigation system that works. Phase 2 asked a harder question: can a neural network learn to navigate without being told the rules? Short answer: yes, sort of. It&amp;rsquo;s faster. It&amp;rsquo;s sometimes smarter. It also runs aground in ways a trained officer never would. Here&amp;rsquo;s what six months of RL training taught me about autonomous ships, and about the limits of learning from scratch.&lt;/p&gt;</description></item></channel></rss>