Learn policy gradient methods for trading. Understand REINFORCE, Advantage Actor-Critic (A2C), and Proximal Policy Optimisation (PPO). Policy gradient methods enable continuous action spaces for position sizing, making them ideal for portfolio allocation and multi asset trading agents.