Preferential proximal policy optimization in reinforcement learning
(2023-12-01)
Proximal Policy Optimization (PPO), a policy gradient method, excels in reinforcement learning with its "surrogate" objective function and stochastic gradient ascent. However, PPO does not fully consider the significance ...
DL-based defense against polymorphic network attacks
(2024-01-01)
Network security is of vital importance in a world dominated by internet systems. These systems are vulnerable to large-scale, rapidly evolving attacks by sophisticated cyber attackers who can have the upper hand over the ...