You can find our own implementation of PPO in TensorFlow and our training loop in the TensorFlow folder.
For further details about this project, you can read the corresponding paper we wrote as part of this university project. It is simply called Paper.pdf.
The config file used for training in MLAgents is called PoliceCar.yaml.
Here you can see our results (click on image to be forwarded to the youtube video):