- An implementation of Short Horizon Actor Critic (Xu; 2022) writen in Jax
- Simulation using the Mujoco MJX simulator
Run Time: 1 min jit, 2 min training
Known Issues: For some random seeds, you get drift in the cart position.
Run Time: 1 min jit, 2 min training
Known Issues: As seen in the rewards figure, training can be unstable.
- Having great difficulty applying SHAC to get Anymal to walk with default 32-bit precision. (See Mujoco for an example with 64-bit precision)
- Hypothesis: it's because quadruped gait is very contact-rich, leading to uninformative gradients.
![](https://private-user-images.githubusercontent.com/22626914/311497820-d774bea2-ef44-4370-8b77-b84594e780a4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2NTUxNjYsIm5iZiI6MTczOTY1NDg2NiwicGF0aCI6Ii8yMjYyNjkxNC8zMTE0OTc4MjAtZDc3NGJlYTItZWY0NC00MzcwLThiNzctYjg0NTk0ZTc4MGE0LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTUlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjE1VDIxMjc0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTExOTRkMTVkYWM4MmEyOGYxMDM4MDJlZTRiNmUxODM4ZjkzNTA4ZDM2MTg2NTk3ZjYwOTAzNWRiMGU4YTQzN2EmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.7Wad6Zq5vyuunl8LVzYZwPi3g_a62qFijpG_r8iJ028)
![](https://private-user-images.githubusercontent.com/22626914/311497834-6262b083-a2dc-4402-ac2e-0d25d76f5cb4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2NTUxNjYsIm5iZiI6MTczOTY1NDg2NiwicGF0aCI6Ii8yMjYyNjkxNC8zMTE0OTc4MzQtNjI2MmIwODMtYTJkYy00NDAyLWFjMmUtMGQyNWQ3NmY1Y2I0LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTUlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjE1VDIxMjc0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWMwN2Y5NzY5MzdmZjQyMjgyNWMwNzQ4YjEwOTljOGMxMTBhNGZkOGYxNjEzZThhZGUzMzVhMDRjMjc5YTlkODcmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.fOFBgS1bsIh-U2CC2inuy6zYSvof3DBLYPhxUVhcaTc)
![](https://private-user-images.githubusercontent.com/22626914/311497891-a939d83f-2075-4866-8a7a-0893ef892fdf.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk2NTUxNjYsIm5iZiI6MTczOTY1NDg2NiwicGF0aCI6Ii8yMjYyNjkxNC8zMTE0OTc4OTEtYTkzOWQ4M2YtMjA3NS00ODY2LThhN2EtMDg5M2VmODkyZmRmLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTUlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjE1VDIxMjc0NlomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWQ3MWFmOTA0MzA4NmViOWNmMzAxMDMzODY1ZWUxOGNlMDA3NzczZTViZGZkN2M1YWY1MTc4ZGNkOWY3NGFhZmImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0._D_rJaxJYTZON8ZPtvlQR1VOTFFUObexjZumTmnfLd4)
32-step rollout. Ground flashes red when step jacobian is greater than 10e2.
- pip install -r requirements.txt
- Add the parent folder of this repository to your PYTHONPATH environment variable.