
Robotics learning

This is a fairly messy repo with a bunch of code snippets and notebooks tracking my progress in robotics and RL.

Projects

It currently contains a few small projects, outlined below.

Quadrotor Control

Altitude control of a quadrotor in MuJoCo; a minimal sketch of the idea follows the file list.

  • /rl/quadrotor_control/lqr.py - Main LQR code (yes, optimal control is kind of RL)
  • /rl/quadrotor_control/control.ipynb - Simulation and control results
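For intuition, here is a minimal LQR sketch on a double-integrator altitude model. This is an illustration, not the repo's actual code: the mass, cost weights, and linearised model are all assumptions.

```python
# Hedged LQR sketch: altitude as a double integrator, linearised about hover.
# State x = [z, z_dot], input u = thrust deviation from hover thrust.
import numpy as np
from scipy.linalg import solve_continuous_are

m = 1.0                      # assumed quadrotor mass [kg]
A = np.array([[0.0, 1.0],    # d(z)/dt     = z_dot
              [0.0, 0.0]])   # d(z_dot)/dt = u / m
B = np.array([[0.0],
              [1.0 / m]])
Q = np.diag([10.0, 1.0])     # penalise altitude error more than velocity
R = np.array([[0.1]])        # penalise control effort

P = solve_continuous_are(A, B, Q, R)  # solve the continuous-time Riccati equation
K = np.linalg.solve(R, B.T @ P)       # optimal gain, u = -K x

x = np.array([1.0, 0.0])              # start 1 m above the target altitude
u = -K @ x                            # feedback law pushes z back toward 0
print("gain:", K, "control:", u)
```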

Twitter post · Blog post

"Robot" arm

An extremely simple two-joint robot arm simulated in MuJoCo. It uses a custom-built, optimization-based inverse kinematics solver to move the arm to a target, implemented in JAX and MJX. It kind of works but is still a bit janky and could be made much more dynamic: for example, it relies on a hand-calculated Jacobian, which could instead be derived from rotation matrices (on the roadmap). A rough sketch of the Jacobian-based update appears after the file list and demo link below. The code can be found in /rl/robotarm with the following files:

  • /rl/robotarm/mjx.ipynb - Notebook responsible for running the simulation in MuJoCo
  • /rl/robotarm/robot_arm.xml - MuJoCo XML file describing the robot arm
  • /rl/robotarm/lib.py - Various functions related to the inverse kinematics of the arm

A little demo of how it works (hardcoded version without inverse kinematics): Twitter post
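As a rough illustration of the Jacobian-based update, here is a damped least-squares IK sketch for a 2-link planar arm. The link lengths, damping, and iteration count are assumptions for the example; the actual solver in /rl/robotarm/lib.py uses JAX and MJX.

```python
# Hedged IK sketch: damped least squares on a 2-link planar arm
# with a hand-derived Jacobian (same spirit as the repo's solver).
import numpy as np

L1, L2 = 1.0, 1.0  # assumed link lengths

def fk(q):
    """End-effector (x, y) from joint angles q = [q1, q2]."""
    return np.array([L1 * np.cos(q[0]) + L2 * np.cos(q[0] + q[1]),
                     L1 * np.sin(q[0]) + L2 * np.sin(q[0] + q[1])])

def jacobian(q):
    """Hand-calculated 2x2 Jacobian of fk with respect to q."""
    s1, c1 = np.sin(q[0]), np.cos(q[0])
    s12, c12 = np.sin(q[0] + q[1]), np.cos(q[0] + q[1])
    return np.array([[-L1 * s1 - L2 * s12, -L2 * s12],
                     [ L1 * c1 + L2 * c12,  L2 * c12]])

def ik(target, q=np.zeros(2), iters=100, damping=1e-3):
    """Iterate q toward the target with a damped pseudo-inverse step."""
    for _ in range(iters):
        err = target - fk(q)
        J = jacobian(q)
        # dq = J^T (J J^T + lambda I)^{-1} err
        q = q + J.T @ np.linalg.solve(J @ J.T + damping * np.eye(2), err)
    return q

q = ik(np.array([1.2, 0.8]))
print(q, fk(q))  # fk(q) should land close to the target
```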

Update - 25th December 2024

An updated version of the IK (inverse kinematics) solver, now using a 3-DOF robot arm. It actually works now. (Big shoutout to Alexis Fraudita's blog, which covers IK in MuJoCo really well.) The new version consists of these files, with a rough sketch of the update step after the list:

  • /rl/robotarm/ik.py - Inverse kinematics solver
  • /rl/robotarm/ik_test.py - Testing of the IK solver. Creates two images, initial.png and result.png, which show the starting pose and the end pose of the arm.
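Until the math write-up lands (see the TODO below), here is a hedged sketch of the kind of damped Gauss-Newton step such a MuJoCo-based solver takes, using MuJoCo's own site Jacobian instead of a hand-derived one. The XML path, site name, and hinge-joints-only assumption (nq == nv) are mine, not necessarily what ik.py does.

```python
# Hedged sketch of a MuJoCo Gauss-Newton IK step (illustrative, not ik.py itself).
import mujoco
import numpy as np

model = mujoco.MjModel.from_xml_path("robot_arm.xml")  # assumed path
data = mujoco.MjData(model)
site_id = mujoco.mj_name2id(model, mujoco.mjtObj.mjOBJ_SITE,
                            "end_effector")            # assumed site name

def ik_step(target, damping=1e-4):
    """One damped Gauss-Newton update of qpos toward a 3D target point."""
    mujoco.mj_forward(model, data)                     # refresh kinematics
    err = target - data.site_xpos[site_id]             # 3D position error
    jacp = np.zeros((3, model.nv))                     # positional Jacobian
    mujoco.mj_jacSite(model, data, jacp, None, site_id)
    # dq = J^T (J J^T + lambda I)^{-1} err
    dq = jacp.T @ np.linalg.solve(jacp @ jacp.T + damping * np.eye(3), err)
    data.qpos += dq                                    # assumes hinge joints (nq == nv)
    return np.linalg.norm(err)

target = np.array([0.3, 0.2, 0.4])  # example target point
for _ in range(200):
    if ik_step(target) < 1e-4:
        break
```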

The results look something like this:

Start: (image: starting pose of the robot arm)

End: (image: end pose of the robot arm, touching the target)

TODO:

  • Simple description of the math involved and how it was done

Multi-armed bandit

A very simple simulation of the multi-armed bandit problem, comparing a greedy and an $\epsilon$-greedy method. The exercise is taken from the chapter on multi-armed bandits in "Reinforcement Learning: An Introduction" by Richard Sutton and Andrew Barto. This was my first real reinforcement learning project, and something is wrong: the $\epsilon$-greedy method should perform much better than it does, which is also on the roadmap to fix. Most of the code lives in /rl/multiarmbandit.ipynb; a minimal sketch of the setup appears below.
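For reference, here is a minimal sketch of the 10-armed testbed comparison from the book. The epsilon, step count, and number of runs are assumptions; the notebook's actual code may differ.

```python
# Hedged sketch of the Sutton & Barto 10-armed testbed (Ch. 2):
# greedy (epsilon = 0) vs epsilon-greedy with sample-average estimates.
import numpy as np

rng = np.random.default_rng(0)

def run_bandit(epsilon, arms=10, steps=1000):
    """One bandit problem; returns the reward received at each step."""
    q_true = rng.normal(0.0, 1.0, arms)   # true action values
    Q = np.zeros(arms)                    # estimated action values
    N = np.zeros(arms)                    # pull counts
    rewards = np.zeros(steps)
    for t in range(steps):
        if rng.random() < epsilon:
            a = int(rng.integers(arms))   # explore
        else:
            a = int(np.argmax(Q))         # exploit
        r = rng.normal(q_true[a], 1.0)
        N[a] += 1
        Q[a] += (r - Q[a]) / N[a]         # incremental sample average
        rewards[t] = r
    return rewards

greedy = np.mean([run_bandit(0.0) for _ in range(200)], axis=0)
eps_greedy = np.mean([run_bandit(0.1) for _ in range(200)], axis=0)
print(greedy[-100:].mean(), eps_greedy[-100:].mean())  # eps-greedy should win
```

One common pitfall with this exercise is comparing a single run per method instead of averaging over many independent bandit problems; a single run is noisy enough to make greedy look better by chance.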

Currently learning

  • Interested in Model Predictive Control and how it can be used; experimenting with applying it to commaai's controls challenge
  • Looking into how I can get my simple robot arm into the real world using an Arduino
