Related Blog Post: For behind-the-scenes details and the full development journey, check out the companion Medium article: How I'm Building an Autonomous Pick-and-Place System with ROS 2 Jazzy and Gazebo Harmonic
The blog dives into simulation setup, robotic control, MoveIt Task Constructor, and lessons learned — perfect if you're curious about the engineering side or want to replicate the project from scratch.
This project integrates the Robotiq 2-Finger Gripper with a Universal Robots UR3 arm using ROS 2 Humble / Jazzy and Ignition Gazebo. It includes URDF models, ROS 2 control configuration, simulation launch files, MoveIt Task Constructor pick-and-place, vision-based object detection, LLM-driven task planning (Ollama), and demonstration recording for behavior cloning.
Note: This setup uses a fixed mimic joint configuration for the Robotiq gripper to support simulation in newer Gazebo (Harmonic). Only the primary `finger_joint` receives commands — the mimic joints follow automatically.
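For intuition, URDF mimic semantics mean each follower joint tracks the primary as `multiplier * primary + offset`. A minimal sketch — the joint names, multipliers, and offsets below are illustrative placeholders, not the values from this repo's URDF:

```python
# Sketch of URDF mimic-joint semantics: followers track the primary
# finger_joint as position = multiplier * primary + offset.
# Joint names and multiplier/offset values are illustrative placeholders,
# not this repo's actual URDF values.
MIMIC_JOINTS = {
    "right_outer_knuckle_joint": {"multiplier": -1.0, "offset": 0.0},
    "left_inner_knuckle_joint":  {"multiplier": -1.0, "offset": 0.0},
    "right_inner_knuckle_joint": {"multiplier": -1.0, "offset": 0.0},
    "left_inner_finger_joint":   {"multiplier":  1.0, "offset": 0.0},
    "right_inner_finger_joint":  {"multiplier":  1.0, "offset": 0.0},
}

def mimic_positions(primary: float) -> dict:
    """Positions of all follower joints for a given finger_joint command."""
    return {
        name: cfg["multiplier"] * primary + cfg["offset"]
        for name, cfg in MIMIC_JOINTS.items()
    }

# Commanding only finger_joint fully determines the gripper state:
print(mimic_positions(0.4))
```

This is why the controller configuration only exposes `finger_joint`: once its command is known, every other gripper joint's position is determined.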
Make sure you have ROS 2 Humble or ROS 2 Jazzy and Ignition Gazebo installed.
```bash
git clone https://github.com/darshmenon/UR3_ROS2_PICK_AND_PLACE.git
cd UR3_ROS2_PICK_AND_PLACE

# Set to humble or jazzy
export ROS_DISTRO=humble
```
```bash
sudo apt install ros-$ROS_DISTRO-rviz2 \
  ros-$ROS_DISTRO-joint-state-publisher \
  ros-$ROS_DISTRO-robot-state-publisher \
  ros-$ROS_DISTRO-ros2-control \
  ros-$ROS_DISTRO-ros2-controllers \
  ros-$ROS_DISTRO-controller-manager \
  ros-$ROS_DISTRO-joint-trajectory-controller \
  ros-$ROS_DISTRO-position-controllers \
  ros-$ROS_DISTRO-gz-ros2-control \
  ros-$ROS_DISTRO-ros2controlcli \
  ros-$ROS_DISTRO-moveit \
  ros-$ROS_DISTRO-moveit-ros-perception \
  ros-$ROS_DISTRO-simple-grasping \
  ros-$ROS_DISTRO-cv-bridge \
  ros-$ROS_DISTRO-tf2-ros \
  ros-$ROS_DISTRO-tf2-geometry-msgs \
  ros-$ROS_DISTRO-pcl-ros
```

Jazzy only — add these two extra packages:
```bash
sudo apt install ros-jazzy-ros-gz-sim ros-jazzy-ros-gz-bridge \
  ros-jazzy-moveit-planners-stomp
```

STOMP is not packaged for Humble, so leave it out on that distro — the planner init fails silently and is harmless.
```bash
pip3 install -r requirements.txt
```

Ollama is required for the LLM planner. Install it from https://ollama.com, then pull your preferred model:

```bash
ollama pull llama2:latest
```

Build and source the workspace:

```bash
colcon build --symlink-install
source install/setup.bash
```

This project supports MoveIt Task Constructor (MTC) for advanced pick-and-place planning.
This repo already includes a patched MTC source in src/moveit_task_constructor/ that works for both ROS 2 Humble and Jazzy — no extra cloning needed. Just build normally:
```bash
colcon build --symlink-install
```

MTC uses warehouse_ros_mongo to persist planning scenes and trajectories. MongoDB must be installed and running before launching the demo:
```bash
curl -fsSL https://www.mongodb.org/static/pgp/server-7.0.asc | \
  sudo gpg -o /usr/share/keyrings/mongodb-server-7.0.gpg --dearmor
echo "deb [ arch=amd64,arm64 signed-by=/usr/share/keyrings/mongodb-server-7.0.gpg ] https://repo.mongodb.org/apt/ubuntu jammy/mongodb-org/7.0 multiverse" | \
  sudo tee /etc/apt/sources.list.d/mongodb-org-7.0.list
sudo apt-get update && sudo apt-get install -y mongodb-org
sudo systemctl start mongod && sudo systemctl enable mongod
```

Verify it is running: `mongosh` should connect to `mongodb://127.0.0.1:27017`.
For Humble/Jazzy API differences and troubleshooting, see ur_mtc_pick_place_demo/README.md.
```bash
bash ur_mtc_pick_place_demo/scripts/robot.sh
```

Launches Gazebo + MoveIt + planning scene server + MTC demo in sequence.

```bash
# Launch the Gazebo simulation
ros2 launch ur_gazebo ur.gazebo.launch.py

# Run the point cloud pipeline
bash ur_mtc_pick_place_demo/scripts/pointcloud.sh

# View the UR3 model in RViz
ros2 launch ur_description view_ur.launch.py ur_type:=ur3

# Visualize the Robotiq 2F-85 gripper model
ros2 launch robotiq_2finger_grippers robotiq_2f_85_gripper_visualization/launch/test_2f_85_model.launch.py
```

Send a joint trajectory goal directly to the arm controller:

```bash
ros2 action send_goal /arm_controller/follow_joint_trajectory control_msgs/action/FollowJointTrajectory \
'{
  "trajectory": {
    "joint_names": [
      "shoulder_pan_joint",
      "shoulder_lift_joint",
      "elbow_joint",
      "wrist_1_joint",
      "wrist_2_joint",
      "wrist_3_joint"
    ],
    "points": [
      {
        "positions": [0.0, -1.57, 1.57, 0.0, 1.57, 0.0],
        "time_from_start": { "sec": 2, "nanosec": 0 }
      }
    ]
  }
}'
```

Run the arm + gripper loop controller:

```bash
python3 ~/UR3_ROS2_PICK_AND_PLACE/ur_system_tests/scripts/arm_gripper_loop_controller.py
```

Estimates grasp poses from the Intel D435 point cloud. Two backends:
| Backend | Method | Dependency |
|---|---|---|
| simple_grasping (primary) | PCL RANSAC → moveit_msgs/Grasp[] | ros-$ROS_DISTRO-simple-grasping |
| numpy centroid (fallback) | Colour HSV filter + centroid + height | built-in |
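The fallback backend's idea fits in a few lines of numpy: mask pixels inside an HSV range, take the mask's centroid, and read the object height from depth. This is a hedged sketch — the thresholds and return convention are assumptions, not the node's actual values:

```python
import numpy as np

def centroid_grasp(hsv_img, depth, h_lo=0, h_hi=10, s_lo=120, v_lo=70):
    """Estimate a grasp point: HSV colour mask -> pixel centroid -> height.

    hsv_img: (H, W, 3) uint8 HSV image; depth: (H, W) float32 metres.
    Threshold defaults roughly target red; the real node's values may differ.
    """
    h, s, v = hsv_img[..., 0], hsv_img[..., 1], hsv_img[..., 2]
    mask = (h >= h_lo) & (h <= h_hi) & (s >= s_lo) & (v >= v_lo)
    if not mask.any():
        return None                            # no coloured object in view
    rows, cols = np.nonzero(mask)
    cx, cy = float(cols.mean()), float(rows.mean())  # pixel centroid
    z = float(np.median(depth[mask]))          # robust depth at the object
    return cx, cy, z

# Tiny synthetic example: a "red" patch in the image centre
img = np.zeros((10, 10, 3), dtype=np.uint8)
img[4:6, 4:6] = (5, 200, 200)
dep = np.full((10, 10), 0.5, dtype=np.float32)
print(centroid_grasp(img, dep))  # -> (4.5, 4.5, 0.5)
```

The pixel centroid plus depth would then be deprojected through the camera intrinsics to a 3D grasp pose; that step is omitted here.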
```bash
ros2 launch ur_grasp grasp_detection.launch.py colour:=red
python3 testing/test_grasp.py --colour red --execute
```

```bash
source install/setup.bash
python3 ur_llm_planner/scripts/robot_gui.py
```

Features: live camera feed, preset poses, gripper control (Open/Half/Close), per-joint sliders, Pilz PTP execution.
```bash
ros2 run ur_moveit_demos custom_zigzag_motion
```

Wait at least 45 seconds after launching the simulation before running this.
```bash
chmod +x ~/UR3_ROS2_PICK_AND_PLACE/ur_mtc_pick_place_demo/scripts/robot.sh
~/UR3_ROS2_PICK_AND_PLACE/ur_mtc_pick_place_demo/scripts/robot.sh
```

This script launches the Gazebo simulation, MoveIt 2, the planning scene server, and the MTC pick-and-place demo.
Color + optional YOLO object detection + PCL-based cluster extraction from the Intel D435 camera.
```bash
ros2 launch ur_perception perception.launch.py
ros2 topic echo /detected_objects
# Annotated feed in RViz: /detection_image
```

Natural language to robot motion via a local Ollama model.
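A common pattern for this kind of planner is to prompt the model for a JSON action list and validate it before anything moves. The sketch below is a hedged illustration — the action schema, the allowed action names, and the use of Ollama's `/api/generate` endpoint are assumptions about the approach, not necessarily what `ur_llm_planner` actually does:

```python
import json
import urllib.request

# Hypothetical action vocabulary for validation; the real planner's
# schema may differ.
ALLOWED_ACTIONS = {"move_to", "open_gripper", "close_gripper", "place"}

def parse_plan(reply: str) -> list:
    """Validate the model's reply as a JSON list of action dicts."""
    plan = json.loads(reply)
    if not isinstance(plan, list):
        raise ValueError("plan must be a JSON list")
    for step in plan:
        if step.get("action") not in ALLOWED_ACTIONS:
            raise ValueError(f"unknown action: {step.get('action')}")
    return plan

def query_ollama(prompt: str, model: str = "llama2:latest") -> str:
    """Call a local Ollama server (requires `ollama serve` to be running)."""
    req = urllib.request.Request(
        "http://127.0.0.1:11434/api/generate",
        data=json.dumps({"model": model, "prompt": prompt,
                         "stream": False}).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]

# Offline example of the validation step:
reply = '[{"action": "move_to", "target": "red_block"}, {"action": "close_gripper"}]'
print(parse_plan(reply))
```

To run the actual planner node: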
```bash
ros2 launch ur_llm_planner llm_planner.launch.py
ros2 topic pub --once /llm_planner/command std_msgs/msg/String \
  "{data: 'pick up the red block and place it in the left bin'}"
```

```bash
ros2 launch ur_data_collector data_collector.launch.py
```
```bash
ros2 service call /data_collector/start_recording std_srvs/srv/Trigger
ros2 service call /data_collector/stop_recording std_srvs/srv/Trigger

python3 ur_data_collector/scripts/train_bc.py \
  --data_dir ~/ur3_demos \
  --output_dir ~/bc_policy \
  --epochs 50
```

SmolVLA is a compact VLA model from HuggingFace that takes a camera image + joint states and predicts robot actions directly from a natural-language task description. This replaces hardcoded waypoints with a learned policy.
Install lerobot (requires Python >= 3.11):
```bash
python3.11 -m pip install "git+https://github.com/huggingface/lerobot.git#egg=lerobot[smolvla]"
```

Run inference against the base model:
```bash
# Terminal 1 — start simulation
ros2 launch ur_gazebo ur.gazebo.launch.py

# Terminal 2 — run SmolVLA inference
ros2 launch ur_smolvla smolvla_inference.launch.py \
  task:="pick the red block and place it in the bin"
```

Run with a fine-tuned checkpoint:
```bash
ros2 launch ur_smolvla smolvla_inference.launch.py \
  checkpoint:=/path/to/your/checkpoint \
  task:="pick the red block"
```

The inference node subscribes to /camera_head/color/image_raw + /joint_states and publishes JointTrajectory commands to /arm_controller/joint_trajectory at 10 Hz. The camera is a simulated Intel D435 mounted at 0.50 m height with a 25° downward tilt, giving a clear view of the workspace.
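A hedged sketch of the glue such a node needs: converting one predicted action (assumed here to be six absolute joint positions) into a JointTrajectory-shaped command at the 10 Hz control period. The message is built as a plain dict mirroring trajectory_msgs fields so the sketch runs without ROS:

```python
UR3_JOINTS = [
    "shoulder_pan_joint", "shoulder_lift_joint", "elbow_joint",
    "wrist_1_joint", "wrist_2_joint", "wrist_3_joint",
]
CONTROL_PERIOD_S = 0.1  # 10 Hz, as stated for the inference node

def action_to_trajectory(action, joint_limit=3.14):
    """Clamp a 6-DoF action and wrap it as a one-point trajectory dict.

    The joint limit and absolute-position action convention are
    assumptions for this sketch, not SmolVLA's guaranteed output format.
    """
    if len(action) != len(UR3_JOINTS):
        raise ValueError("expected one target per arm joint")
    clamped = [max(-joint_limit, min(joint_limit, a)) for a in action]
    return {
        "joint_names": UR3_JOINTS,
        "points": [{
            "positions": clamped,
            # reach the target within one control period
            "time_from_start": {"sec": 0,
                                "nanosec": int(CONTROL_PERIOD_S * 1e9)},
        }],
    }

# An out-of-range wrist command gets clamped before publishing:
print(action_to_trajectory([0.0, -1.57, 1.57, 0.0, 1.57, 9.9]))
```

In the real node the same dict shape maps one-to-one onto `trajectory_msgs/msg/JointTrajectory` fields.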
Workflow to fine-tune SmolVLA on your own pick-and-place demos:
- Record demonstrations with `ur_data_collector` (saves HDF5 episodes)
- Convert to LeRobot dataset format and fine-tune SmolVLA
- Point `checkpoint:=` at your fine-tuned model and run inference
Note on UR3 Adaptation: The models trained in `mujoco-ur-arm-rl` are optimized for the UR5e arm. To use them effectively on the UR3, you will need to tweak the Gymnasium environments (to account for UR3 link lengths/workspace) and retrain the model. Also ensure spawned objects aren't placed too close to the robot base, as this causes reachability issues.
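The reachability caveat can be made concrete with a crude radial check. The reach values below are approximate published figures (UR3 ≈ 0.50 m, UR5e ≈ 0.85 m) and the minimum radius is a guessed placeholder, so treat this as a sketch rather than a real workspace model:

```python
import math

# Approximate maximum reach from the manufacturer's specs
ARM_REACH_M = {"ur3": 0.50, "ur5e": 0.85}

def is_reachable(x, y, z, arm="ur3", min_radius=0.15):
    """Crude check that an object sits inside the arm's annular workspace.

    min_radius rejects spawns too close to the base (a guessed value);
    a real check would run IK against the actual kinematic model.
    """
    r = math.sqrt(x * x + y * y + z * z)
    return min_radius <= r <= ARM_REACH_M[arm]

# A spawn that works for the UR5e but lies outside UR3 reach:
print(is_reachable(0.7, 0.0, 0.1, arm="ur5e"))  # True
print(is_reachable(0.7, 0.0, 0.1, arm="ur3"))   # False
```

A filter like this in the environment's reset can rule out both failure modes the note mentions: objects beyond the UR3's shorter reach and objects spawned on top of the base.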
Run a pre-trained Soft Actor-Critic (SAC) policy trained in MuJoCo directly on the simulated UR3. Two nodes are included:
- `ur_policy_node` — basic reach policy (arm joints only, no gripper)
- `shared_arm_policy_node` — full pick-and-place policy with arm + gripper
Run with a trained model:

```bash
# Terminal 1 — start simulation
ros2 launch ur_gazebo ur.gazebo.launch.py

# Terminal 2 — run shared-arm SAC policy
ros2 run mujoco_ur_rl_ros2 shared_arm_policy_node \
  --ros-args \
  -p model_path:=/path/to/best_model.zip \
  -p object_x:=0.45 -p object_y:=0.0 -p object_z:=0.045 \
  -p drop_x:=0.45 -p drop_y:=0.2 -p drop_z:=0.025
```

Or use the bundled Gazebo launch (boots simulation + policy together):
```bash
ros2 launch mujoco_ur_rl_ros2 gazebo_shared_arm_policy.launch.py \
  model_path:=/path/to/best_model.zip \
  launch_policy:=true
```

The policy subscribes to /joint_states and publishes JointTrajectory commands to /arm_controller/joint_trajectory and /gripper_controller/joint_trajectory at 10 Hz.
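Before the SAC network can act, the node has to assemble an observation vector from /joint_states plus the object/drop parameters. A sketch with an assumed layout (the real node's ordering and feature set may differ); note /joint_states does not guarantee joint order, so re-indexing by name matters:

```python
import numpy as np

ARM_JOINTS = [
    "shoulder_pan_joint", "shoulder_lift_joint", "elbow_joint",
    "wrist_1_joint", "wrist_2_joint", "wrist_3_joint",
]

def build_observation(joint_state, object_xyz, drop_xyz):
    """Concatenate arm joint positions/velocities with task parameters.

    joint_state: {"name": [...], "position": [...], "velocity": [...]}
    as on /joint_states. The 6 + 6 + 3 + 3 = 18-dim layout is an
    assumption for this sketch, not the trained policy's actual spec.
    """
    idx = {n: i for i, n in enumerate(joint_state["name"])}
    pos = [joint_state["position"][idx[j]] for j in ARM_JOINTS]
    vel = [joint_state["velocity"][idx[j]] for j in ARM_JOINTS]
    return np.concatenate([pos, vel, object_xyz, drop_xyz]).astype(np.float32)

# Deliberately scrambled joint order, as /joint_states may deliver it:
js = {
    "name": list(reversed(ARM_JOINTS)),
    "position": [5.0, 4.0, 3.0, 2.0, 1.0, 0.0],
    "velocity": [0.0] * 6,
}
obs = build_observation(js, [0.45, 0.0, 0.045], [0.45, 0.2, 0.025])
print(obs.shape)  # (18,)
```

Keeping this layout identical between the MuJoCo training environment and the Gazebo node is what makes direct policy transfer possible.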
Training environments (for re-training or fine-tuning) are in mujoco_ur_rl_ros2/envs/:
| Env | Description |
|---|---|
| `ur_gazebo_single_arm_env.py` | Single arm at origin — matches the Gazebo layout; use this to train a policy that transfers directly |
| `ur_pick_place_env.py` | Basic pick-place, simple reward |
| `shared_arm_env.py` | Multi-arm shared policy training |
| `ur_dual_arm_env.py` | Dual-arm scene with proven phase-based reward |
Train a Gazebo-compatible policy from scratch:
```bash
# from the repo root
python3 mujoco_ur_rl_ros2/train_gazebo_single_arm.py \
  --timesteps 2000000 \
  --n-envs 8 \
  --curriculum grasp_focus
```

Resume a previous run:
```bash
python3 mujoco_ur_rl_ros2/train_gazebo_single_arm.py \
  --timesteps 2000000 \
  --n-envs 8 \
  --curriculum grasp_focus \
  --resume models/gazebo_single_arm/<run>/best_model.zip
```

The best model is saved to `models/gazebo_single_arm/<run>/best_model.zip`, with checkpoints every 100k steps. Pass that path to `shared_arm_policy_node` above.
Key hyperparameters (in `train_gazebo_single_arm.py`):

- `--curriculum grasp_focus` — starts episodes near the object, critical for learning grasps
- `ent_coef=0.1` (fixed) — prevents SAC entropy from collapsing before grasps are discovered
- `--learning-rate`, `--buffer-size`, `--batch-size` — tunable via CLI
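The grasp_focus idea — start episodes with the end effector already near the object so grasp rewards are discovered early, then widen the start distribution as training progresses — can be sketched like this. The radii and the linear schedule are assumptions, not the script's actual values:

```python
import math
import random

def curriculum_start_offset(progress, near_radius=0.05, far_radius=0.40):
    """Sample an initial end-effector offset from the object.

    progress in [0, 1]: fraction of training completed. Early episodes
    start within near_radius of the object ("grasp focus"); the sampling
    radius grows linearly toward far_radius as training progresses.
    Radii and schedule are placeholder values for illustration.
    """
    radius = near_radius + progress * (far_radius - near_radius)
    angle = random.uniform(0.0, 2.0 * math.pi)   # direction in the plane
    dist = random.uniform(0.0, radius)           # distance up to `radius`
    return (dist * math.cos(angle), dist * math.sin(angle))

random.seed(0)
print(curriculum_start_offset(0.0))  # tight around the object
print(curriculum_start_offset(1.0))  # up to the full workspace radius
```

Without a schedule like this, random exploration rarely stumbles into a grasp, which is also why the entropy coefficient is pinned rather than auto-tuned.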
```bash
ros2 launch ur_gazebo full_demo.launch.py
ros2 launch ur_gazebo full_demo.launch.py use_llm_planner:=true
```

Feel free to open pull requests or issues for improvements or bug reports.