ReAI_explorerBot

This was the final Project for the Reintegrating AI class at Brown (CS2951x).

Abstract

In this project, we explored machine learning in an embodied setting in order to investigate the unique challenges and opportunities presented by placing a learning agent in the real world. We developed a robot, ExplorerBot, capable of navigating the environment and avoiding obstacles. ExplorerBot, a two-wheeled robot constructed from 3D-printed parts and standard low-cost electronics, served as a test-bed for two primary control policies: the first was a hand-coded policy based on values from ExplorerBot's time-of-flight (ToF) distance sensors; the second was a learned policy based on deep Q-learning. We tested deep Q-learning on input from a forward facing camera, but after extensive training this approach failed to produce desirable navigation behavior. We hypothesize that this is due to the fact that typical deep Q-networks (DQN) require on the order of millions of samples, which we were unable to collect due to limitations of sampling frequency in the real world. To validate our DQN implementation, we then trained on input from the ToF sensors, a fundamentally easier problem. Although this DQN-ToF policy was not optimal in a reward-collection test, it allowed ExplorerBot to reliably navigate in its environment without crashing.

The paper and poster can be found in the paper/ folder.
Information on the robot and STL files can be found in hardware.md
Information on setting up the software can be found in depends.md

The master branch contains our failed attempt to train the DQN on camera images. The tof_dqn branch contains our successful attempt to train the DQN based on ToF distance sensors to avoid obstancles in the environment. The TensorFlow model in models/tof_model_robot_history_slower.ckpt contains the final successful network parameters that were trained in a 9-hour session.

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
camTests		camTests
paper		paper
vl6180_pi		vl6180_pi
.gitattributes		.gitattributes
.gitignore		.gitignore
DQN.py		DQN.py
README.md		README.md
depends.md		depends.md
drive.py		drive.py
drive_and_record.py		drive_and_record.py
explorer.screenrc		explorer.screenrc
hardware.md		hardware.md
hparams.py		hparams.py
hub.py		hub.py
links.md		links.md
request.py		request.py
run_server.sh		run_server.sh
server.py		server.py
tof_node.py		tof_node.py
tof_test.c		tof_test.c
tof_thread.py		tof_thread.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ReAI_explorerBot

Abstract

About

Releases

Packages

Contributors 3

Languages

IzzyBrand/ReAI_explorerBot

Folders and files

Latest commit

History

Repository files navigation

ReAI_explorerBot

Abstract

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages