CruzHacks2019(Winner) - CrowdLearning

Check out us on Dev Post

Inspiration

Looking at Amazon Turk, and how they crowdsource labeling for their datasets, we wanted to aim at a similar crowdsourcing model for reinforcement learning where imitation learning could be outsourced to the public and users could be rewarded with micropayments each time they successfully improved an existing model. This could very well turn into an App Crowd play system similar to Twitch live gaming. For example, if we were using autonomous self-driving car simulations, we could run an autonomous race league where 1000's of users would participate and compete (while training our model) and turn the event into a "Gamified micropayments for model reinforcement".

What it does

CrowdLearning is an application that outsources imitation learning using dagger to the public wherein a user can help training a model and earn micropayments through Hedera each time they help improve the existing model. To demo our project, we will be using an autonomous self-driving car simulation which is currently not 100% accurate. As users helps improve the model's accuracy in a game simulation, they earn hbar credits each time the model is improved. Conversely, if they make suggestions that potentially make the model worse than it currently is, they would be paying hbars using hedera instead.

How we built it

For the micro-payments, we used Hedera's golang sdk and turned it into a micro-service that is first dockerized into a container and then hosted on a kubernetes cluster to handle load balancing.

For the imitation learning, the initial autonomous self-driving car model is not fully trained. This simulation is then deployed on a unity frontend as a driving simulation game where the user can correct the autonomous simulation and help improve its accuracy. We use tensorflow to test the users input against the model (almost in real time) to check if it has improved the model or not and accordingly instantly issue a micropayment to the user's account.

What's next for CrowdLearning

Turn this into a full-fledged online app crowd play where 1000+ users would join in to play the simulation and help model reinforcement.

Set-up and Running RL

Clone the repo.
Download the simulator from the following link : https://drive.google.com/open?id=1aG-NwW_NK8FGET_EBApdHBpxSHwG4EkQ
Use the pretrained model from : model.h5
Spin up the behaviour clone nerual network using python3 drive.py model.h5.
Start the simulator and choose Autonomous driving.
Use Left and Right arrow keys to improve the movement of the car to get rewards.

Behind the scenes

The micropayment service API is hosted on AWS with autoscaling feature based on the CPU load. The requests from the business models reach the API service, which validates the operator account, target account and the transaction amount. In future, the idea is to orchestrate via Kubernetes and scale it in enterprise level load.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
RL		RL
ScreenShots		ScreenShots
pay-per-info-frontend		pay-per-info-frontend
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
app.go		app.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CruzHacks2019(Winner) - CrowdLearning

Inspiration

What it does

How we built it

What's next for CrowdLearning

Set-up and Running RL

Behind the scenes

About

Releases

Packages

Contributors 4

Languages

neilmshah/CruzHacks2019

Folders and files

Latest commit

History

Repository files navigation

CruzHacks2019(Winner) - CrowdLearning

Inspiration

What it does

How we built it

What's next for CrowdLearning

Set-up and Running RL

Behind the scenes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages