DS222-Distributed-Logistic-Regression

This repository contains a comparative study implementing Logistic Regression in single machine and a distributed environment. The results are summarized in the Report.pdf and the implementation details are as discusstion below.

Running the codes

The above codes are developed in python 2.7 and requires the following libraries:

numpy==1.15.1
tensorflow==1.10.1
scipy==1.15

Before running any code of the codes run: python data_prep.py

Optionally for ease of experimentation the preprocessed data is already present in the repository in stored in sparse format.

In memory

This code can optionally take a input argument taking one of the three arguments: constant,decay and increase corresponding to the three strategies for varying learning rate.

Run the code: python LogitR_memo.py -lr "decay"

Distributed Tensorflow

For all the three settings namely bulk synchronous, stale-synchrnous and asynchronous 2 parameter server nodes and 2 worker nodes are used. To run any of the above codes go to the respective nodes and run the following commands:

pc1: python LogitR_stsynchro.py --job_name="ps" --task_index=0
pc2: python LogitR_stsynchro.py --job_name="ps" --task_index=1
pc3: python LogitR_stsynchro.py --job_name="worker" --task_index=0
pc4: python LogitR_stsynchro.py --job_name="worker" --task_index=1

Inside the code the ip address of the above nodes need to be specified in the following section:

parameter_servers = ["10.24.1.218:2222","10.24.1.214:2223"]

workers = ["10.24.1.215:2224", "10.24.1.217:2225"]

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LogitR_asynchro.py		LogitR_asynchro.py
LogitR_memo.py		LogitR_memo.py
LogitR_stsynchro.py		LogitR_stsynchro.py
LogitR_synchro.py		LogitR_synchro.py
README.md		README.md
Report.pdf		Report.pdf
data_prep.py		data_prep.py
x_test.npz		x_test.npz
x_train.npz		x_train.npz
y_test.npy		y_test.npy
y_train.npz		y_train.npz

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DS222-Distributed-Logistic-Regression

Running the codes

In memory

Distributed Tensorflow

About

Releases

Packages

Languages

229Swapnil/DS222-Distributed-Logistic-Regression

Folders and files

Latest commit

History

Repository files navigation

DS222-Distributed-Logistic-Regression

Running the codes

In memory

Distributed Tensorflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages