## Local Build (CMake)
source $HOME/.bash_profile
mkdir -p cmake-build-debug
cd cmake-build-debug
cmake -DOPENSSL_ROOT_DIR=/opt/homebrew/opt/openssl@3 ..
make
## Create Network
docker network create --subnet=172.32.0.0/16 raft_network
## Build
# For Mac (local)
docker build -t cbdp_raft_lb --target loadBalancer .
docker build -t cbdp_raft_node --target raft .
docker build -t cbdp_raft_client --target client .
# For Linux
docker buildx build --platform linux/amd64 -t cbdp_raft_lb --target loadBalancer .
docker buildx build --platform linux/amd64 -t cbdp_raft_node --target raft .
docker buildx build --platform linux/amd64 -t cbdp_raft_client --target client .
## Create Volumes
docker volume create cbdp-volume-0
docker volume create cbdp-volume-1
docker volume create cbdp-volume-2
docker volume create cbdp-volume-3
## Start Containers
# system components
docker run -d --network=raft_network --ip 172.32.0.20 --name=loadBalancer cbdp_raft_lb
docker run -d --network=raft_network --ip 172.32.0.21 --mount source=cbdp-volume-0,target=/cbdp/cbdp --name=raft-0 -e RAFT_NODE_NUMBER='0' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.22 --name=raft-1 -e RAFT_NODE_NUMBER='1' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.23 --name=raft-2 -e RAFT_NODE_NUMBER='2' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.24 --name=raft-3 -e RAFT_NODE_NUMBER='3' cbdp_raft_node
# -v --> bind-mount a local folder into the container
# -v <local folder path>:<container folder path>
# Initialize: run the nodes with host folders bind-mounted as their data directories (adjust the local paths)
docker run -d --network=raft_network --ip 172.32.0.21 -v /Users/egekocabas/Desktop/data/data-0:/data --name=raft-0 -e RAFT_NODE_NUMBER='0' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.22 -v /Users/egekocabas/Desktop/data/data-1:/data --name=raft-1 -e RAFT_NODE_NUMBER='1' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.23 -v /Users/egekocabas/Desktop/data/data-2:/data --name=raft-2 -e RAFT_NODE_NUMBER='2' cbdp_raft_node
docker run -d --network=raft_network --ip 172.32.0.24 -v /Users/egekocabas/Desktop/data/data-3:/data --name=raft-3 -e RAFT_NODE_NUMBER='3' cbdp_raft_node
# run client
docker run -d --network=raft_network --ip 172.32.0.25 --name=client -it cbdp_raft_client /bin/bash
# attach to a running container (replace the hash with your own container ID, or use the container name)
docker exec -it a70e3e021c87cd09f2447a84a1e5e7ee8fc5ea0c05a1dbd42293fc361d92b539 /bin/bash
## Prerequisites
touch $HOME/.bash_profile
echo "export PROTOC_INSTALL_DIR=$HOME/.local" >> $HOME/.bash_profile
echo "export gRPC_CPP_PLUGIN_EXECUTABLE=$PROTOC_INSTALL_DIR/bin/grpc_cpp_plugin" >> $HOME/.bash_profile
echo "export PATH="$PROTOC_INSTALL_DIR/bin:$PATH"" >> $HOME/.bash_profile
# Additional CMake flags for the protobuf/gRPC configuration:
-DgRPC_PROTOBUF_PROVIDER=package -DgRPC_PROTOBUF_PACKAGE_TYPE=CONFIG -DCMAKE_FIND_PACKAGE_PREFER_CONFIG=TRUE
# Example value for the plugin path (adjust to your own install location):
gRPC_CPP_PLUGIN_EXECUTABLE=/Users/egekocabas/.local/bin/grpc_cpp_plugin
## Assignment
The goal of this project is to implement the storage backend for a transactional URL shortener service (cf. bit.ly). The backend should be fault-tolerant and ensure that URLs produce a consistent short value.
To keep the mappings between URL and short id consistent, implement the Raft consensus protocol between the storage nodes (see Lecture 5). Our workload is a simple key-value store: we transactionally insert a new mapping if it does not already exist. For each insert, ship the insert logs to all replicas to ensure a consistent state. Replicas should thus also be able to answer read-only lookups from a short id to the full URL. Remember to keep these lookups efficient by using an index structure.
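As a rough illustration (an assumption about the design, not a prescribed implementation), the replicated state machine can be a hash map from short id to URL that doubles as the lookup index: the leader appends insert commands to the Raft log, every replica applies committed entries, and any replica answers lookups directly from the map.

```cpp
// Minimal sketch (assumption, not the reference design): the replicated state
// machine is a key-value map from short id to URL, which also serves as the
// lookup index.
#include <optional>
#include <string>
#include <unordered_map>

struct InsertCommand {
    std::string shortId;
    std::string url;
};

class UrlStore {
public:
    // Apply a committed log entry. Returns false if the mapping already
    // exists, preserving the "insert if not exists" transaction semantics.
    bool apply(const InsertCommand& cmd) {
        return index_.emplace(cmd.shortId, cmd.url).second;
    }

    // Read-only lookup served by any replica; the hash map keeps lookups
    // O(1) on average.
    std::optional<std::string> lookup(const std::string& shortId) const {
        auto it = index_.find(shortId);
        if (it == index_.end()) return std::nullopt;
        return it->second;
    }

private:
    std::unordered_map<std::string, std::string> index_;
};
```

Because apply reports whether the mapping was newly inserted, replaying the same committed entries after a restart yields the same state on every replica.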
For the workload, you can use the same CSV files that we used in the last assignments. You can also use the larger ClickBench hits dataset.
Running and deploying your project should be similar to Assignment 4. In the containerized environment, be aware that the local container filesystem is ephemeral: when a container shuts down (or crashes), its local files are not preserved. However, we want our shortened URLs to stay persistent, even if we restart all nodes.
For a local Docker setup, you can use volumes:
docker volume create cbdp-volume
docker run --mount source=cbdp-volume,target=/space ...
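A minimal sketch of how a node might use the mounted volume, assuming committed entries are appended to a tab-separated file at /space/raft.log (both the path and the format are illustrative assumptions): entries are flushed on append, and the file is replayed on startup to rebuild the in-memory state.

```cpp
// Hedged sketch: persist committed log entries by appending them to a file
// under the mounted volume and replay the file after a restart.
// The path /space/raft.log and the tab-separated format are assumptions.
#include <fstream>
#include <sstream>
#include <string>
#include <vector>

struct LogEntry {
    std::string shortId;
    std::string url;
};

// Append one committed entry; flush so the data survives a container crash.
void persistEntry(const LogEntry& e, const std::string& path = "/space/raft.log") {
    std::ofstream out(path, std::ios::app);
    out << e.shortId << '\t' << e.url << '\n';
    out.flush();
}

// Rebuild the in-memory state on startup by replaying the persisted log.
std::vector<LogEntry> replayLog(const std::string& path = "/space/raft.log") {
    std::vector<LogEntry> entries;
    std::ifstream in(path);
    std::string line;
    while (std::getline(in, line)) {
        std::istringstream ls(line);
        LogEntry e;
        if (std::getline(ls, e.shortId, '\t') && std::getline(ls, e.url)) {
            entries.push_back(e);
        }
    }
    return entries;
}
```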
In Azure Container Instances, Azure file shares have similar semantics:
az storage share create --name cbdp-volume ...
az container create --azure-file-volume-share-name cbdp-volume \
--azure-file-volume-mount-path /space ...
The default configuration in Azure only allocates 1 GB of memory to your instances. If your implementation needs more memory for the workload, you can increase the allocation when creating an instance. E.g., you can create an instance with 4 GB:
az container create --memory 4 ...
## Open Questions
- When will we learn whether the old assignments passed or not?
- Can followers receive write requests and forward them to the leader?
- How does initialization work?
- Where do we keep the current leader information and the node information?
- Is the number of nodes pre-determined? (Can a new node join an already running system?)
- How does the end user communicate with the system? Do we need to implement a command-line interface?
- Does the leader wait for write input from the end user?
- Do the followers also wait for reads from the end user?
- Do we need threads to wait for inputs while updating the logs?
- Do we need to process the provided dataset immediately when the program starts, or do we wait for input data from the user?
- Does the URL shortener only need to return a unique ID?
## Report Notes
- Describe the design of your system -> to do!
- How does your cluster handle leader crashes? -> Raft handles this (a new leader election is triggered)
- How long does it take to elect a new leader? -> depends on the election timeout, the implementation, and the network
- Measure the impact of election timeouts. Investigate what happens when it gets too short / too long. -> too short = too many leader elections (even without a leader crash); too long = detecting a crashed leader takes longer, which increases write latency (see the timeout sketch after this list)
- Analyze the load of your nodes:
  - How many resources do your nodes use? -> to be measured
  - Where do inserts create most load? -> the leader node (all writes go through the leader)
  - Do lookups distribute the load evenly among replicas? -> yes
- How many nodes should you use for this system? What are their roles? -> one leader and multiple followers
- Measure the latency to generate a new short URL -> to be measured
- Analyze where your system spends time during this operation -> waiting for acknowledgements from the followers takes the most time
- Measure the lookup latency to get the URL from a short id -> a direct read from the in-memory map
- How does your system scale? -> a new node learns the log from the existing nodes
- Measure the latency with increased data inserted, e.g., in 10% increments of inserted short URLs -> should increase, since we wait for acks from the followers
- Measure the system performance with more nodes -> to be measured
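For the election timeout measurements above, a hedged sketch of a randomized timeout is shown below; the 150-300 ms bounds follow the original Raft paper and are an assumption, not a prescribed setting.

```cpp
// Sketch of a randomized election timeout (assumption: bounds are tunable,
// 150-300 ms here as in the original Raft paper). Randomization reduces
// split votes; too small a lower bound triggers elections even when the
// leader is alive, too large a bound delays detecting a crashed leader.
#include <chrono>
#include <random>

std::chrono::milliseconds randomElectionTimeout(
        std::chrono::milliseconds lo = std::chrono::milliseconds(150),
        std::chrono::milliseconds hi = std::chrono::milliseconds(300)) {
    static thread_local std::mt19937 rng{std::random_device{}()};
    std::uniform_int_distribution<long long> dist(lo.count(), hi.count());
    return std::chrono::milliseconds(dist(rng));
}
```

Each follower would pick a fresh random timeout whenever it hears from the leader; the lower bound controls how quickly a crashed leader is detected, while the spread between the bounds reduces the chance of split votes.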