Sherlock

This project revolves around the implementations of a scraping and sniffing mechanism: locating the position of any CDN's surrogate server(s). We provide a Virtual Machine (Username: sherlock, Password: 1234) in this Google Drive folder (for users with x86-64 architecture, all libraries will be included), but you can follow the Installation Section Guide to eventually install the software manually (for users running on a different architecture, on a Linux Operating System).

If the VM starts correctly head to usage section.

If you can't run the VM follow this guide.

Software installation

If the VM on our drive isn't working for you:

Recommended Operating System

It is recommended, as compatibility, to install OS version 20.X or greater. We suggest installing the following operating system for ease of installation and use:

Ubuntu
Minimum requirements include:
- 2 GHz dual core 64-bit processor;
- 4 GiB RAM (system memory)
- 12 GB of hard-drive space;
- 1024x768 screen resolution;

If you prefer a lighter version to save space, consider:

Xubuntu
Minimum requirements include:
- 1.5 GHz dual core 64-bit processor;
- 1 GiB RAM (system memory)
- 10 GB of hard-drive space;

This is the same operating system used in our provided VM.

Note: You might encounter errors during the installation process, such as:

{user} is not in the sudoers file
{path} is not in $PATH
systemd-resolve command not found

Refer to the troubleshooting section to resolve these, following the steps we've taken.

Browser installation

It is required to have an internet browser installed on the current system (either Chrome, Chromium or Firefox). If no browser is already in your system, it is suggested to install Chromium for its size.

Chromium

sudo apt-get install chromium-browser 
sudo apt-get install chromium-chromedriver

Chrome

wget https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb
sudo dpkg -i google-chrome-stable_current_amd64.deb

Firefox

sudo snap install firefox

VirtualDisplay installation

It is also required X-server video frame buffer:

sudo apt-get install xvfb

How to install all Libraries

Please note that several libraries are required to make the code working:

PyVirtualDisplay
Requests
selenium
tabulate
scapy
pyzmq

Pip & Git are requirements in order to install the packages:

sudo apt-get install python3-pip
sudo apt-get install git

Proceed by cloning this repository and open the appropriate directory:

git clone https://github.com/PaoloGit99/sherlock
cd sherlock

Create and activate a virtual environment called myenv in this directory, using virtualenv. To use the appropriate python version, check the one installed in the system with python3 --version, and replace "X":

sudo apt-get install -y python3-virtualenv

python3 --version
virtualenv --python=3.X myenv
source myenv/bin/activate

To install all libraries in myenv, use the following commands:

pip install -r requirements.txt

Usage

To execute the program, run this instruction inside the directory of the project:

python3 start.py

It is also possible adding a specific provider, by running:

python3 start.py {provider_name}

Where provider_name is one of the following:

bbc
facebook
instagram
tiktok
twitch
twitter
youtube

Each output will be saved inside the "output" directory, and it will be saved as follows:

measure_{provider_name}_hh:mm:ss Weekday dd-mm-yyyy.json

It will be automatically created a .tar archive in sherlock directory containing all results, you can share that archive with us.

Some parameters can be tuned and are located in the init.txt file;

SNIFFER_TIMEOUT, set in seconds as the time to wait before interrupting the execution if packets are no longer received
N_REQUESTS, set as the number of requests for packet loss estimation towards content
TH_BYTES, set as the threshold of received bytes from a cache server to stop the execution
TRACEROUTE_MAXHOPS, set as the maximum number of hops for tracerouting
REQ_TIMEOUT, set as the timeout for web API requests (geolocation, AS info, ...)
SHOW_count_and_dns, set as True to terminal-print DNS and received traffic tables.
SAVE, set as True to write tables of DNS and CountBytes in "./output"

If none are set, default values will be used.

(Optional) To observe the results properly formatted, run the following command:

python3 table.py

If you have executed the code in a virtual environment, you can interrupt it with the command deactivate.

All useful code

sudo apt-get install -y chromium-browser 
sudo apt-get install -y chromium-chromedriver
sudo apt-get install -y xvfb
sudo apt-get install -y git
sudo apt-get install -y python3-pip
sudo apt-get install -y python3-virtualenv

git clone https://github.com/PaoloGit99/sherlock
cd sherlock

python3 --version
virtualenv --python=3.X myenv
source myenv/bin/activate
pip install -r requirements.txt

python3 start.py
python3 start.py {provider_name}
python3 table.py
deactivate

Troubleshooting

Sudoers

In case of "{user} is not in the sudoers file" error:

cd
su root
nano /etc/sudoers

And, in the "# User privilege specification", add after root line:

{username}	ALL=(ALL:ALL)	ALL

Path

In case of "{path} is not in $PATH" warning:

cd
su root
nano ./.bashrc

Then add at the end of the file:

export "/new_path:$PATH"

And reboot

systemd-resolve command not found

In this case:

sudo ln /usr/bin/resolvectl /usr/bin/systemd-resolve

License

This project is under the terms of MIT license - see LICENSE for details.

Contacts

Consoli Flavio @ La Sapienza Università di Roma: [email protected]

Ranieri Paolo @ La Sapienza Università di Roma: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
contents		contents
ext		ext
output		output
plot		plot
.gitignore		.gitignore
20231101.as-rel.txt		20231101.as-rel.txt
LICENSE		LICENSE
README.md		README.md
browser_c.py		browser_c.py
browser_cm.py		browser_cm.py
browser_f.py		browser_f.py
dns.py		dns.py
functions.py		functions.py
init.txt		init.txt
requirements.txt		requirements.txt
ripe.py		ripe.py
sniffer.py		sniffer.py
start.py		start.py
table.py		table.py
tr.py		tr.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sherlock

Software installation

Recommended Operating System

Browser installation

Chromium

Chrome

Firefox

VirtualDisplay installation

How to install all Libraries

Usage

All useful code

Troubleshooting

Sudoers

Path

systemd-resolve command not found

License

Contacts

About

Releases

Packages

Languages

License

netlab-sapienza/sherlock

Folders and files

Latest commit

History

Repository files navigation

Sherlock

Software installation

Recommended Operating System

Browser installation

Chromium

Chrome

Firefox

VirtualDisplay installation

How to install all Libraries

Usage

All useful code

Troubleshooting

Sudoers

Path

systemd-resolve command not found

License

Contacts

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages