Travel Trends in Instagram

os 환경 : window 10 pro

Set up

Install Requirements
```
pip install -r requirements.txt
```

Instargram Crawling & Scrapping

Chrome version check

chrome 실행 후 chrome://settings/help 접속하여 version 확인
ChromeDriver Download

https://chromedriver.chromium.org/downloads 접속 후 자신의 chrome version과 맞는 것을 찾아 download

해당 경로에 위치

.\jeju-travel-trend-dashboard\etc\chromedriver.exe

Enter email & password

insta_crawling.py

def main(keyword_list):
    email = '*********'
    password = '*********'

    insta = InstaKeywordCrawling(email, passwokrd)
    insta.login()
    for keyword in list(keyword_list.split(',')):
        insta.new_crawling(keyword)
        insta.save_result()
    insta.driver.quit()

Run script
```
bash ./crawling.sh
```

kakao API setting

kakao API를 사용하기 위해 개발자 등록을 해야합니다.

utils.py 의 get_lat_and_lon 함수안에서 headers의 Authorization의 value를 수정해야 합니다.

Get API key
1. https://developers.kakao.com/ 접속
2. 로그인 후 개발자 등록, 앱 생성
3. REST API Key 복사

Enter REST API Key

utils.py

def get_lat_and_lon(place):
    insta_places_info = pd.read_csv('./etc/insta_places_info.csv')
    insta_places_info['insta_names'] = insta_places_info['insta_names'].apply(lambda x:list(map(lambda x:re.sub(r"[']", "", x).strip(), x[1:-1].split(','))))
    searched_places = set(insta_places_info['insta_names'].sum())

    if place.strip() in searched_places:
        name, lat, lon = insta_places_info[insta_places_info['insta_names'].apply(lambda x:place.strip() in x)][['name', 'lat', 'lon']].iloc[0]
        return [name, lat, lon, place.strip()]

    url = f'https://dapi.kakao.com/v2/local/search/keyword.json?query=제주도 {place}'
    headers = {"Authorization":"******"}

    places = requests.get(url, headers=headers).json()['documents']
    place_loc = places[0]
    name = place_loc['place_name']
    lon, lat = place_loc['x'], place_loc['y']
    
    return [name, lat, lon, place]

Elasticsearch setting

Elasticsearch download
1. https://www.elastic.co/downloads/elasticsearch 접속
2. window 압축 파일 다운로드 (version 확인)
3. 압축 해제

directory setting

jeju-travel-trend-dashboard
├── data
│   └── all.csv
├── elasticsearch-"version"
├── etc
│   ├── chromedriver.exe
│   ├── insta_places_info.csv
│   └── stoptags
├── crawling.sh
├── insta_crawling.py
├── README.md
├── travel_trends.py
├── utils.py
└── requirements.txt

Enter path

elasticsearch-"version"/config/elasticsearch.yml

# ----------------------------------- Paths ------------------------------------
#
# Path to directory where to store the data (separate multiple locations by comma):
#
path.data: C:\Users\user\Desktop\jeju-travel-trend-dashboard\elasticsearch-"version"\logs\data
#
# Path to log files:
#
path.logs: C:\Users\user\Desktop\jeju-travel-trend-dashboard\elasticsearch-"version"\logs
#

Enter port

elasticsearch-"version"/config/elasticsearch.yml

# ---------------------------------- Network -----------------------------------
#
# By default Elasticsearch is only accessible on localhost. Set a different
# address here to expose this node on the network:
#
#network.host: 192.168.0.1
#
# By default Elasticsearch listens for HTTP traffic on the first free port it
# finds starting at 9200. Set a specific HTTP port here:
#
http.port: 9200
#
# For more information, consult the network module documentation.
#

Run elasticsearch

./elasticsearch-"version"/bin/elasticsearch.bat

Dashboard
- Run Dashboard
```
streamlit run travel_trends.py
```
- address : http://localhost:8501/

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Travel Trends in Instagram

Set up

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
etc		etc
README.md		README.md
Travel Trends.pdf		Travel Trends.pdf
crawling.sh		crawling.sh
ezgif.com-gif-maker.gif		ezgif.com-gif-maker.gif
insta_crawling.py		insta_crawling.py
requirements.txt		requirements.txt
travel_trends.py		travel_trends.py
utils.py		utils.py

PJHgh/jeju-travel-trend-dashboard

Folders and files

Latest commit

History

Repository files navigation

Travel Trends in Instagram

Set up

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages