Skip to content

Latest commit

 

History

History
41 lines (25 loc) · 1.09 KB

File metadata and controls

41 lines (25 loc) · 1.09 KB

Tweet Collection (Search)

Setup

Setup your twitter credentials, sendgrid credentials, and demonstrate your ability to connect, as described in the README.

Also choose a database (sqlite or bigquery), and if bigquery, run migrations and test your ability to connect, as described in the README.

Migrating tables:

DATASET_ADDRESS="YOUR_PROJECT.YOUR_DATASET" python -m app.tweet_collection.bq_migrations

Usage

Collect tweets:

python -m app.tweet_collection.job

# pass custom params:
START_DATE="2022-07-01" END_DATE="2022-07-01" QUERY="lang:en #january6thcommittee" PAGE_LIMIT=3 python -m app.tweet_collection.job

# store to bigquery:
# DATASET_ADDRESS="YOUR_PROJECT.YOUR_DATASET"  ...
STORAGE_MODE="bq" START_DATE="2022-07-01" END_DATE="2022-07-01" QUERY="lang:en #january6thcommittee" PAGE_LIMIT=3 python -m app.tweet_collection.job

Collect domains and entities:

# python -m app.tweet_collection.twitterdev

python -m app.tweet_collection.db_seeds

# DATASET_ADDRESS="YOUR_PROJECT.YOUR_DATASET" ...
python -m app.tweet_collection.bq_seeds