Skip to content

Latest commit

 

History

History
20 lines (16 loc) · 983 Bytes

DOCS.md

File metadata and controls

20 lines (16 loc) · 983 Bytes

download.py

download.py downloads and cleans Instagram captions for a specific hashtag

Flags

  • --tag: Hashtage page that you want to scrape for captions exclude the # [Required]
  • --caption-queries: Each query returns ~150 captions (default: 60)
  • --min-likes: Only use captions with >= min_likes (default: 10)

tune_transformer.py

tune_transformer.py train the model and generate captions

Flags

  • --tag: Hashtag page that we have scraped for captions exclude the # [Required]
  • --train: Should we train the model (default: False)
  • --generate: Should we generate captions (default: False)
  • --prompt: Give the model something to start with when generating text 1-5 words will due (default= My\ Day)
  • --max-length: Max length of caption text (default=60)
  • --min-length: Min length of caption text (default=20)
  • --num-captions: Number of captions to generate, some of these captions will be dropped because they are duplicates (default=40)