GabrieleDettori/Newspaper-articles-extractor

Newspaper Articles Extractor

This notebook automatically extracts more than 90,000 web articles from ilfattoquodiano.it and larepubblica.it. The script extracts each article's content and metadata and stores them in a database.

Tools used:

  • xml.etree.ElementTree
  • pandas
  • BeautifulSoup
  • urlopen
  • requests
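
The README does not say how the article URLs are discovered. One common approach that fits the use of xml.etree.ElementTree is to read a site's XML sitemap and collect the article links it lists. The following is a minimal sketch under that assumption, not the notebook's actual code: the function name `extract_article_urls`, the record fields, and the example.com URLs are all hypothetical, and the sample XML is inlined so the block runs without network access.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol (assumed to be what the sites use).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def extract_article_urls(sitemap_xml):
    """Return one dict per <url> entry with its location and last-modified date."""
    root = ET.fromstring(sitemap_xml)
    records = []
    for url in root.findall("sm:url", SITEMAP_NS):
        loc = url.findtext("sm:loc", default="", namespaces=SITEMAP_NS).strip()
        # <lastmod> is optional in the protocol, so it may be None.
        lastmod = url.findtext("sm:lastmod", default=None, namespaces=SITEMAP_NS)
        if loc:
            records.append({"url": loc, "lastmod": lastmod})
    return records


# Hypothetical sitemap fragment standing in for a real download.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/2020/01/01/first-article/</loc>
    <lastmod>2020-01-01</lastmod>
  </url>
  <url>
    <loc>https://example.com/2020/01/02/second-article/</loc>
  </url>
</urlset>"""

for record in extract_article_urls(SAMPLE_SITEMAP):
    print(record["url"], record["lastmod"])
```

In a full pipeline, each collected URL would then be fetched (for example with requests), parsed with BeautifulSoup to pull out the text and metadata, and appended to a pandas DataFrame; those steps are left out here to keep the sketch self-contained.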
