Test script to parse useful information from http://legislature.vermont.gov/bill/status
List css selectors to grab html tag containing desired information
All interesting information contained inside of Selector:"div #main-content"
Selector:h1
Parse from URL?
Selector:"h4 .charge"
Inside of Selector:"dl .summary-table"
Inside dd tag after dt tag containing "Location"
Inside of Selector:"dl .summary-table"
Inside dd tag after dt tag containing "Sponsor(s)"
Inside table Selector:"#bill-detailed-status-table"
"FULL STATUS" is 4th column
Grab url, no css selector
H.159
An act relating to abandoned swimming pools
Location:
House Committee on General, Housing and Military Affairs
Sponsor(s):
Rep. Jim Masland
Additional Sponsors
Rep. Timothy Briglin
Read First Time and Referred to the Committee on General, Housing & Military Affairs
http://legislature.vermont.gov/bill/status/2016/H.159
Parsing arguments: https://docs.python.org/2/library/argparse.html
Fetching Webpage: https://docs.python.org/2/library/urllib.html
Parsing HTML: http://www.crummy.com/software/BeautifulSoup/