duncan.parkes
af6aea014f
This is a big refactor of the scrapers.
The database table which says which scrapers are where will now be filled in automatically,
which should mean far fewer manual editing errors.
I've also redone the Python PublicAccess scraper and set all the PublicAccess sites to use
it (removing the PHP PublicAccess scrapers).
15 years ago
duncan.parkes
7a5a50ed58
Update the PublicAccess scraper to work with BeautifulSoup.
Add all PublicAccess sites to the Python scraper.
16 years ago
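The PublicAccess update above moved the shared scraper onto BeautifulSoup. As a rough illustration only (not the repository's actual code), a BeautifulSoup 3 parse of a hypothetical PublicAccess results page might look like the sketch below; the row class, field names, and markup are assumptions.

from BeautifulSoup import BeautifulSoup  # BeautifulSoup 3, as used at the time

def parse_search_results(html):
    # Pull application references and info links out of a hypothetical results table.
    soup = BeautifulSoup(html)
    applications = []
    for row in soup.findAll("tr", {"class": "searchresult"}):
        link = row.find("a")
        if link is None or not link.string:
            continue
        applications.append({"reference": link.string.strip(),
                             "info_url": link["href"]})
    return applications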
duncan.parkes
1bc23b0b9c
Fix Lichfield scraper.
16 years ago
duncan.parkes
cb6ab6e894
Fix Mole Valley scraper.
16 years ago
duncan.parkes
4ea9836f1e
Fix Tamworth scraper (change of IP address).
16 years ago
duncan.parkes
f37d02d3de
Add Thomas' scraper for Solihull.
16 years ago
duncan.parkes
70a94ec280
Add scraper for Weymouth and Portland.
16 years ago
duncan.parkes
887abe9652
Add scraper for Mendip.
Make the display method on a PlanningApplication work out the postcode if it isn't set.
16 years ago
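The postcode fallback mentioned in the Mendip commit could be done along these lines; a minimal sketch, assuming attribute names (postcode, address) that may not match the real PlanningApplication class.

import re

# Loose UK postcode pattern, for illustration only.
POSTCODE_RE = re.compile(r"[A-Z]{1,2}[0-9][0-9A-Z]?\s*[0-9][A-Z]{2}", re.I)

def display_postcode(application):
    # Prefer the stored postcode; otherwise try to find one in the address text.
    if getattr(application, "postcode", None):
        return application.postcode
    match = POSTCODE_RE.search(application.address or "")
    return match.group(0) if match else ""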
duncan.parkes
77e9d3388f
Add scraper for Broxtowe.
16 years ago
duncan.parkes
7810f01d83
Add scraper for Calderdale.
16 years ago
duncan.parkes
689474a703
Add scraper for the Cairngorms National Park.
16 years ago
duncan.parkes
70c0650637
Add scraper for Leicestershire County Council.
16 years ago
duncan.parkes
f076ecc304
Add scraper for Lichfield. Remove another unused import.
16 years ago
duncan.parkes
f0a0912836
Add parser for Kirklees. Get rid of some unnecessary imports.
16 years ago
duncan.parkes
1761fa79b8
Add a Python parser for West Dorset, and remove the non-working Perl one.
16 years ago
duncan.parkes
ddc81f06ea
Add scraper for Gosport.
Factor out CookieAddingHTTPRedirectHandler.
16 years ago
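The CookieAddingHTTPRedirectHandler factored out above is not shown in this log, but a handler of that name would plausibly look something like this sketch: a urllib2 redirect handler that copies a cookie onto the redirected request, so cookie-protected pages survive a redirect. The cookie string and URL below are placeholders, not the real Gosport details.

import urllib2

class CookieAddingHTTPRedirectHandler(urllib2.HTTPRedirectHandler):
    # Sketch only: carries a fixed Cookie header across redirects.
    def __init__(self, cookie_string):
        self.cookie_string = cookie_string

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        new_req = urllib2.HTTPRedirectHandler.redirect_request(
            self, req, fp, code, msg, headers, newurl)
        if new_req is not None:
            new_req.add_header("Cookie", self.cookie_string)
        return new_req

# Example usage (placeholder values):
# opener = urllib2.build_opener(CookieAddingHTTPRedirectHandler("SESSIONID=abc123"))
# html = opener.open("http://planning.example.gov.uk/search").read()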
duncan.parkes
a4f3ce9dac
Fix Dorset County Council scraper (the council now seem to be using an IP address rather than the domain they had before).
16 years ago
duncan.parkes
2c979a07f5
Fix the Dacorum Perl parser.
16 years ago
duncan.parkes
748f3b30b5
Add Caerphilly.
16 years ago
duncan.parkes
49a32a74ca
Change some PlanningExplorer scrapers to use date_registered rather than date_received.
16 years ago
peter@peter.uk.to
42bd542634
Highland, North Ayrshire, Redbridge: updated to reflect changes made to their planning websites.
16 years ago
duncan.parkes
a20a53535b
Add Waltham Forest.
16 years ago
duncan.parkes
8e40e8a961
Fixes for Lincoln and Crewe.
Make Hackney use date registered rather than date received.
16 years ago
duncan.parkes
6cf496dfb9
Add scraper for Eastbourne. The info and comment links won't work since they require you to have a cookie. If you go
back to them once you have the cookie, you're fine...
16 years ago
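The Eastbourne note above describes info and comment links that only work once the site's cookie is already held. The usual workaround is to share a cookie jar across requests and visit the search page first; a minimal sketch with placeholder URLs:

import cookielib
import urllib2

jar = cookielib.CookieJar()
opener = urllib2.build_opener(urllib2.HTTPCookieProcessor(jar))

# The first request picks up the session cookie...
opener.open("http://planning.example.gov.uk/search").read()
# ...so a cookie-protected info page works on the second visit.
info_html = opener.open("http://planning.example.gov.uk/info?id=123").read()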
duncan.parkes
d030ce81db
Add scraper for Exmoor. Fix name of Herefordshire.
16 years ago
duncan.parkes
48ec82b485
Add scraper for Herefordshire.
Alter PlanningUtils to CDATA everything, scrapping the xmlquote function.
16 years ago
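The "CDATA everything" change above replaces entity escaping (the old xmlquote helper) with CDATA sections. A sketch of the idea, with an illustrative element name; the only wrinkle is that a CDATA section cannot itself contain "]]>".

def cdata(value):
    # Split any "]]>" across two CDATA sections so the output stays well formed.
    return "<![CDATA[%s]]>" % value.replace("]]>", "]]]]><![CDATA[>")

print "<address>%s</address>" % cdata('1 High Street <Flat "A">')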
duncan.parkes
2bacbbb25a
Add scraper for Hastings. Sadly, no decent info URLs again. Had to use the search page. The real info URL is only
accessible with a Referer header.
16 years ago
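The Hastings entry notes that the real info URL only responds when a Referer header is sent. Doing that with urllib2 is a one-line addition to the request; both URLs below are placeholders.

import urllib2

request = urllib2.Request("http://planning.example.gov.uk/detail?id=123")
request.add_header("Referer", "http://planning.example.gov.uk/search")
html = urllib2.urlopen(request).read()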
duncan.parkes
77b46a033d
Add Hampshire scraper.
16 years ago
duncan.parkes
420356966c
Add scraper for Halton.
Also add the pycurl scraper for Westminster, just in case it is useful to remind us how to do this later.
16 years ago
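For reference alongside the Westminster experiments below, a bare-bones pycurl fetch looks roughly like this; the URL is a placeholder and this is not the scraper that was committed.

import pycurl
from StringIO import StringIO

buf = StringIO()
curl = pycurl.Curl()
curl.setopt(pycurl.URL, "http://planning.example.gov.uk/search")
curl.setopt(pycurl.WRITEFUNCTION, buf.write)
curl.setopt(pycurl.FOLLOWLOCATION, 1)
curl.perform()
curl.close()
html = buf.getvalue()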
duncan.parkes
7f7c8a00bc
Go back to the urllib2 version of Westminster. This works on disruptiveproactivity.
16 years ago
duncan.parkes
d303944e39
Add the debug back in.
16 years ago
duncan.parkes
7410196fdd
Try a pycurl version of the Westminster scraper.
16 years ago
duncan.parkes
ef6d27ee0a
Fix Lewisham comments email address.
16 years ago
duncan.parkes
28aaf2eba5
Oops - printing the Sutton results twice...
16 years ago
duncan.parkes
ec5b631342
Add more debug.
16 years ago
duncan.parkes
775b7f8cbc
Try moving prints to above the scrape (for the Haringey and Westminster problem).
16 years ago
duncan.parkes
f12fa60f29
Add newlines to the debug stuff in Westminster.
16 years ago
duncan.parkes
3cc4d48397
Some debug (mostly for Westminster).
16 years ago
duncan.parkes
827f6a3c53
Carlisle URL changed.
16 years ago
duncan.parkes
cc961b2bce
Daventry have replaced their nice URL with an IP address...
16 years ago
duncan.parkes
797cedf1d3
Try declaring the charset as UTF-8.
16 years ago
duncan.parkes
0e21adea7e
Fix Sutton parser.
16 years ago
duncan.parkes
fa73ab577a
Add scraper for Westminster.
16 years ago
duncan.parkes
7b5165b8bf
Adding scraper for Harrow.
The info URL situation here is not really good enough.
All we get is a page with the last seven days' applications on it and no info URLs.
I'm using that page as the info URL for the moment, but it will obviously
be no use after seven days...
16 years ago
duncan.parkes
da2be2c394
Add scraper for Hounslow.
16 years ago
duncan.parkes
3c85f0d0dd
Add scraper for Kingston upon Thames.
16 years ago
duncan.parkes
08e63c7566
Add scraper for Birmingham.
16 years ago
duncan.parkes
06a293dc26
Add Berwick scraper.
16 years ago
duncan.parkes
fcd3543d40
Add Carmarthenshire scraper.
16 years ago
duncan.parkes
af50d991f3
Add scraper for Brent.
16 years ago