I’m in a situation where I’m looking for jobs local to me with decent pay and benefits. I did searches on all the major job sites with my criteria, sorted how I want, striped out the session ids and saved the links in an auto hotkey script so I can quickly check each site every day.
I still have issues with irrelevant results, such as jobs on towns I have no desire to commute to, companies I would never work for, and jobs with requirements I don’t have, such as nursing.
I’m not really a programmer, but I can tinker around with things to an extent. It would be nice to be able to scrape these sites in order to reduce the irrelevant postings, I just don’t know how to go about it.
I’m very lazy so I’d probably start by looking at filters on those sites, if i really wanted to tackle this with programming, i’d:
see if there’s an api, or rss feed for these sites, if so i’d pull that down with a cron job and do filtering locally with probably regex.
if not i’d scrape the html and pull out the relevant links with whatever the latest html parser is for the language i use (i.e. it used to be beautiful soup for python, but there’s i think a new better one).
but as i said i’m rather lazy, and haven’t been on the prowl for jobs for some time.