Before you sign up this is a paid product that doesn’t show you any information about the company unless you pay $15 per month or something. I feel like this is a bit of a dark pattern to not mention charges before getting people to sign up.
I think one rule around Show HN is that you allow people to see content without signing up, let alone paying for it. So this is a violation of that rule.
Edit: actually search is not behind the paywall (although that's not very obvious)
Also, there's no way to delete your account, or remove your email. Which is not just frustrating -- it's flatly against GDPR, CCPA, etc etc.
I relatively recently was looking for a new job, so I was curious to see how well my own search process compared to this. Then I was mildly annoyed to discover that they require email signup to actually view any details about the company or the job. So I give them my email. And then I was even more disappointed to discover that they require you to pay money to even see the link to a single actual job posting. Sorry, not gonna do that if I'm just trying to scratch the curiosity itch. So then I go to delete my account, and... nothing. No can do.
Honestly one of the quickest turnarounds from "oh, neat" to "jeez, what a disappointment" that I've had in recent times.
I made a point to collect almost no information (only email) from the user (as opposed to Linkedin which asks for all sorts of data to sell) but happy to delete your account if you just email support. I also dislike having my data used or sold. That is not the purpose here, the feature to delete your account is just not yet there, appologies
How do you deal with companies that post jobs on their corporate website, have hired someone months ago, but haven't taken down the listing? I imagine that is most of them.
A potential solution to this is to have this software solution generate emails to recruiting contacts posing as a human, inquiring if the req is still open. Reverse ATS of sorts maintaining light state about job openings.
Good point, I'm not sure the numbers that accounts for this yet. I also have the reverse issue of an employer removing a job posting that has been filled so the scrapers need to visit each listing to see if it is indeed still up (where most listing sites will have an expiration date or the employer will pull it once filled)
I'd like to extend features for job seekers to log steps in the interview process. If users are not getting past the initial application then it would be able to show this feedback for a particular job listing (giving it less credibility)
You can’t really. I was a recruiter working internally at a company, and a hiring manager wanted to keep certain jobs open to collect resumes even though he had no headcount to hire anyone at that time. There was nothing I could do other than object.
If you’re not already, consider scraping and building a list of company workday public instances and pulling job postings from there as well. Should be straightforward to find breadcrumbs in search engines with those URLs, or render permutations for testing based on known domain patterns.
I'm not yet but thank you that is an interesting idea. Currently it is quite dumb and just looks for listings on each company website but in the future it would be nice to connect to workday or other such services
I'm not familiar with workday and new to coding. Is this beneficial compared to scraping company pages because workday will have consistent structure across companies?
Workday (if used, there are many other HRIS systems out there) is traditionally the source of truth as an HRIS (HR information system), is built into the org’s recruiting workflow, and the data has consistent structure. You don’t know where the data is coming from when directly on the company website, who is updating it and on what cadence (if any), etc.
You wouldn’t, so you would have to scrape both and reconcile potentially. But where is the most accurate, current information likely to be sourced from?
You can browse job listings without subscribing to see whether there are enough job listings in your domain to add value to your job search. The "U" tag is assigned to job listings without search engine results for paid platforms (indeed, glassdoor, monster, etc.) and then it is checked daily for changes.
Currently 4 worker services run headless chrome browsers and visit company websites (about 10k URLs per service per day). If certain keywords and phrases match a valid dataset of job listings and titles then the page content is saved and parsed w various tags (locations, technologies, etc.)
Ah yea no LLMs but probably some of the similar ideas were used by having a validating dataset. I just have ~800 common locations (city, state, countries) that are parsed out of the text with validating logic wrapped around each one. Far from perfect but it is now pretty accurate. I found the index of the location matters a lot for accuracy. Its still a WIP
Are you able to add the ability to search either for a keyword or a product? I want to search for Google SecOps SIEM (fka Chronicle) and Google SecOps SOAR (fka Chronicle fka Siemplify). Searching fro cybersecurity engineer unfortunately doesnt really tell me which SIEM/SOAR it is, thus i have no idea if its worth me paying for.
Ah I understand, that is an interesting use case. Currently the only searchable keywords are tagged on the job listings upon creation, but I can see if these keywords come up often enough to make tags for them. I'm not familiar with those Google products. Appreciate the comment
Not sure I understand this. You've basically set up a job board where employers can post listings for free and job seekers have to pay a monthly fee to access them. So what value does one get over visiting Indeed/LinkedIn/Glassdoor etc. which are free for applicants and charge the companies instead?
Employers don't post anything, the job listings are scraped directly from company websites. So even if the employer has not paid for an ad on Indeed/LinkedIn/Glassdoor but they leave the listing up on their company website then the scraper will pick it up (which is a large amount of listings). But yea asking the user to pay for the service is certainly different but I think beneficial because now they are the customer (rather than the employer)
As a user you are now tje customer and the value that you get it's to not have to deal with LinkedIn and similar, and find jobs that are not posted on those platforms.
OP here. I found my last day job using a scraper that visits company websites in search of job listings. Now I've turned that into an app for others to use and access jobs found on company websites (rather than paid employer ads on Indeed or wherever). This gives the job searcher an advantage to find jobs not listed on job search sites and show the company you have taken time and interest to visit their company website. Would certainly love any feedback
Great job! It seems you have around 200k companies to list. How do you handle scraping at that scale – all websites are different. What if the schema and markup change? interested to hear what the DevOps aspect looks like.
Thank you so much. In some cases I was able to standardize where the title and location are located on the page (Greenhouse, Lever, etc.). But this mostly uses a valid dataset of job descriptions that matches lists of phrases for plain text within a page (with markup removed). Also the scraper remembers companies and career pages that have job listings. It will prioritize those companies that have listings and visit more often than those that don't. Currently there are 4 worker services that visit about 10k company websites per day (each).
I didn't build this, but here my guess: Most companies use a handful of ATS's (applicant tracking systems), like
Greenhouse, Lever and Workday. Almost all of the jobs posted on these platforms are public and their top-level pages are indexable.
If I built something like this, I would start by searching for pages that have HTML fragments indicative to those systems a few times per week (since job listings don't change much).
While this won't do anything to reveal "real" ghost jobs (job reqs that are hidden or generic enough to be used for interesting referrals), it's probably a minor edge over LinkedIn Jobs (the home of stale jobs). Many of these companies cross post to those platforms anyway.
It was a slow aggregation over a few years from various resources but I recently came across https://www.thecompaniesapi.com/ and now that is the source for most of the company data
Some feedback. I love that this is open source. I would love it to remain open source, a rising tide lifts all boats sort of thing. But I also want to give money to someone who runs it as a service. Make my problem go away (not me per se, but the ability to buy gift codes and such to help other people who are struggling to find new roles). I am time poor, but am willing to pay to make problems go away. Consider charging to run a platform for users while remaining open source for those willing to run it themselves.
Actually "rust" was too difficult to yet parse out of a job description because it is such a commonly used word but I am in the process of working on this. Very sorry it is not included yet
EDIT: corrected price $$$
reply