Elicit (https://elicit.com/careers) | Oakland, CA and Hybrid | Frontend, AI, and Full-stack software engineering roles.
Elicit is automating high-quality reasoning so that we can help the world make more breakthroughs in every domain: from climate change to the gut microbiome to longevity and economic policy.
We’ve scaled to over 200,000 monthly users purely by word of mouth and recently crossed $1.5MM in annual revenue, 7 months after launching subscriptions.
We’re now building out our software engineering team, and hiring across several technical roles.
Our main focus is actually a little different from SciSummary's. We're focussed on understanding researchers' broader workflows and providing a research assistant, rather than a narrow tool for summarisation or search.
The workflows we're most excited about at the moment are literature and systematic reviews: we think we can make these orders of magnitude faster and higher quality.
(Elicit, the company and app, was spun out of Ought, the research lab.)
We do joke internally about the homophone (Elicit/illicit). In fact, IIRC we played a little joke on our CEO by rebranding for his birthday in 2022. But I'm sorry to report that we're all careful, ethical, and well-behaved people :(
This is a good point! (Hopefully) obviously, if we knew a particular claim was fishy, we wouldn't make it in the app in the first place.
However, we do do a couple of things which go towards addressing your concern:
1. We can be more or less confident in the answers we're giving in the app, and if that confidence dips below a threshold we mark that cell in the results table with a red warning icon, which encourages caution and user verification. This confidence level isn't perfectly calibrated, of course, but we are trying to engender a healthy, active wariness in our users so that they don't take Elicit results as gospel.
2. We provide sources for all of the claims made in the app. You can see these by clicking on any cell in the results table. We encourage users to check (or at least spot-check) the results they're updating on. This verification is generally much faster than generating the answer was in the first place. (There's a rough sketch of both mechanisms after this list.)
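To make the shape of this concrete, here's a minimal sketch in TypeScript. To be clear, this isn't our actual code: the types, the `needsWarning`/`sourcesFor` names, and the threshold value are all illustrative.

```typescript
// Hypothetical sketch of per-cell confidence flagging; not Elicit's actual code.
interface Source {
  paperTitle: string;
  url: string;
  quote: string; // the supporting passage the claim was drawn from
}

interface Cell {
  answer: string;
  confidence: number; // 0..1, model-derived and imperfectly calibrated
  sources: Source[];
}

const CONFIDENCE_THRESHOLD = 0.7; // illustrative value, not our real threshold

// Decide whether to render the red warning icon next to a result cell.
function needsWarning(cell: Cell): boolean {
  return cell.confidence < CONFIDENCE_THRESHOLD;
}

// Clicking a cell surfaces its sources so users can spot-check the claim.
function sourcesFor(cell: Cell): Source[] {
  return cell.sources;
}
```

The design point is just that the confidence score and the sources travel with every cell, so the UI can prompt verification exactly where it's needed.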
I'm not sure I agree that those rule-of-thumb statistics are "arbitrary" or "fictional"… I guess it depends on what you mean by that. I can say that on our part they're a good faith attempt to help users calibrate how best to use the tool, using evaluations of Elicit based on real usage.
Definitely accept that the tool can work better or worse depending on your domain or workflow though!
One way we do try to distinguish ourselves from vanilla LLMs is that we provide sources for all of the claims made. I mention this because we hope our users can approach the falsification process you mention for Google: we want to show people where particular claims come from so that we earn their trust.
Walking citation trails and verifying transitive claims is something we've talked about but need more people to implement! (https://elicit.com/careers)
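If you're curious what that might involve, here's a rough sketch of the simplest version. It's entirely hypothetical (this isn't a shipped feature), and the in-memory map stands in for a real citation-graph or metadata API:

```typescript
// Hypothetical breadth-first walk over a citation graph; not a shipped Elicit feature.
type PaperId = string;

// In-memory stand-in for a citation lookup (a real version would hit a metadata API).
const referencesOf = new Map<PaperId, PaperId[]>([
  ["paper-A", ["paper-B", "paper-C"]],
  ["paper-B", ["paper-D"]],
]);

// Collect every paper reachable within `maxDepth` citation hops of the root,
// so a transitive claim can be traced back toward its original source.
function walkCitations(root: PaperId, maxDepth: number): Set<PaperId> {
  const seen = new Set<PaperId>([root]);
  let frontier: PaperId[] = [root];
  for (let depth = 0; depth < maxDepth && frontier.length > 0; depth++) {
    const next: PaperId[] = [];
    for (const id of frontier) {
      for (const ref of referencesOf.get(id) ?? []) {
        if (!seen.has(ref)) {
          seen.add(ref);
          next.push(ref);
        }
      }
    }
    frontier = next;
  }
  return seen;
}

// walkCitations("paper-A", 2) → Set {"paper-A", "paper-B", "paper-C", "paper-D"}
```

A real version would also need to verify, at each hop, that the citing paper actually supports the claim it attributes to its source, which is the genuinely hard part.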
Accuracy and supportedness of the claims made in Elicit are two of the most central things we focus on—it's a shame it didn't work as well as we'd like in this case.
I'd appreciate knowing more about the specifics so we can understand and improve.
Our work splits into two core problems:
1. Finding papers / claims / data across an academic literature which is ballooning in size.
2. Using these raw materials to answer questions in a reliable manner.
#2 is where the bulk of the tricky ML work is, and where vanilla language models often fall short because of limited context windows and hallucination.
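For readers unfamiliar with the general pattern here, this is a generic sketch of retrieval-grounded answering. It illustrates the technique, not Elicit's actual pipeline, and `searchPassages`/`askModel` are hypothetical stubs:

```typescript
// Generic retrieval-augmented answering sketch; hypothetical, not Elicit's pipeline.
interface Passage {
  paperUrl: string;
  text: string;
}

// Hypothetical stub for a search index over paper full texts.
async function searchPassages(question: string): Promise<Passage[]> {
  return []; // a real version would query a paper index
}

// Hypothetical stub for a language-model call.
async function askModel(prompt: string): Promise<string> {
  return ""; // a real version would call an LLM
}

// Answer only from retrieved excerpts, and return those excerpts alongside
// the answer so users can verify claims instead of trusting free generation.
async function answerWithSources(question: string) {
  const passages = await searchPassages(question);
  const context = passages.map((p, i) => `[${i + 1}] ${p.text}`).join("\n");
  const answer = await askModel(
    `Answer using only the numbered excerpts below, citing them by number.\n\n${context}\n\nQ: ${question}`
  );
  return { answer, sources: passages.map((p) => p.paperUrl) };
}
```

Grounding the model in retrieved excerpts, and returning those excerpts with the answer, is the standard response to the context-window and hallucination problems above; the hard part is doing it reliably at the scale of a full literature review.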
We're also working to expand Elicit to help academics with other parts of their research, like surfacing critiques, suggesting related prior work, brainstorming related research questions, identifying risks of bias, …
Elicit is an AI research assistant that uses language models to help researchers figure out what’s true and make better decisions. We've scaled to >$2MM annual revenue and 400k MAU with our small team. We're hiring for multiple roles and are primarily interested in people with early-stage company experience, who are comfortable in high-agency, fast-paced teams.
Front-end engineer: https://elicit.com/careers?ashby_jid=b5e218b8-8730-4254-b026...
Machine learning engineer: https://elicit.com/careers?ashby_jid=913a03d5-bd26-4c64-8346...
Data engineer: https://elicit.com/careers?ashby_jid=4617f630-f971-4716-b753...