Hi all,
The open-source ML and LLM community is experiencing rapid development, and I've been closely following its progress for the past few months. As a frontend developer with over a decade of experience and some backend dabbling, I'm eager to gain practical experience in working with these powerful tools on small side projects to unleash their potential.
My tinkering nature has led me to set up VPS instances, build data scraping tools, and even create complex iOS apps with CoreData in Swift, apart from my professional frontend work (building big FE applications). I'm not afraid to learn new languages like Python or learn to use libraries like Pandas. However, in the realm of ML and LLM, I feel a bit overwhelmed, unsure where to start or if the skills I possess can be effectively applied.
What I'm aiming for is hands-on implementation of this technology in real-world scenarios. For instance, I meticulously categorize my expenses each month and would like to create a model that automatically classifies transactions based on historical data. Can open-source tools help me achieve this, and if so, what should I learn to make it happen?
Another project I'm working on involves a straightforward analytics tool. I envision using NLP in its search interface to guide users seamlessly toward their goals. Imagine users asking questions like "What's my conversion rate for X in June?" and being navigated to a preconfigured page with the right filters. Or posing a query like "How is retention defined for Y?" and receiving a snippet of relevant documentation with a link for further reading.
I'm genuinely curious and eager to hear from all of you on the best path forward. What ML libraries or NLP frameworks would suit my projects? How can I effectively leverage these open-source tools for practical applications? Your insights and advice would be invaluable in helping me take the first steps.
Thank you in advance!
Dan
As for the former classifier you can try doing zero-shot classification between n number of categories + others. Models like Flan-T5/T5/Flan-UL2/DistillBART(also ~7B-40B param LLMs can also do this but would be overkill).