Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: Raman-01 – A Pocket Physics Solver LLM (huggingface.co)
5 points by Sai_Praneeth 39 days ago | hide | past | favorite
I built a tiny physics solver LLM that performs surprisingly well on easy-to-medium difficulty physics problems. Most LLMs today still struggle with physics QA (as PhyBench recently highlighted), so I wanted to see how far I could push a small model with careful data and minimal compute.

Model: Qwen3-1.7B

Supervised Finetuning: ~1500 curated examples spanning kinematics, EM, acoustics, and more

RL Fine-tuning: GRPO, 1-shot RLVR style (single example, 70 steps)

Total cost: ~$5 on H100

It started with a cold-start SFT (~3 epochs, loss to 0.3), then I ran RL with accuracy reward that climbed from 0.1 → 0.8.

Goal: Create a lightweight physics solver that’s small enough to deploy anywhere—think of it as a "pocket tutor" for foundational physics.

Still working on evaluations—most benchmarks focus on very hard problems, while I want something that evaluates basic correctness, reasoning, and unit sense on easy/medium problems. If anyone has suggestions, I’d love to hear them.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: