RecurrentGemma: Moving Past Transformers for Efficient Open Language Models [pdf] (storage.googleapis.com)
6 points by alekandreev 54 days ago | 1 comment



Code here: https://github.com/google-deepmind/recurrentgemma

Checkpoints for both the base pre-trained model and an instruction-tuned (IT) version for dialogue are here: https://www.kaggle.com/models/google/recurrentgemma



