LocalLLaMA

20 readers

1 users here now

Community to discuss about LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

founded 1 year ago

MODERATORS

SkySyrup@sh.itjust.works

pax@sh.itjust.works

OpenLM Research Releases FOSS 3B and 7B Models Trained on 1T RedPajama Tokens; 13B Model Coming Soon (github.com)

submitted 1 year ago* (last edited 1 year ago) by library_patron@lemmy.blahaj.zone to c/localllama@sh.itjust.works

4 comments fedilink hide all child comments

From the latest commits:

We are happy to release our final 1T token version of OpenLLaMA 3B and 7B. We’ve updated the evaluation results. We are also happy to release a 600B token preview of the 13B model, trained in collaboration with Stability AI.

Haven't tried it yet, and the 13B model is still in the works, but hopefully this will be a better foundation than the leaked Meta AI model, not only for more reproducible research, but because nonacademics will be completely in the clear from a legal standpoint to run this stuff locally.

top 4 comments

sorted by: hot top controversial new old

[–] lemann@lemmy.one 3 points 1 year ago (1 children)

Nice work from these guys. I wonder how the open source reproduction compares side-by-side to the original LLaMA model...

[–] library_patron@lemmy.blahaj.zone 4 points 1 year ago

Depends on the task, but looks like about the same on average.

[–] SkySyrup@sh.itjust.works 1 points 1 year ago (1 children)

I hope this model gets fine-tuned similar to how the original LLaMA got fine-tuned, it really improves it!

[–] library_patron@lemmy.blahaj.zone 2 points 1 year ago

Yep. There do appear to be some such plans.