this post was submitted on 09 Jun 2023

LocalLLaMA

A community for discussing LLaMA, the large language model created by Meta AI.

This is intended to be a replacement for r/LocalLLaMA on Reddit.

From the latest commits:

We are happy to release our final 1T token version of OpenLLaMA 3B and 7B. We’ve updated the evaluation results. We are also happy to release a 600B token preview of the 13B model, trained in collaboration with Stability AI.

I haven't tried it yet, and the 13B model is still in the works. Hopefully this will be a better foundation than the leaked Meta AI model, both because it makes research more reproducible and because non-academics will be completely in the clear, legally, to run it locally.

lemann@lemmy.one 3 points 1 year ago

Nice work from these guys. I wonder how the open-source reproduction compares side by side with the original LLaMA model...

Depends on the task, but looks like about the same on average.

SkySyrup@sh.itjust.works 1 point 1 year ago

I hope this model gets fine-tuned the way the original LLaMA was. Fine-tuning really improves it!

Yep. There do appear to be some such plans.