There are quite a few of them out there. Just a few from my notes (in reverse chronological order)
- https://github.com/persimmon-ai-labs/adept-inference
- https://github.com/shellc/laozy
- https://github.com/imartinez/privateGPT
- https://github.com/Josh-XT/AGiXT
- https://github.com/tloen/alpaca-lora
The self-hosted AI/ML system has been here for a while. Granted, the vast majority of them require downloading pre-trained models before they can be used due to how much it costs to build a system of weights from scratch.
I sometimes wonder if it would be possible to build and train a truly open source model with BOINC or something. The last 30 years of history show that it's entirely feasible to build a massively distributed computing cluster, why not leverage this to build a model? I know how naive that sounds immediately after writing it, mostly because of the difficulty of getting a large enough training data set, which unfortunately has risk written all over it (read: people poisoning the model, ala Microsoft's experiment with Tay on birbsite some years back).