Hacker News

I'm mostly a layman with ML stuff, so I might be doing something wrong, but I haven't been impressed with LLaMA even at larger sizes. I've run the 33B model in my home lab and it gave some pretty nonsensical responses. The 13B did better, though, so it could very well be user error.


There’s a gold rush going on, and you’re right: any model size without RLHF is meh.

The things getting published as “on-device LLMs” focus on bitcrushing (aggressively quantizing) the smallest model with minimal RLHF and then pronouncing that we have on-device LLMs. We’ll definitely get there, but right now noise >>> signal.
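(Not from the thread, just a sketch of what the “bitcrushing” above refers to: reducing model weights to low-bit integers. A minimal symmetric int8 quantizer in NumPy; the function names are my own, not from any library.)

```python
import numpy as np

def quantize_int8(w):
    # Symmetric per-tensor quantization: one scale maps floats into [-127, 127].
    scale = np.abs(w).max() / 127.0 if w.size else 1.0
    q = np.round(w / scale).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Recover approximate floats; error is at most half a quantization step.
    return q.astype(np.float32) * scale
```

Real on-device schemes (4-bit, per-channel scales, grouping) are more involved, but this is the core trade: each weight costs fewer bits, at the price of rounding error.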

The first person to admit this and write a blog post with A/B tests against a Markov chain baseline deserves gold.


Have you tried Alpaca yet? It's a massive improvement over base LLaMA.




