@ggerganov's LLaMA works on a Pixel 6! LLaMA's been waiting for this, and so have I

Mar 13, 2023 · 7:57 AM UTC

Generations are very slow; seems it's due to load time. Might be because I'm running it on Termux
Does it use the Pixel 6's TPU?
No, it's not even optimized for the Pixel. I quantized it on my Mac and transferred the weights over due to space issues on my phone
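(Side note, not from the thread: the quantization step mentioned here is llama.cpp's 4-bit conversion, which stores weights in small blocks, each with its own scale factor. The C sketch below is only a conceptual illustration of that idea; the block size, struct, and function names are hypothetical and are not ggml's actual code or layout.)

```c
#include <stdint.h>
#include <stdio.h>
#include <math.h>

#define QBLOCK 32                 /* weights per block (illustrative, not ggml's constant) */

typedef struct {
    float   scale;                /* per-block scale factor */
    uint8_t q[QBLOCK / 2];        /* two 4-bit values packed per byte */
} qblock_t;

/* Quantize one block of floats to 4-bit values plus a scale.
 * This mirrors the general idea of a q4_0-style format, not its exact layout. */
static void quantize_block(const float *w, qblock_t *out) {
    float amax = 0.0f;
    for (int i = 0; i < QBLOCK; i++) {
        float a = fabsf(w[i]);
        if (a > amax) amax = a;
    }
    out->scale = amax / 7.0f;     /* map [-amax, amax] roughly onto [-7, 7] */
    float inv = out->scale != 0.0f ? 1.0f / out->scale : 0.0f;

    for (int i = 0; i < QBLOCK; i += 2) {
        int lo = (int)roundf(w[i]     * inv) + 8;   /* offset to 0..15 */
        int hi = (int)roundf(w[i + 1] * inv) + 8;
        if (lo < 0) lo = 0; if (lo > 15) lo = 15;
        if (hi < 0) hi = 0; if (hi > 15) hi = 15;
        out->q[i / 2] = (uint8_t)(lo | (hi << 4));  /* pack two nibbles per byte */
    }
}

int main(void) {
    float w[QBLOCK];
    for (int i = 0; i < QBLOCK; i++) w[i] = (float)(i - 16) / 16.0f;
    qblock_t b;
    quantize_block(w, &b);
    printf("scale = %f, first byte = 0x%02x\n", b.scale, b.q[0]);
    return 0;
}
```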
Oh man, I need to deploy this on my Pixel 6! Is there a repo you've posted anywhere? I can't keep up with all these AI LLaMA advances!
Only 26 seconds / token but it's something!
324 secs on a Pixel 6 is too slow to build an app around, but maybe this model version's params could be cut down somehow?
I wonder how it will do with @kdrag0n's NestBox!
Does the Pixel have that new Tensor chip? Wonder if we could get it to leverage it to speed up generation
Holy shit, is Termux still a thing?