
I just installed Google AI Edge Gallery on my iPhone 16 Pro. Here are the results of the first benchmark on GPU (Prefill Tokens = 256, Decode Tokens = 256, Number of runs = 3): Prefill Speed = 231 t/s, Decode Speed = 16 t/s, Time to First Token = 1.16 s, First init time = 20 s.
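
For anyone who wants to sanity-check these figures, here is a rough back-of-the-envelope sketch in Python. It assumes the reported speeds are simple averages (tokens divided by seconds) and that prefill and decode run back to back; the function name `estimate_latency` is made up for illustration and is not part of the app.

```python
def estimate_latency(prefill_tokens, decode_tokens, prefill_tps, decode_tps):
    """Estimate phase durations in seconds from token counts and throughput.

    Assumption: throughput is a plain average (tokens / seconds), so
    duration is just tokens / throughput. Real runs add overhead.
    """
    prefill_s = prefill_tokens / prefill_tps
    decode_s = decode_tokens / decode_tps
    return prefill_s, decode_s, prefill_s + decode_s


# Numbers taken from the benchmark in the comment above.
prefill_s, decode_s, total_s = estimate_latency(256, 256, 231.0, 16.0)
print(f"prefill ~{prefill_s:.2f}s, decode ~{decode_s:.2f}s, total ~{total_s:.2f}s")
# prints: prefill ~1.11s, decode ~16.00s, total ~17.11s
```

The estimated prefill time of about 1.11 s is close to the reported 1.16 s time to first token, which is what you would expect if the first token arrives right after prefill plus a little overhead. The 20 s first init time is a one-off cost and is not included in this estimate.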

