Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yes, although the number of parameters is not directly linked with the flops/speed of inference. What's nice about this AE architecture is that most of the compute (message embedding, and merging) is done at low resolution, same idea as behind latent diffusion models


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: