Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It looks very adhoc

So ideally the ml network would solve the problem end to end but these authors seem to be using a network for only one step of their otherwise classic image processing pipeline



What a smear.

A lot of applied work in vision and audio glues together different existing modules, instead of training the whole thing end-to-end.

In an ideal world, things are ideal. But the world isn't a grad student's wet dream.

But there are not as many influential papers in the world as there are people who respond: "Why didn't they train the whole thing end-to-end?"


"Ideally they'd solve the whole problem, but it looks like they just solved one part of the problem"




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: