Hacker Newsnew | past | comments | ask | show | jobs | submit | nopurpose's commentslogin

I remember listening to Oxide & Friends (or it was On the Metal?) podcast few years ago and had an impression they wrote their own training code.

It's a more available option on AMD chips, intel AFAIK kept it a secret blob.

Ultimately oxide got to run customised firmware deal and AFAIK even got custom PSP firmware


Not a user, but in what sense they are getting wiped on the flor? 4th place on llmarena looks solid: https://huggingface.co/spaces/lmarena-ai/arena-leaderboard

I was missing magit, but then found `gitu` CLI and now use it happily for rebasing.

Reminds me of "false sharing" effect: hidden common dependency and bottleneck for what looks like independent variables on the surface.

How do those companies make money? Qwen, GLM, Kimi, etc all released for free. I have no experience in the field, but from reading HN alone my impression was training is exceptionally costly and inference can be barely made profitable. How/why do they fund ongoing development of those models? I'd understand if they release some of their less capable models for street cred, but they release all their work for free.

Chinese companies don't always operate on purely capitalistic principles, there is sometimes government direction in the background.

For China, the country, it's a good thing if American AI companies have to scramble to compete with Chinese open models. It might not be massively profitable for the companies producing said models, but that's only a part of the equation


China seems to combine the best points of capitalism (many companies taking many shots on goal, instead of the eastern bloc way of one centrally-mandated solution that either works or not) with the best points of communism (state-sponsored industries that don't have to generate a profit, for the glory and benefit of the state).

There is a certain advantage to being able to go "I want a factory city here, that will manufacture ... Toasters"

The small spend may be worth it to destroy US proprietary AI companies.

How do US tech companies make money? They don't until the competition has been starved.

Ostensibly, a mix of VC funding and that they host an endpoint that lets them run the big (200+GB) models on their infrastructure rather than having to build machines with hundreds of gigs of llm-dedicated memory.

But on inference they have to compete with other inference provider that just has a homepage, a bunch of GPUs running vllm and none of the training cost. Their only real advantage are the performance optimizations that they might have implemented in their inference clusters and not made public

Qwen, at least, IIRC has some proprietary models, specifically the Max series. IIRC these have larger context windows.

As someone active in both English and Chinese media, I always feel like who relying on only one is brainwashing, just like Wumao. There's no difference here; it's always about the government control,destroying US company... In reality, free services have always been a competitive strategy for businesses in China, from ride-hailing to bike-sharing, all about grabbing market share and competing for potential users. Daily active users are what Chinese companies care about most.

Adjacent to it are PR reviews. Suggesting simpler approach in PR almost always causes friction: work is done and tested, why redo? It also doesn't make a good promotion material: keeping landscape clear of overengineered solutions is not something management recognises as a positive contribution.

Depends on the management and whether they're involved in coding. Any engineering manager, architect, senior / lead developer etc should appreciate lower complexity.

Of course, if it's the person in charge introducing said overengineering there is a problem.


they can recognise on the informal level, but you can't put it into end of the year review document. What it will be? "Kept N PRs from introducing cruft into our systems?". Fixing or building things is much more visible, than just maintaining high standards.

Worse, to suggest a simpler approach checking existing products/APIs or even preparing toy prototype is required to be confident in own advice. This hidden work is left entirely unnoticed even by well meaning managers/engineers: they simply don't know if you knew or had to discover simpler solution.


Because it is RNG, their 5th can be my 1st.


You could make same argument in "information superhighway" days, but it turned out to be the opposite: no company monopolised internet services, despite trying hard.

With so many companies in AI race it is already pretty competitive landscape and it doesnt seem likely to me that any of them can build deep enough moat to come ahead.


Internet services have been centralised into a few ISPs and a few websites everyone visits


a few? all sorts of websites and services are thriving on the Internet even after significant consolidation of attention social media caused. Not even close to a dystopian picture parent comment paints.


90% of eyeball views are using the 5 sites each filled with screenshots of the other 4


Is there a good tool for background migrations?

For example add temporarily nullable column to a large table, deploy new code which starts writing to the new column, in background populate that column for existing rows in batches and finally alter column to be mandatory non-nullable.

Another example of non-trivial schema management case is to make schema change after new version rollout completes: simple migration at the start of the container can't do that.

It must be a solved problem, but I didn't see a good tool for it which would allow expressing these imperative changes in a declarative way which can be comitted and reviewed and tested along the app code. It is always bunch of adhoc ugly scripts on a side and some hand waving deployment instructions.


I tend to prefer to hand-roll schema migrations... but I use grate[1] for the most part. That said, I've created similar tooling for different scenarios.

1. https://grate-devs.github.io/grate/

Pretty easy to setup/use in a dev environment as well... see docker-compose.yaml and run/dbup script.

https://github.com/tracker1/FastEndpoints-SqlJobQueues


> They don’t understand it and think it will replace them so they are afraid.

I don't have evidence, but I am certain that AI replaced most of all logo and simple landing pages designers already. AI in Figma is surprisingly good.


I doubt it, you’ll still need humans to create novel ideas and designs because things will get stale after a while and trends/styles will continue to evolve.


Exactly. People are getting very good at detecting AI-generated designs -- because everyone can play around with it themselves and see in what ways they always tend to look alike.

To make an impression, it will become even more important to go with a real designer who can work in creative ways to regain people's attention.

But I have little doubt that a lot of the bread-and-butter, not-too-important, I-just-need-to-have-something jobs will no longer be contracted to actual designers.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: