More

nopurpose · 2026-03-12T22:13:56 1773353636

I remember listening to Oxide & Friends (or it was On the Metal?) podcast few years ago and had an impression they wrote their own training code.

p_l · 2026-03-13T08:50:08 1773391808

It's a more available option on AMD chips, intel AFAIK kept it a secret blob.

Ultimately oxide got to run customised firmware deal and AFAIK even got custom PSP firmware

nopurpose · 2026-03-10T14:10:03 1773151803

Not a user, but in what sense they are getting wiped on the flor? 4th place on llmarena looks solid: https://huggingface.co/spaces/lmarena-ai/arena-leaderboard

nopurpose · 2026-03-10T14:04:39 1773151479

I was missing magit, but then found `gitu` CLI and now use it happily for rebasing.

nopurpose · 2026-03-05T09:59:46 1772704786

Reminds me of "false sharing" effect: hidden common dependency and bottleneck for what looks like independent variables on the surface.

nopurpose · 2026-03-05T08:09:00 1772698140

How do those companies make money? Qwen, GLM, Kimi, etc all released for free. I have no experience in the field, but from reading HN alone my impression was training is exceptionally costly and inference can be barely made profitable. How/why do they fund ongoing development of those models? I'd understand if they release some of their less capable models for street cred, but they release all their work for free.

theshrike79 · 2026-03-05T13:12:25 1772716345

Chinese companies don't always operate on purely capitalistic principles, there is sometimes government direction in the background.

For China, the country, it's a good thing if American AI companies have to scramble to compete with Chinese open models. It might not be massively profitable for the companies producing said models, but that's only a part of the equation

miki123211 · 2026-03-05T18:29:38 1772735378

China seems to combine the best points of capitalism (many companies taking many shots on goal, instead of the eastern bloc way of one centrally-mandated solution that either works or not) with the best points of communism (state-sponsored industries that don't have to generate a profit, for the glory and benefit of the state).

theshrike79 · 2026-03-05T20:00:27 1772740827

There is a certain advantage to being able to go "I want a factory city here, that will manufacture ... Toasters"

rwmj · 2026-03-05T09:02:20 1772701340

The small spend may be worth it to destroy US proprietary AI companies.

gmerc · 2026-03-05T14:28:56 1772720936

How do US tech companies make money? They don't until the competition has been starved.

indrora · 2026-03-05T08:26:32 1772699192

Ostensibly, a mix of VC funding and that they host an endpoint that lets them run the big (200+GB) models on their infrastructure rather than having to build machines with hundreds of gigs of llm-dedicated memory.

wongarsu · 2026-03-05T09:23:09 1772702589

But on inference they have to compete with other inference provider that just has a homepage, a bunch of GPUs running vllm and none of the training cost. Their only real advantage are the performance optimizations that they might have implemented in their inference clusters and not made public

MarsIronPI · 2026-03-05T17:47:32 1772732852

Qwen, at least, IIRC has some proprietary models, specifically the Max series. IIRC these have larger context windows.

raven12345 · 2026-03-06T02:30:57 1772764257

As someone active in both English and Chinese media, I always feel like who relying on only one is brainwashing, just like Wumao. There's no difference here; it's always about the government control，destroying US company... In reality, free services have always been a competitive strategy for businesses in China, from ride-hailing to bike-sharing, all about grabbing market share and competing for potential users. Daily active users are what Chinese companies care about most.

nopurpose · 2026-03-04T16:45:13 1772642713

Adjacent to it are PR reviews. Suggesting simpler approach in PR almost always causes friction: work is done and tested, why redo? It also doesn't make a good promotion material: keeping landscape clear of overengineered solutions is not something management recognises as a positive contribution.

Cthulhu_ · 2026-03-04T16:47:44 1772642864

Depends on the management and whether they're involved in coding. Any engineering manager, architect, senior / lead developer etc should appreciate lower complexity.

Of course, if it's the person in charge introducing said overengineering there is a problem.

nopurpose · 2026-03-04T17:02:13 1772643733

they can recognise on the informal level, but you can't put it into end of the year review document. What it will be? "Kept N PRs from introducing cruft into our systems?". Fixing or building things is much more visible, than just maintaining high standards.

Worse, to suggest a simpler approach checking existing products/APIs or even preparing toy prototype is required to be confident in own advice. This hidden work is left entirely unnoticed even by well meaning managers/engineers: they simply don't know if you knew or had to discover simpler solution.

nopurpose · 2026-02-16T07:40:27 1771227627

Because it is RNG, their 5th can be my 1st.

nopurpose · 2026-02-09T08:38:57 1770626337

You could make same argument in "information superhighway" days, but it turned out to be the opposite: no company monopolised internet services, despite trying hard.

With so many companies in AI race it is already pretty competitive landscape and it doesnt seem likely to me that any of them can build deep enough moat to come ahead.

direwolf20 · 2026-02-09T11:51:25 1770637885

Internet services have been centralised into a few ISPs and a few websites everyone visits

nopurpose · 2026-02-09T13:53:42 1770645222

a few? all sorts of websites and services are thriving on the Internet even after significant consolidation of attention social media caused. Not even close to a dystopian picture parent comment paints.

direwolf20 · 2026-02-10T01:21:19 1770686479

90% of eyeball views are using the 5 sites each filled with screenshots of the other 4

nopurpose · 2026-02-05T10:42:33 1770288153

Is there a good tool for background migrations?

For example add temporarily nullable column to a large table, deploy new code which starts writing to the new column, in background populate that column for existing rows in batches and finally alter column to be mandatory non-nullable.

Another example of non-trivial schema management case is to make schema change after new version rollout completes: simple migration at the start of the container can't do that.

It must be a solved problem, but I didn't see a good tool for it which would allow expressing these imperative changes in a declarative way which can be comitted and reviewed and tested along the app code. It is always bunch of adhoc ugly scripts on a side and some hand waving deployment instructions.

tracker1 · 2026-02-05T19:21:14 1770319274

I tend to prefer to hand-roll schema migrations... but I use grate[1] for the most part. That said, I've created similar tooling for different scenarios.

1. https://grate-devs.github.io/grate/

Pretty easy to setup/use in a dev environment as well... see docker-compose.yaml and run/dbup script.

https://github.com/tracker1/FastEndpoints-SqlJobQueues

nopurpose · 2026-01-11T18:32:44 1768156364

> They don’t understand it and think it will replace them so they are afraid.

I don't have evidence, but I am certain that AI replaced most of all logo and simple landing pages designers already. AI in Figma is surprisingly good.

archerx · 2026-01-12T05:23:27 1768195407

I doubt it, you’ll still need humans to create novel ideas and designs because things will get stale after a while and trends/styles will continue to evolve.

Anamon · 2026-01-12T11:58:19 1768219099

Exactly. People are getting very good at detecting AI-generated designs -- because everyone can play around with it themselves and see in what ways they always tend to look alike.

To make an impression, it will become even more important to go with a real designer who can work in creative ways to regain people's attention.

But I have little doubt that a lot of the bread-and-butter, not-too-important, I-just-need-to-have-something jobs will no longer be contracted to actual designers.