Four DB2 Code Bases? (mvdirona.com)
157 points by fanf2 on Dec 23, 2017 | hide | past | favorite | 97 comments


I am just going to drop a few quick comments. It's 6:39AM in NY, and I just woke up. Maybe not relevant at all, but...

I love reading biographies like this. I own a couple of such books; "Where Wizards Stay Up Late: The Origins Of The Internet", for example, kept me awake for a few days on an early train. This one was absolutely easy to digest, thank you for sharing.

1. So do the four code bases share code they reuse (think of a common repository they depend on)?

2. Are the query planners all different because they needed to be platform specific?

3. Is DB2 still on four code bases?

Would be awesome if we had one central catalog of stories like this one.

BTW, I didn't know much about the author, so I did some quick googling: http://mvdirona.com/jrh/work/

(I hope HN won't flood it to 404! Mr. James, is this on AWS!? It looks like it might be.)


Totally off topic, but I just grabbed that book off my shelf to reread over the holidays while traveling.


Commentary: there are far too many names to keep track of, so to future readers, my advice is to avoid a long gap between reading sessions. But it is fun to learn all the names and what was going on at that time.


If you have time to list the other books that you own like this and "Where Wizards Stay Up Late: The Origins Of The Internet", I'd definitely appreciate it. I love reading these types of stories too!


"Showstopper!: The Breakneck Race to Create Windows NT and the Next Generation at Microsoft" by G. Pascal Zachary is great, and covers an interesting period of Microsoft's life.


Not sure if it fits your bill, but I found "I Sing The Body Electronic" by Fred Moody a very interesting read.

Moody was able to get some pretty intimate insights into the development of a software product at Microsoft at the beginning of the 1990s (1992-1993, I think).

I was surprised to learn afterwards that Moody had (has?) a reputation for being very pro-Microsoft; this book does not gloss over the problems the people on the project team face, nor their conflicts.


I liked "The Friendly Orange Glow: The Untold Story of the PLATO System and the Dawn of Cyberculture" which sounds related... I hadn't heard about PLATO before, and the book was strongly redolent of "online culture".


I will add as well:

"The Chip: How Two Americans Invented the Microchip and Launched a Revolution"


"We Were Burning: Japanese Entrepreneurs And The Forging Of The Electronic Age"


I’m one of those youngish programmers who has never really worked on anything but Linux, so the idea of these IBM operating systems is pretty foreign to me. The article describes System i as “very advanced” but I have never even really heard of anyone using it or referencing it.

How do you write and deploy software for these things? I am so used to just being able to, like, build a Go binary and run it on a Linux box. Are they better in some ways that I am missing out on? Just curious.


IBM i is advanced in some interesting ways, but the design has plenty of drawbacks. The database (DB2) is baked right into the operating system and is an integral component that virtually every application relies on - you can access tables via SQL or directly through record-level access (the equivalent of fopen(), but against database tables). In fact, the “traditional” UNIX-like filesystem (the IFS) on IBM i is an entirely different world; the object storage subsystem (the database) is where 99% of everything lives on IBM i, including most applications. Applications aren’t compiled to machine code directly, outside the lowest-level code needed to get the system running - they’re all stored as bytecode and compiled to machine code on first run, which is how IBM has taken advantage of newer POWER releases without requiring customers to beg vendors to upgrade or rebuild their code. There’s a hell of a lot more, but these are two key components of the design.
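To make the dual access model concrete: the same table can be read declaratively with SQL or positionally, one record at a time, the way RPG's CHAIN/READ opcodes work against a keyed physical file. A rough sketch of the two styles (Python and sqlite3 here purely as a stand-in; the real IBM i interfaces are RPG record I/O and embedded SQL, not shown):

```python
import sqlite3

# Sketch of IBM i's dual access model, with sqlite3 as a stand-in.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, amount INTEGER)")
db.executemany("INSERT INTO orders VALUES (?, ?)", [(1, 10), (2, 20), (3, 30)])

# Set-oriented SQL access: describe the result you want.
total = db.execute("SELECT SUM(amount) FROM orders").fetchone()[0]

# Record-oriented access: position on a key and fetch one record,
# roughly what RPG's CHAIN does against a keyed physical file.
def chain(key):
    return db.execute("SELECT id, amount FROM orders WHERE id = ?", (key,)).fetchone()

print(total, chain(2))  # -> 60 (2, 20)
```

The point is that on IBM i both paths hit the same integrated database objects, rather than a database server bolted on top of a byte-stream filesystem.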

As for writing software, you basically have no choice but to use tooling IBM has support for. There is an emulated POSIX environment (PASE), but it’s mostly used by IBM to port things from AIX. The traditional IBM i languages are RPG, plus C/C++ as a newer addition, but they have runtimes for Java, Python, Node and a few others. The traditional languages are the only way to make classic green-screen apps, while the latter are there primarily to host web applications.


> but the design has plenty of drawbacks

As somebody who has never been near an AS/400, what are those drawbacks? Besides the environment being weird in the sense that it is unlike anything else?

The only bad thing I have ever heard someone say about the AS/400 was from a supervisor I worked for during my training who had at one point written RPG programs, and she said ... some pretty unfriendly things about RPG. Which is not to say that's the only problem; like I said, I have no knowledge of the system beyond what Wikipedia tells me about it.


There’s extremely limited compatibility with POSIX software; PASE exists, but you aren’t just going to be porting applications with minimal or zero effort. Honestly, IBM i makes little sense as a modern application platform, and DB2 for i is wonky, plus licensing for it sucks (if you’re connecting from anything but Java, where you can use open source drivers, expect to pay through the nose for DB2 Connect unless your application also runs on i).


  As for writing software, you basically have no choice but to use tooling IBM has support for.
Is that really the case now? Even back in the 1980s, other software publishers made great tools and compilers for OS/MVS mainframes. Boole and Babbage's Resolve did a lot of cool stuff, including giving you the mainframe equivalent of "su" right down to the TSO terminal level. And we didn't even use IBM compilers; Capex had optimized compilers with great system dump formatting.


IBM i, aka AS/400, is indeed quite advanced.

It was developed in PL/S, a PL/I dialect, back when C was only relevant inside AT&T's walls.

The binaries use a bytecode format known as TIMI, and IBM i contains a kernel-level JIT, so any application gets AOT-compiled at installation time, or whenever you feel like refreshing its native image.

The filesystem follows a database model known as the catalog.

Main programming languages on the classical mainframe environment are RPG II, Fortran, Cobol, PL/S, C, C++ and Java.

Yes, IBM i Java makes use of TIMI bytecodes as well.

Then there is the POSIX emulation layer to help port UNIX applications into IBM i.

In good old mainframe fashion, these environments are each their own little world.

When the need to modernize IBM i came to be, IBM started porting PL/S code to C++.

If one wants to actually target native code and not TIMI, there is a C compiler called Metal C.

This is just about IBM i; IBM z and Unisys ClearPath are also quite interesting. All written in systems programming languages much safer than straight C, and with lots of nice features which UNIXes still haven't adopted.


Unisys ClearPath

I've been trying to find an emulator for Unisys for years, as I've come in contact with those systems twice in my career, but both times it was "oh, those are used by [hushed tones] the legacy team" which isn't good for learning about how they work.

Do you know of something akin to Hercules, or even something on eBay, which would allow a sufficiently curious person to kick the tires on a Unisys OS?


They have a free education/hobbyist version officially available that runs as a VM on top of Windows. (http://www.unisys.com/offerings/clearpath-forward/clearpath-...)


There is a [dead] reply mentioning an educational/hobbyist version. It gives the url http://www.unisys.com/offerings/clearpath-forward/clearpath-...


Fantastic! Maybe I wasn't the only one who wanted to learn more about that ecosystem. My thanks to you both for bringing that offering to my attention.

Also, for future comers-along, that link has all the same language as the download registration, minus the download registration link :-(

Thankfully, I was able to find the real one via the Unisys MCP Express YouTube channel:

http://www.unisys.com/offerings/clearpath-forward/clearpath-...

Separately, the idea that one runs OS 2200 on Windows using (apparently) .NET 4.5 blows my mind. I can't wait to see what's up with it.


If you click on the comment timestamp you can find the "vouch" option which makes the comment visible. I've done so, so the comment is no longer [dead].


Zend has a full PHP environment for the i, and you can install things like Apache and MySQL on it. You can even virtualize Linux on it.

Most older programs are written in RPG, but there are ways to write programs in other languages like C or COBOL. You can use tools like IBM VisualAge to have a "modern" environment instead of hammering away at green-screen text terminals.

It's a very odd environment, unlike both Windows and UNIX. There is a job scheduler, output queues, print devices, security levels, objects...

Check out the young i developers group if you want to learn more. They embrace modern tech like node and github

http://yips.idevcloud.com/wiki/


iSeries (System/38, AS/400) is very interesting to learn about, even today. The book "Inside the AS/400" by Frank G. Soltis is good, the "AS/400: Designed for the Cloud" video (https://www.youtube.com/watch?v=0dTZVyRDnds) is inspiring, and the "Laughing Boardroom" commercial series isn't bad either.



I have a lot of respect for databases in general, and DB2 is no different. But if you've ever tried to install DB2, you'll know it's a monstrosity, and even internally it was hard to get a license. The result is that most IBMers I know would rather use e.g. Postgres for other projects, not DB2. Now I'm not silly enough to think Postgres will replace DB2 at e.g. banks, but the effect is that Postgres's reputation and knowledge spread, while DB2's die. In the long run, I think this is a good thing.

Also, if you read between the lines, there's a ludicrous amount of infighting in IBM, which is probably how they ended up with four code bases. Still happens to this day.


Now I'm not silly enough to think Postgres will replace DB2 at e.g. banks

You might be surprised then, because that's not silly at all. DB2 may hang on for a while longer in the Back Office - for various reasons the least technically sophisticated bit of banking - but it's long gone from the Front Office, and Postgres is one of the replacements. I don't think anyone is starting new projects on DB2 even in BO.


Are there any databases successfully in production in high-value transactional environments other than oracle or DB2? I have to believe a lot of the pain coinbase is experiencing stems from their decision to use mongodb - which I love and use daily, but would never, ever have chosen over Oracle for a high-transaction bitcoin exchange.

I wonder if anybody has done a write-up on the Oracle/DB2 versus “others” question in the super-high-value back office scenario.


Are there any databases successfully in production in high-value transactional environments other than oracle or DB2?

Yes, Postgres is often used and in-house developed DBs are fairly popular too. On the commercial side, Sybase still exists too! And of course SQL Server, tho' I am not aware of anyone using it on Linux in production yet.

The inertia keeping DB2 in the BO is all the third party apps running on it - if they move they will need to persuade all their vendors to move en-masse to whatever comes next. FO software is a lot more bespoke and anything from outside tends to be chosen by people who know what they're doing.

I have to believe a lot of the pain coinbase is experiencing stems from their decision to use mongodb

Almost certainly. I would go in-house before Mongo any day of the week. That's if Postgres wasn't already a better JSON store than Mongo is...
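For what it's worth, the Postgres-as-JSON-store point usually refers to jsonb columns queried with operators like `doc->>'name'` and `doc @> '{...}'`. The shape of it can be sketched like this (sqlite3's JSON1 functions standing in so the snippet is self-contained; the table and document names are invented):

```python
import json
import sqlite3

# JSON documents stored in an ordinary relational table and queried
# in place -- in Postgres this would be a jsonb column, e.g.:
#   SELECT doc->>'name' FROM events WHERE doc @> '{"level": "error"}';
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE events (doc TEXT)")
docs = [{"name": "login", "level": "info"},
        {"name": "disk_full", "level": "error"}]
db.executemany("INSERT INTO events VALUES (?)",
               [(json.dumps(d),) for d in docs])

rows = db.execute(
    "SELECT json_extract(doc, '$.name') FROM events "
    "WHERE json_extract(doc, '$.level') = 'error'"
).fetchall()
print(rows)  # -> [('disk_full',)]
```

In Postgres you also get expression indexes and GIN indexes over jsonb, which is a large part of why people make the "better than Mongo" claim.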


Are there any databases successfully in production in high-value transactional environments other than oracle or DB2?

Teradata!

And IMS, but that's not relational.


And ADABAS (officially declared fully supported through the year 2050 and beyond). Most often considered non-relational too.

High-value transactional environments are most often neither relational nor even running on an operating system (in the modern sense at least), like zTPF - "because a real operating system is too high level and therefore too slow for real transaction processing needs". Current users of TPF include Sabre, VISA, American Airlines, American Express, HP SHARES (formerly EDS), Holiday Inn, Alitalia, KLM, Amtrak, Marriott International, Travelport, Citibank, Citifinancial, Air Canada, Delta Air Lines, Japan Airlines and many others.


I think SAP keeps Sybase Adaptive Server Enterprise on life support due to it still being used for financial applications.


Both investment banks I worked at were massive Sybase ASE shops. Oracle costs too much for what you get.


In terms of cost - I've always thought that one reason Oracle can charge many millions of dollars per server for a database license is that, over the total cost of a project (with sometimes many hundreds of engineers) and the value of the project (driving hundreds of millions or more per year), the database license cost ends up being rounding error.

The corollary to this, of course, is that anybody who isn't involved in projects of that scale, is going to be far better served with Postgres/MySql.

It will be interesting to see what the uptake is on products that try to fit in the middle - like Amazon Aurora - high scale and affordable.


License costs are generally quite low compared to the overall price of a project, that's true.

But licenses can also be a bit of a nightmare to manage. I've seen many cases where licenses were violated due to the need for testing/development environments. It can be difficult to go through the purchase process in big companies, especially if the software in question is not widely used. Handling licenses can also be hard technically, when your licenses are tied to hardware (a MAC address, for example) or configuration (hostname or IP). When migrating some legacy hosts to VMs, I had numerous headaches dealing with FlexLM servers. It can also be really annoying when you don't have perpetual licences and the vendor goes under or is not able to generate a new licence for the (obsolete) version you are using.

Basically, you must add specific processes and even additional infrastructure (inventory, CMDBs) just to manage it. It can also slow down development.

I understand that in many fields, no viable OSS alternatives exist, or even could exist, especially for specialised software.

But if I can avoid proprietary, I generally do.

I'm not against paying for software, but other forms of payment, like a support contract, are generally far less annoying.


They can charge it because if you really need Oracle, there really is no substitute. 12c is genuinely state-of-the-art, Exadata does things that only mainframes can otherwise do and a bunch of stuff that they can't.

There are far fewer people however that do really need it, than believe that they do... The vast majority of Oracle shops could probably switch to Postgres for less than their annual licenses, then start saving the money from the second year on...

My rule of thumb is that Postgres gets you 50% of Oracle for 0% of the price. Most of what most people need is somewhere in that 50%. SQL Server gets you 90% for 25% of the cost. I think Oracle will lose a lot of seats to SQL Server once it's stable on Linux.


> My rule of thumb is that Postgres gets you 50% of Oracle for 0% of the price. Most of what most people need is somewhere in that 50%. SQL Server gets you 90% for 25% of the cost.

Can you elaborate on what kinds of things fill the 40% difference between PG and MSSQL? Having used both PostgreSQL and SQL Server, my experience has been pretty much the other way around. When I need to do something with data in a table, PG probably has a feature allowing me to do it easily, and MSSQL probably doesn't. The simplest questions asked on Stack Overflow are answered with functions spanning tens of lines, often with various caveats.

The one area where MSSQL is clearly better than PG is GUI tooling, but even there I'm not particularly impressed. More than once have I read that someone absolutely loved SQL Server Management Studio, but the one conclusion I can come to after using it is "it works". For example, "Select Top 1000 Rows" is a fine view with autocompletion and everything, but then "Edit Top 200 Rows" has an SQL panel hidden behind a context menu, without autocompletion, and when you execute your modified query it rewrites it and removes all comments.


Can you elaborate on what kinds of things fill the 40% difference between PG and MSSQL?

SQL Server is well ahead of PG in query parallelisation and complex replication topologies, even as of PG 10.1. SQL Server is also ahead in change data capture, but PG 10 narrowed that gap. Managing a large user base is easier on SQL Server. If you have those use cases it is well worth the money.

If I need to do something clever with the contents of the table, SQL Server's embedded R is very good.

Having used both PostgreSQL and SQL Server,

Well indeed, most people only use a subset of their DB's full feature set, but it will be a different subset for each person :-) I don't really use GUI tools so I can't comment on that but I've heard that pgAdmin 4 is awful.


Thank you, that's the kind of answer I was looking for. Those are indeed not things I've worked with.

> I've heard that pgAdmin 4 is awful.

I've heard the same. I wasn't a fan of pgAdmin 3 either, so I've stuck with the psql command line.


> I've heard the same. I wasn't a fan of pgAdmin 3 either, so I've stuck with the psql command line.

In case you haven't heard of it, I highly highly recommend pgcli[1] if anyone reading this is doing the same as you and me but hasn't highly customized their psql shell yet. It's one of those tools I wish I had known about earlier.

1. https://www.pgcli.com/


Thank you, that's the kind of answer I was looking for

I love that in the HN community we can have these conversations without it descending into a platform war :-)

I did a lot of Oracle up to SQL Server 2012, then just didn't need it any more; SQL Server + PG 9 and now 10 hit all my use cases for a tiny fraction of the cost. Now I wonder why so many people still pay so much for it.


Well, as a Postgres/Sybase DBA, there are several management features I miss in Postgres.

For one, when there is no space left for WALs, your instance just crashes. In Sybase/MSSQL your processes would just be suspended until you add some disk space.

Also, spreading your databases over several disks is way easier. On Postgres, adding tablespaces makes pg_basebackup lose functionality.

These are limitations that developers might not see, but they impact availability.

Also, reclaiming unused space online can be tedious, and failover from one instance to another is not easy. You have to use third-party tools such as repmgr to do so.

All of these missing features have caused several outages and should be addressed, in my opinion, for Postgres to meet the needs of the most demanding workloads.

Still, I view postgres as the best solution for most business cases.


> For one, when there is no space left for WALs, your instance just crashes. In Sybase/MSSQL your processes would just be suspended until you add some disk space.

Are these not equivalent from the point of view of external clients?


Blocking is not normally equivalent to termination for most processes, and not all database activity is from human interaction.


> 12c is genuinely state-of-the-art

I know a lot of companies that are still on 11g, and that is not in any way state-of-the-art; heck, it does not even have a LIMIT clause. It's hard to actually trust your statement that 12c is so much better when 11g was basically trash.


11g was basically trash

Because it was lacking a tiny piece of syntactic sugar? Just use ROWNUM.
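For anyone who hasn't done it: the usual 11g idiom was to wrap the ordered query and filter on ROWNUM in the outer block, since ROWNUM is assigned before ORDER BY within the same block. A sketch of the two shapes (sqlite used here only to run the LIMIT form; the table name is invented):

```python
import sqlite3

# Pre-12c Oracle pagination: ROWNUM is assigned as rows are produced,
# so the ORDER BY must happen in an inner block before filtering:
oracle_11g_style = """
    SELECT * FROM (SELECT x FROM t ORDER BY x) WHERE ROWNUM <= 2
"""

# The equivalent with the LIMIT clause 11g lacked (run on sqlite just
# to show the result; 12c finally added FETCH FIRST n ROWS ONLY):
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE t (x INTEGER)")
db.executemany("INSERT INTO t VALUES (?)", [(3,), (1,), (2,)])
rows = db.execute("SELECT x FROM t ORDER BY x LIMIT 2").fetchall()
print(rows)  # -> [(1,), (2,)]
```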


If only it were just that... but if your "express" edition is harder to configure/install than a Kubernetes cluster (the hard way), then something is really, really wrong.


It's a bit more than that. Django has a lot of workarounds[1] for Oracle and its strange ways.

1. https://github.com/django/django/blob/master/django/db/backe...


No one who needs Oracle cares about quirky syntax. They care that Flashback Query or some other major feature works.


Good for them, but that doesn't reduce the maintenance burden for anyone trying to write database-agnostic software. I guess that's a good business strategy if you are in Oracle's position.

In any case your original quote was "Because it was lacking a tiny piece of syntactic sugar?", and I was merely pointing out that it is a bit more than some syntactic sugar.


anyone trying to write database agnostic software.

Who are these people, really? No one tries to write language-agnostic code and then gets upset when it won't compile as both C++ and Java. People who try to write code that will compile on both Unix and Windows accept that they must make compromises... or they have lots of platform-specific code hidden in a framework somewhere. So that is a silly argument.

If you are investing in a platform - and I don't just mean licenses but TCO, including hiring experienced people - then it makes sense to exploit all of its features to their utmost and extract the most value. If you aren't doing that, then you are coding with one hand tied behind your back.


We are getting a bit off point here. I agree with your overall point, but that's not what I'm arguing. I'm simply saying that there are reasons to criticize Oracle's DB offering that extend past 'limit/offset syntactic sugar', and there are a lot of quirks.

As a cheap, easy example of such quirks you can look at software that has to work around them, like Django.


reasons to criticize Oracle's DB offering that extend past 'limit/offset syntactic sugar', and there are a lot of quirks

Sure. But the point I was making is that if you really do need Oracle's higher-end features, there aren't many or sometimes any substitutes. How long have the PG crew been trying to get AS OF working? But Flashback's been in Oracle for years and if you need it, you need it. And if you don't, then Postgres is a pretty safe choice. But Oracle is very far from "basically trash" because some guy doesn't like some bit of syntax!


pgSQL is still not quite there when compared to PL/SQL.

And PostgreSQL lacks tooling at the same level as APEX, SQL Developer, and the JDBC/.NET drivers (especially bulk operations and distributed transactions).


> pgSQL is still not quite there when compared to PL/SQL.

Yes, but maybe it's a good thing after all. pgSQL is small, and mostly clear and restrictive; there's no grab bag of imperative features and dark corners. The main point of PostgreSQL has always been extensibility, so it supports a broad list of programming languages for stored procedures.

To address efficiency concerns, executor's JIT is on the way - https://postgrespro.com/roadmap/56509


I work for an Oracle shop and I actively try to avoid using SQL Developer: on Mac at least, it’s a resource hog and fairly slow, and additionally, its autocomplete facilities are fairly flaky. These days I just use the DB integration built into IntelliJ, which is much nicer (and not limited to one DB).


If you colour inside the lines, APEX is fantastically productive, agreed.


I've been part of exciting conversations with desk heads attempting to justify database licensing fees in flat/falling trading environments where traders and people are being let go.

Given those experiences, I personally stopped optimizing for today's flush tech budget. I now budget for what I know is a realistically shitty future state of affairs where we have to keep the lights on [1] while spending as little as possible.

[1] The one thing worse than telling a trader during a 2008-level crisis "I need more money for databases" is telling them "you can't trade, we don't have enough capacity to handle the additional market vol".


DR, journaling, auditing & security, data integrity - at scale. These are all Oracle mainstays.


Sybase is pervasive in investment banks. Not sure why.

Other popular big databases are Ingres and Informix.

At a big airline I worked at, we had three different major databases on our Unix systems: Informix, Ingres and Oracle.

The mainframes had a few more.

Here's a HN poll on what dbs people use. https://news.ycombinator.com/item?id=2684620


Sybase is pervasive in investment banks. Not sure why.

Sybase were always very willing to customise their product for financial services customers. Someone I knew there told me about the changes they’d made to their DATE datatype to make it easy for one bank to handle bonds from the Napoleonic era, for example.


It also used to be quite a bit faster 20 odd years ago.


Sybase was good, ran on Unix, did not cost the earth, and had more transparent pricing than Oracle. Back in the late 90s some trading systems used Informix, but the big exchanges I worked on always chose Sybase over our internal use of Informix. Sybase was pretty good aside from a few gotchas. Does anyone remember the exploding execution-time issues when joining more than three tables?


Was this before 11? I regularly did 10+ table joins on Sybase, and the execution was stellar as long as you had the right indices set up.


Version 10. It has long since been fixed. I was always a fan.


There's some big Belgian bank that runs or ran Informix; they tried to recruit me back then.


These days you often have multiple systems: one for rock-solid storage, one for fast-moving development on the same dataset. Apache Ignite is trying to become a database you can trust with that kind of data, but it's "not there yet" to be used as the only storage.


SQL Server is also used in such scenarios.


BO has all been about cost over the past few years, and that's only increasing.

'Robotics' solutions are what new projects are now - simply layers on top of existing solutions. They clamp onto existing things rather than re-implementing them.


'Robotics' solutions are what new projects are

It's funny: a few years ago in the FO we heard a rumour that the bank was starting work on robotics, and we were curious, and some of my colleagues were keen to get involved. Robots! We didn't realise at the time that "robotics" is the word the BO uses to mean "any sort of automation, whatsoever". Even VBA. Banks are now trying to recruit graduates with the promise of working on "robotics", ho ho ho.


Yes. The term robotics as used in banking is a bit ridiculous. EUC is deeply frowned upon by audit, however, so a different name is used.

The access to quality tech the BO has is terrible. They lack funding for people who can DIY, and lack the time and budget to experiment.

Fintech is not driven by tech but by regulation, where banks are forced to make an investment in change, not just talk about it.


We did exhaustive research for a payment processing system where we examined RDBMS, document, and other DB types. We could find no niche or feature set that wasn't better satisfied by SQL Server, PostgreSQL, or Oracle RAC (in the RDBMS area), depending what you needed to optimize for (cost, clustering, ease of use, performance, tooling).


I put our DB2 installation process into an Ansible playbook. Installation time is now like 12-15 minutes, tops.
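For context, DB2 supports silent installs driven by a response file (db2setup -r), which is what makes this kind of automation workable. A minimal hypothetical sketch of such a playbook - the paths, task names and response file are invented, not the poster's actual play:

```yaml
# Hypothetical sketch only; paths and file names are illustrative.
- hosts: db_servers
  become: yes
  tasks:
    - name: Unpack the DB2 server install image
      unarchive:
        src: /repo/db2_server.tar.gz
        dest: /tmp/db2install
        remote_src: yes

    - name: Run db2setup silently with a prepared response file
      command: /tmp/db2install/server/db2setup -r /tmp/db2install/db2server.rsp
      args:
        creates: /opt/ibm/db2/V11.1/bin/db2
```

The `creates:` guard keeps the play idempotent, so rerunning it skips an already-installed host.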


The article mentions Informix, which has also been an IBM product for a number of years now. That makes it at least five major RDBMS code bases inside IBM.

I spent a few years managing DB2 instances at a previous employer and I've seen it hold its own against mixed workloads (i.e. shitstorms of badly behaved applications and people running ad-hoc queries) that I don't think even a well-tuned PostgreSQL could handle today.

DB2 has some really annoying quirks(1) and that IBM'ish feel that's very hard to explain, but it's also very understandable (from a tuning perspective) for such a complex piece of software.

DB2 for Windows seemed mostly crap compared to Linux or AIX, though.

(1) For example, at least since I've last looked into it, DB2 had no decent MVCC(2) and it was a real pain to deal with all the lock-related problems that would arise from large write/update transactions running in parallel with small reads.

(2) At some point (9.5, IIRC) they tried to simulate MVCC by letting transactions read previous row versions from the transaction log. This seemed to come with its own set of significant drawbacks.
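For readers who haven't hit this: under MVCC, a reader sees a committed snapshot instead of blocking on the writer's row locks. The difference can be sketched with sqlite's WAL mode (a stand-in only; DB2's locking and Postgres's MVCC internals are both very different):

```python
import os
import sqlite3
import tempfile

# Two connections to the same file-backed database. WAL mode gives
# readers a committed snapshot while a writer holds an open
# transaction -- the behaviour classic DB2 lacked, where readers
# would instead queue up behind the writer's locks.
path = os.path.join(tempfile.mkdtemp(), "demo.db")

writer = sqlite3.connect(path, isolation_level=None)  # explicit BEGIN/COMMIT
writer.execute("PRAGMA journal_mode=WAL")
writer.execute("CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER)")
writer.execute("INSERT INTO accounts VALUES (1, 100)")

reader = sqlite3.connect(path, isolation_level=None)

# The writer changes the row but does not commit.
writer.execute("BEGIN")
writer.execute("UPDATE accounts SET balance = 50 WHERE id = 1")

# The reader neither blocks nor sees the uncommitted value.
snapshot = reader.execute("SELECT balance FROM accounts WHERE id = 1").fetchone()[0]
print(snapshot)  # -> 100

writer.execute("COMMIT")
```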


I interviewed with Informix shortly before IBM acquired it, and after they acquired Illustra (a commercialized Postgres, company started by Stonebraker). My recollection was that Informix was a mess having decided to basically build their next major version on the untested Illustra. Different locations, lots of politics, two massive codebases. I remember thinking that Informix shot themselves in the head, and that IBM then acquired them cheap.


That brings back memories. I was involved in trying to build webapps on Illustra in the mid/late 90s. We (a small dev shop) ended up with it because it was sold to our non-technical manager on the basis that it was the future for the web and it was ridiculously expensive for the kind of bottom-feeder projects we were doing - which was kind of galling given we were being paid peanuts.

But even I was initially excited - the feature list looked cool - multimedia, rich/extensible data types (called blades?), OO/table inheritance, built-in dynamic web page generation, etc. But the performance was absolutely terrible and all the advanced features turned out to be less than useful or slightly broken. I remember my disappointment when I first tried out the "search for images LIKE a reference image" feature and it matched a butterfly with a racing car. We struggled for a few months with the general flakiness and horrible performance of Illustra and eventually convinced them to allow us to use Informix instead and went back to storing images as files served by HTTP and using PHP or Perl for the dynamic HTML generation. The Informix sales dudes seemed convinced that Illustra was the future.

The performance was much better, but for the web apps we were writing, we could have just stuck with mSQL or MySQL and saved a lot of time and effort.


multimedia, rich/extensible data types (called blades?)

Yeah, DataBlades. I too was burned by them in the 90s.


It became a mess as Phil White ran the place into the ground.

Sadly, he never would have been brought in but for the financial disaster that was Informix's acquisition of Innovative Software; the resulting financial stress necessitated "new upper management with Wall Street credibility". Hence came Phil White and his minions, and the rest is a sad history.


Informix is a pretty great database. Much like you mentioned with DB2, it can take really varied workloads on the same database and keep on trucking. Just requires some basic indexing, no need to carefully tune to a particular use case.

The best part is how resource-light it is. I was using Informix on a dual 200MHz HP K-class box until about 8 years ago. HP literally just gave us an RP3440 to get us off it...


There's a little-known link between Informix and DB2.

Back in '86, DB2 lacked a mechanism to run ad hoc SQL from a file. They had SPUFI for terminal use but apparently nothing you could just wrap in JCL and fire off.

Informix developed a product called Batch/SPUFI to do exactly that, along with porting their Ace report writer to use on DB2.

We got it working, but the company abandoned the entire mainframe product line soon thereafter.


I've only done one real project on Informix but I really enjoyed it. The Informix culture is great, all the DBAs seem to be former C programmers who happily write their own utilities in C where other DBAs would have used Perl or something.


Two of IBM/Informix's top engineering guys started at Informix itself in the 1980s and are still there now with IBM; one even left Google to return. I worked with both for years.


How is Informix doing?

Last time I used it, it was still an independent company and in all these years I thought it was gone.


It's been an IBM property for 15 years.


That I know.

The question was about how it's doing in the market, given that I never saw it deployed again.


Because DB2, like Watson, is an overall brand name for a suite of products. This is a standard IBM practice.


The timing of the post is interesting given Microsoft SQL Server 2017's new support for Linux. Microsoft's doing it with one code base to span Windows, Linux, and Azure SQL DB, and it's interesting to read why IBM took a different route.

My favorite line out of it: "so I ended up taking a long weekend and writing support for a primitive approach to supporting greater-than-2GB tables." Wow.


That line is especially great in the context of the preceding paragraph:

> At the time I was lead architect for the product and felt super-strongly that we needed to address the table size limitation of 2GB before we shipped. I was making that argument vociferously but the excellent counter argument was we were simply out of time. Any reasonable redesign would have delayed us significantly from our committed product ship dates. Estimates ranged from 9 to 12 months and many felt bigger slips were likely if we made changes of this magnitude to the storage engine.

>I still couldn’t live with the prospect of shipping a UNIX database product with this scaling limitation so I ended up taking a long weekend and writing support for a primitive approach to supporting greater-than-2GB tables.

It's amazing how often one person with a solid grasp of a problem can run circles around teams that are shackled by process and cya and politics.
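The post doesn't describe how that long-weekend hack worked, but the classic "primitive approach" to tables bigger than 2GB on that era's Unix systems was segmentation: 32-bit signed file offsets cap a single file at 2^31 bytes, so the storage engine chains several files and translates a logical table offset into a (segment, offset-within-segment) pair. A minimal sketch of that address translation (the names and constant here are illustrative, not from DB2's actual code):

```python
SEGMENT_SIZE = 2**31  # 2GB: the limit imposed by 32-bit signed file offsets

def locate(logical_offset):
    """Map a logical table offset to (segment_index, offset_in_segment).

    Segment N is just an ordinary file, e.g. 'table.dat.N', so the table
    can grow past 2GB without the OS ever seeing an offset that large.
    """
    return divmod(logical_offset, SEGMENT_SIZE)
```

The appeal of this scheme for a time-crunched release is that it leaves page layout, locking, and recovery untouched; only the lowest-level file I/O layer has to learn the extra indirection.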


I'm fairly sure DB2 for LUW (Linux/Unix/Windows) counts as a single code base. And I imagine that covers x86 and Power (and maybe SPARC? probably both big and little endian), with support for Solaris, AIX, etc.

It's the esoteric platforms like the mainframe or AS/400 that need custom code bases.


Db2 for z/OS will live on for the life of the IBM mainframe ...

This raises a question which is interesting to me (not knowing anything about mainframes).

How long is the life of the IBM mainframe going to be? Are there specific plans already in place to replace these systems on a known schedule? Or are they going to be around indefinitely?


I suggest they’ll be around indefinitely. While large customers are willing to spend large sums of money to maintain their systems and avoid the risks of new technology upgrades, IBM will continue to serve those customers.


I agree, mainframes will live forever. Companies are always coming up with new loads and ways to use them.

I (kiddingly) tell the people at work that I'm starting to teach my 6-year-old grandchild COBOL and CICS; they'll have lifetime employment.


My inclination is to agree, but I work for a bank and we're in the process of replacing all systems hosted on our mainframes. Whether we'll ever get there is another question, but that's the current road map.


Forever is such a long time. There are no new projects started on mainframes and there's a definite, if still quite flat, trajectory towards moving things off the mainframe. So eventually it will die, but it will be decades.


As we approach the limit (of zero) that tail will just continue to get longer and longer. And by "indefinite" I do not mean "forever." I mean that there's no definite date that the 'mainframe industry' will end — IBM hasn't published any roadmap to end their use, and many institutional users are happy making few changes to business operations.


New workloads do come to mainframes - but these days they tend to look like an easy way to virtualize a large number of Linux boxes.


Last I’d heard (it’s been a while) there was significant interest in running z/OS and the like on emulators, usually on Intel. I’d be interested to know how much of that goes on these days. Seems sensible (to ignorant me, at least) that mainframe hardware would disappear in the handful-of-decades timeframe but I can imagine the software living on indefinitely.


> plans already in place to replace these systems

I think a typical mainframe customer is more likely to just cease to exist due to random chance/bad business decisions/market shifting/stuff than it is to move off mainframes to another platform. We're talking about very long time spans and very stable systems here.


Inertia is a powerful thing. There are still companies using Unisys "mainframes", which are nothing more than emulators of old Univac, Sperry, and other platforms running on a Xeon, but metered at some ridiculous MIPS leasing rate.

