June 08, 2026
Explained: RAS on z/OS
A memory chip fails inside a running mainframe. In a typical server, this would be a crash: a blue screen, a kernel panic, an outage. On an IBM Z system, the error is detected, the affected memory is reconstructed from redundant data held elsewhere in the memory subsystem, and the repair is logged. The running workload continues without interruption. No operator is notified. No ticket is opened. The system fixed itself. This is not exceptional be… Read More
by Phee Jay
June 08, 2026
Explained: SMF Records
A batch job ran overnight and consumed three times its usual CPU. A CICS transaction that normally completes in 12 milliseconds is suddenly taking 200. A user account was used to access a sensitive dataset at 3am. A DB2 tablespace is approaching its storage limit. None of these events sent an alert. Nobody was watching at the time. But every one of them left a record. Because on z/OS, almost everything that happens is written down. That writing-do… Read More
by Phee Jay
June 08, 2026
Explained: z/OS Connect
A mobile banking app developer needs to display a customer's account summary. The data lives in a CICS COBOL program that has run the bank's core logic for thirty years. The developer knows REST and JSON. She has never heard of CICS, has no idea what a commarea is, and has no intention of learning. She needs an endpoint she can call with a standard HTTP GET request and get back JSON. The CICS web services approach covered in the previous … Read More
by Phee Jay
June 08, 2026
Explained: MQ on z/OS
A customer submits a mortgage application on a bank's website. The web server accepts it immediately and shows a confirmation. Behind the scenes, that application needs to pass through a fraud check, a credit bureau lookup, an underwriting system, and a document management platform. Each of those systems runs on different infrastructure, at different speeds, and is maintained by different teams. The web server cannot wait for all of them. It… Read More
by Phee Jay