hpc.social - Aggregated Personal Blog

ISC’24 recap

2024-05-28T05:24:00-06:00

I had the great pleasure of attending the ISC High Performance conference this month, marking the fifth time I've attended what has become one of my top must-attend industry conferences of the year. This year was particularly meaningful to me because it is the first time that:

I attended ISC as a Microsoft employee. This is also the first time I've attended any HPC conference since I changed my focus from storage into AI infrastructure.
I attended ISC in person since before the pandemic. It's also the first time I've visited Hamburg, which turned out to be an absolute delight.

Although registrations have been lower since the pandemic, this year's final registration count was over 3,400 attendees, and there was no shortage of old and new colleagues to bump into walking between the sessions at the beautiful Congress Center Hamburg.

This year’s theme was “Reinvent HPC,” and that idea—that HPC needs to reinvent itself—was pervasive throughout the program. The whole industry had been pulling towards exascale for the better part of a decade, and now that there are two exaflop systems on Top500 and the dust is settling, it feels like everyone is struggling to figure out what’s next. Is it quantum? AI?

It was difficult for me to draw a line through all the topics worth reviewing at this year's ISC, as it was a very dense four days packed with a variety of topics, discussions, vendors, and events. I only experienced a fraction of everything there was to be seen since so many interesting sessions overlapped, but I thought it might be worthwhile to share my perspective of the conference and encourage others to do the same.

Reinventing HPC (and blast those hyperscalers!)

The need to reinvent HPC was the prevailing theme of the conference from the very first session; with the listing of Aurora as the second system on Top500 to break the 1 exaflops barrier, the community is in search of a new milestone to drive research (and funding!). At the same time, commercial AI has risen rapidly as a largely independent, parallel effort, with a speed and scale that begs the question: how important was the decade-long drive to break the exaflops barrier if the AI industry could catch up so quickly without the help of the institutions that have historically posted the top HPL scores? If the commercial AI industry overtakes scientific computing as the world leader in deploying at scale, how can “HPC” be reinvented so it can continue to claim leadership in another dimension?

Kathy Yelick's opening keynote

ISC’s opening keynote was given by Kathy Yelick, who provided commentary on two recent government-commissioned reports on the future of HPC:

Charting a Path in a Shifting Technical and Geopolitical Landscape: Post-Exascale Computing for the National Nuclear Security Administration, commissioned by the National Academies
Can the United States Maintain Its Leadership in High-Performance Computing?, commissioned by the US Department of Energy’s Advanced Scientific Computing Research program

Living up to her reputation, Dr. Yelick’s talk was fast and insightful, describing the insatiable demand for computing driven by scientific research, the struggle to expose continuing amounts of parallelism to make use of newer processors, and some promising directions to address that disconnect. However, her talk started in a direction that I didn’t like when she went into describing the disruptors that necessitate reinventing HPC:

The above slide implied that AI, quantum, or cloud may pose an existential threat to the HPC community gathered at ISC this year; this immediately raised my hackles, as it cast the relationship between “HPC” and “AI”/“cloud” as having some sort of adversarial tension. As the talk went on, I realized that “HPC” didn’t really mean “high-performance computing” to her. Rather, it was used to refer to something much more narrowly scoped—high-performance computing to solve scientific problems. Slide after slide, the presentation kept doubling down on this idea that “HPC” as the audience knows it is being threatened. For example, Yelick talked through this slide:

The picture she painted is that “HPC” (denoted by companies with blue bars) no longer has influence over technology providers because the “hyperscalers” (green bars) have such an outsized amount of investment. She then used this to call on the audience to think about ways “we” could influence “them” to produce technologies that are useful for both scientific computing and low-precision AI workloads.

Her talk culminated in this slide:

Which was accompanied by this conclusion:

"So what’s a post-exascale strategy for the scientific community? It's the beat 'em or join 'em strategy. The beat 'em strategy says we’re going to design our own processors. [...] The join 'em strategy says let's leverage the AI hardware that's out there. [...] The sort of sneaky way of doing this is getting embedded in the AI community and trying to convince them that in order to make AI better for commercial AI applications, you really want to have certain features. Like don't throw away your 64-bit arithmetic and things like that."

I found myself getting increasingly unsettled through the keynote, because this "us versus them" mentality put me, a long-standing member of this HPC community, in the camp of "them." It was as if I was suddenly an outsider in a conference that I've been attending for years just because I no longer work for an organization that has been doing HPC since the early days of computing. Even though the clusters I support use the same NVIDIA and AMD GPUs, the same InfiniBand fabrics, and the same Lustre file systems that "HPC" uses, I am no longer in "HPC" because I am "hyperscale" or "cloud" or "AI."

The underlying message is one I get; GPUs are trending in a direction that favors massive gains in lower-precision computation over FP64 performance. And the cost of HBM is driving the overall value (in FP64 FLOPS per dollar) of accelerators backwards for the first time in the history of scientific computing. But the thesis that the scientific computing community needs to be sneaky to influence the hyperscale or AI players seemed way off the mark to me. What seemed absent was the recognition that many of the "hyperscalers" are her former coworkers and remain her colleagues, and "they" sit in the same audiences at the same conferences and share the same stages as the "HPC" community. All that is true because "HPC" is not somehow different than "cloud" or "AI" or "hyperscale." If there really is a desire to influence the hyperscale and AI industry, the first step should be to internalize that there is no "us" and "them."

Closing keynotes on the future

Just as the conference was opened with a talk about this "us versus them" mentality, it was closed with a talk about "us versus them" in a keynote session titled, "Reinventing HPC with Specialized Architectures and New Applications Workflows" which had two speakers followed by Q&A.

Chiplets for modular HPC

John Shalf gave one half of the closing keynote, where he gave his usual rally for investments in chiplets and specialized processors for HPC:

He gives a variant of this talk at every ISC, but this year he lasered in on this notion that the "HPC" community needs to do what the "hyperscalers" do and use chiplets to develop custom ASICs. It was an energetic and impassioned talk, but this notion that hyperscalers are already executing on his idea for the future sounded a little funny to me seeing as how I now work for one of these hyperscalers and his message didn't resonate.

If you really follow the money, as Shalf suggested, a huge amount of it is flowing into GPUs, not specialized processors. It wasn't clear to me what specialization he was thinking of when he referred to custom silicon being developed by the likes of Meta, Google, AWS, and Microsoft; it's true that these companies are developing their own silicon, but those efforts are largely addressing cost, risk, and supply, not improving performance beyond more general-purpose silicon like GPUs. And it turns out that a significant fraction of the (non-US) HPC community is already developing custom silicon for the same reasons as the hyperscalers; Japan, China, and Europe are all developing their own indigenous processors or accelerators for scientific computing at leadership scales. In that sense, Shalf was preaching to the choir given that, on the international stage, his government is the odd one out of the custom silicon game.

He also suggested a dichotomy where the HPC community would have to either (1) make every scientific problem an AI problem or (2) join this journey towards making domain-specific accelerators, ignoring the significant, unexplored runway offered by using mixed precision arithmetic in scientific applications. He called for partnering with hyperscalers, but his examples of implementing a RISC-V-based stencil accelerator and a SambaNova-based DFT processor didn't draw a clear line to the core missions of the large hyperscalers he extolled. He briefly said that partnering would benefit hyperscalers by addressing some capital cost challenges, but seeing as how the annual capital expenditures of the hyperscalers outstrip those of the US national HPC effort by orders of magnitude, I couldn't understand what the hyperscalers would stand to gain by partnering in this way.

Integrating HPC, AI, and workflows

Rosa Badia gave the second half of the closing keynote where she proposed ideas around complex scientific workflows and the novel requirements to support them. This talk felt a lot more familiar, as the focus was squarely on solving scientific computing challenges by connecting traditional HPC resources together in nontraditional ways using software whose focus goes beyond cranking out floating point arithmetic.

As she spoke, I couldn't help but see parallels between the challenges she presented and the sort of technologies we live and breathe every day in cloud services. For example, she showed this slide:

Dr. Badia obviously wanted to make a cloud tie-in by calling this "HPC Workflows as a Service," but what I'm not sure she realized is that this model almost exactly describes platform-as-a-service frameworks that already exist in commercial clouds. For example,

What she calls a "Data Catalog" is a public or private object storage account (a blob container, an S3 bucket) or a PaaS abstraction built atop them
What she calls a "Software Catalog" is a container registry (Azure Container Registry, Amazon Elastic Container Registry) or an abstraction built atop them
A "Workflow Description" is something like an AzureML pipeline or SageMaker pipeline
A "Workflow Registry" is just a GitHub repository containing pipelines
The "Portal" is the web UI provided by AzureML or SageMaker

I don't think there's anything truly new here; the challenges she described lie in wedging these workflows into HPC infrastructure which lacks the platform features like robust identity and access management (i.e., something better than LDAP that supports more modern authentication and authorization flows and finer-grained access controls) and data management (i.e., something better than a parallel file system that depends on POSIX users, groups, and permissions and implicit trust of clients).

She went on to describe a workflow data management system that reinvented a bunch of infrastructure that is already baked into commercial cloud object stores like Azure Blob and AWS S3:

As she was describing the requirements for such a workflow data management layer, it struck me that what the scientific data community calls "FAIR principles" are the same basic requirements for operating in commercial environments where data may be subject to strict privacy and compliance regulations. The notion of findable data may be aspirational for scientific datasets, but when a company is having to find datasets because it's being sued or subpoenaed, findability is a bare-minimum requirement for any data management system. Similarly, tracking the provenance of data may be a nice-to-have for scientific data, but it is a hard requirement when establishing a secure software supply chain. Cloud storage systems solved many of these challenges a long time ago, and I can't help but wonder if this idea that workflows in HPC pose a new set of challenges is another manifestation of "us" not realizing "they" might have done something useful and applicable for science.

Badia's final slide had a particularly poignant statement which read, "Systems can only be justified if we have applications that need them." I think she was trying to call for more investment in application development to exploit new systems, but I think the inverse is also true. If modern scientific applications truly require more complex orchestration of compute and data, maybe the scientific computing community should stop building computing platforms that make it really difficult to integrate different systems.

Again, "HPC" is not the opposite of "cloud;" it's not an either/or decision. There are technologies and tools that were designed from the beginning to simplify the secure connection of services and resources; they just weren't invented by the HPC community.

Top500 and Aurora

One of the cornerstones of ISC is the semiannual release of the Top500 list, and unlike at SC, the Top500 announcements and awards do not overlap with any other sessions, so it tends to have a higher profile and draw all attendees. This go-around, there were no dramatic changes in the Top 10; the new Alps system at CSCS was the only new entry, and the order of the top five systems remained the same. Notably, though, Aurora posted a significantly higher score than at SC'23 and broke through the exaflops barrier using 87% of the system, cementing its place as the second exascale system listed. But let's start at the top.

#1 - Frontier

Frontier at Oak Ridge remained #1, but it squeezed twelve more petaflops out of the same node count and is now just over 1.2 EF. Nothing groundbreaking, but it's clear evidence that ORNL is continuing to tune the performance of Frontier at full system scale.

#2 - Aurora

Aurora, on the other hand, finally eked over the exaflops line with 1.012 EF using 87% of the system's total 63,744 GPUs. Rick Stevens gave a short talk about the achievement which is summed up on this slide:

I was a little surprised by how honest Stevens was in this talk; the typical game that is played is that you stand up on stage, talk about how great of a partnership you had with your partners to realize this achievement, extol the virtues of the technologies on which your system was built, and talk about how this HPL score is just the start of a lot of great science.

Stevens didn't do that though.

He started out by telling the conference that Intel had bad product names, then explained that their low Graph500 and HPCG scores were the result of their exclusive focus on breaking the exaflops barrier with HPL, implying they didn't have time or ability to run Graph500 or HPCG at the same 87%-89% scale as their HPL and HPL-MxP runs. Based on this, it sounds like Aurora is still a ways away from being stable at scale, and we're unlikely to see any Gordon Bell-nominated papers at SC'24 this November.

After this session, folks seemed to relish dunking on Aurora; its window to be #1 is likely to have closed and it has some power efficiency issues. But I don't think anyone involved with the Aurora project needs to be told that; if what Stevens implied is true, the folks at ALCF, Intel, and HPE have been struggling for a long time now, and topping out over 10¹⁸ was a hard-sought, major milestone to be celebrated. The Aurora project has been thrown more curveballs than I would have ever guessed a single HPC project could have, so all parties deserve credit for sticking it out all this way rather than just walking away. With any luck, Aurora will stabilize in the next six months, and we'll see full-scale runs of Top500, Graph500, HPCG, and science apps by November.

#3 - Eagle

The third highest system on the list was Eagle, whose HPL score has not been updated since the system was first listed at SC'23 last year. Through a few twists of fate, I wound up being the person who accepted the award on-stage, and I now have a Top500 award for the #3 system sitting in my home office. Here's a photo of me goofing around with it:

It's not entirely inappropriate that I was the one to accept it since my teammates are the ones carrying pagers for the on-call rotation of that system, and we were also the hands-on-keyboard when that HPL run was conducted. Still, it was a bit surreal to walk on-stage to pick up such a noteworthy award immediately following two actually important people (both of whom have "director" in their titles) accepting the same award. By comparison, most of my career highlights to date have been just trolling HPC people on Twitter (as the esteemed Horst Simon actually said out loud as I was leaving the stage!)

It was weird.

That said, I take this to mean that it is now my duty to be the friendly face from Microsoft who can speak intelligently about the #3 system on Top500. To that end, I'll answer some questions that I was asked at ISC about the system and Azure HPC clusters in general below. None of this is new or secret information!

Why didn't you run HPL again and post a higher score to beat Aurora? Because the day after that HPL run completed, that system was put into production. Once systems are in production, people are paying to use them, and taking a time-out to re-run HPL costs a ton of money in either real dollars (if a customer runs it) or lost revenue (if the HPL run is blocking customer workloads). This is quite different from public-sector HPC systems which never have to pay for themselves.
Can I get access to Eagle for a Gordon Bell run or to test software? That's not really how it works. Whereas a traditional supercomputer might allow users to ssh in and submit jobs to a Slurm queue, cloud-based supercomputers allow users to deploy virtual machines through a REST API. Those virtual machines can allow ssh, run Slurm, and support MPI jobs like HPL, but that OS environment is managed by Azure users, not Azure itself. You can get a taste for what's required to run a basic MPI job by reading some instructions I wrote on provisioning an MPI cluster on Azure.
Is it just a bunch of GPU nodes scattered around a bunch of data centers? No, all the nodes on any given Azure HPC cluster (like Eagle) share an InfiniBand fabric. There are countless InfiniBand clusters in Azure, but each one is a real supercomputer by any definition of a supercomputer, and they are designed to run tightly coupled jobs across all their GPUs.
What parallel file system does it use? Don't think about it that way. You can provision a Lustre file system and mount that to any or all cluster nodes if you want to, or you can access data directly from object storage.
Are there any photos of it? You can see a photo of one of the Microsoft-designed nodes that comprise the system on my SC'23 recap blog post. Beyond that, there's not much to look at because Azure HPC clusters are not meant to be photogenic like, say, Cray supercomputers. There are no rack graphics (or even rack doors!). It's just tons and tons of air-cooled racks with InfiniBand optics coming out of each one. Maybe the only unique thing is that the racks are painted white instead of the typical black. Not sure why.

Getting back to that false separation between "HPC" and "cloud," Eagle is strong evidence that they aren't different. What the "hyperscalers" do is not that different from what traditional HPC centers do. Perhaps the biggest difference is that cloud supercomputers get all the benefits of cloud infrastructure: software-defined infrastructure such as virtual machines and virtual networking, integration with identity and access management that transcends simple Linux UIDs/GIDs, and the flexibility to integrate with whatever storage systems or ancillary services you want from any compute node.

Other notable tidbits

It is tradition for Erich Strohmaier to talk through some highlights and trends of the latest Top500 list every time a new one is announced, and in the past, I've been critical of how he's presented conclusions from the list with this implicit assumption that computers that never post to Top500 simply don't exist. This year felt different, because Dr. Strohmaier made the explicit statement that China has completely stopped submitting to Top500. Their exascale systems aren't listed, and no new Chinese systems have appeared at the bottom of the list in the past three years either. They simply don't play the game anymore, making it undeniable that Top500 is no longer an authoritative list.

Just as the whole conference's theme was reinventing HPC, I felt a sense that even the most stalwart proponents of Top500 are now recognizing the need to reinvent the Top500 list. Kathy Yelick said as much during her keynote ("Shall we replace Top500? What are the metrics in post-exascale computing that are important?"), and Erich implored the audience to help expand the HPL-MxP (formerly HPL-AI; an HPL-like benchmark that can use the mixed-precision capabilities of tensor cores) list. Nobody seems to know how to quantify what makes a leadership supercomputer nowadays, but accepting that HPL scores (or appearing on the Top500 list!) won't cut it is a good first step.

That all said, Top500 is still a valuable way to track technology trends in the industry. For example, this edition of the list is where NVIDIA's new Grace-Hopper node started appearing in force. The only new entrant in the Top 10 was the 270 PF GH200 component of CSCS's Alps system, and HPE had these EX254n GH200 blades on display on the show floor.

To HPE/Cray's credit, they seem to have gotten the system up and running with Slingshot without the delays that plagued early Cray EX systems like Frontier and Aurora. Hopefully this is a sign that the Cray EX platform and Slingshot-11 have graduated from being risky and not-quite-production-ready.

The other notable entrants on this year's Top500 are a trio of early MI300A APU-based Cray systems being built around the El Capitan program at Lawrence Livermore National Laboratory. This is a positive sign that MI300A is up and running at modest scale, and HPE also had one of these EX255a blades on display at their booth:

The strong showing of MI300A suggests that we may see El Capitan take the top spot in the next edition of the Top500 list coming in November.

Everyone is an AI expert!

Since I now work on a team responsible for AI infrastructure, I tried attending as many of the AI-focused talks and panels as I could this year. Unsurprisingly, these sessions largely carried the same undertones of "reinventing HPC," and speakers opined on how AI would affect scientific computing and offered examples of what their institutions were doing to extend their leadership in the HPC space into the AI space. There was a fair amount of grasping going on (as there always is when AI is discussed at non-AI conferences), but this year I was struck by how confused so many speakers and attendees were about concepts related to applying AI.

To be clear: I am no expert in AI. However, my day job requires that I be steeped in some of the largest AI training workloads on the largest AI supercomputers on the planet, and I have to have a cursory understanding of the latest model architectures and techniques to anticipate how future system designs will have to evolve. It's from this perspective that I made the following observation: there are a lot of HPC people speaking very confidently about AI based on an outdated understanding of the state of the art. The AI industry generally moves much faster than the government-funded research community, and I couldn't help but wonder if some community leaders assumed that the AI industry today is the same as it was the last time they wrote their AI grant proposal.

Of course, there were also some really insightful perspectives on AI for science shared as well. Let's talk through some examples of both.

The Exascale AI Synergies LLM Workflows BOF

This realization that the ISC community is not keeping up with the AI community first slapped me in the face when I ducked into a BOF session titled, "Tales of Exascales – AI and HPC Supercomputing Platforms Synergies for Large Language Models (LLMs) and Scientific Workflows." I sometimes wonder if the organizers who propose titles like that are intentionally creating word salad, but in this case, it was an apt session name; the discourse around HPC and AI was all over the board throughout the hour.

The session started on a strong, positive note by Simon McIntosh-Smith describing Bristol's new Isambard-AI system, a GH200-based Cray supercomputer funded under the broad charge of "AI research." While I'm usually skeptical of such nebulously defined "AI research" machines, Dr. McIntosh-Smith's description of the project quickly checked a bunch of boxes on how a real AI research platform should be developed. In particular,

Isambard-AI was developed and deployed at the pace of AI rather than HPC for scientific computing. Whereas government-funded, large-scale HPC systems typically take years to procure, Simon said that the first discussions started in August 2023, and in the nine months that followed, they had built the site, the team, and the system itself to the degree that a piece of the final system is already on Top500. By comparison, LLNL's El Capitan supercomputer also debuted on Top500 this month, but its contract was signed five years ago, and its procurement began at least two years before that. The AI industry would not exist if the systems it trains on took seven years to procure.

Isambard-AI deliberately avoided exotic AI accelerators to remain future-proof. Simon rightly pointed out that the AI industry moves too quickly to anticipate whether a bespoke AI accelerator would even be relevant to whatever the hottest model architecture will be in a year. GPUs were chosen because they are the most flexible way to accelerate the widest range of AI workloads, regardless of whether they involve dense models, sparse models, inferencing, or training, and at whatever level of quantization makes sense. The reality is that cutting-edge research is done on GPUs, so aligning an AI supercomputer on the same technology will ensure that the algorithms developed by industry are immediately usable for scientific research.

A reasonable definition of "AI for science" was established from the outset. Rather than blurting out "we need to research AI!" and asking for a sack of money to buy GPUs, Simon outlined a vision of training AI models using data generated by physical simulation on a more conventional HPC system. Training models on models to create surrogate models is not particularly new, but it does establish a few reasonable architectural decisions such as having a robust data management and sharing platform, close coupling to the HPC system performing simulation, and aligning software stacks and programming environments as closely as possible.

Simon's contribution to the discussion stood out to me as the most impressive, and the discourse that followed seemed to fall into a trap of familiarity. Rather than focusing on the new and exciting prospects of AI, some panelists and audience members wanted to focus on the aspects of AI they understood. For example, an uncomfortable amount of time was spent on a back-and-forth on how HPC centers can support Kubernetes and random I/O (which is what defines AI vs. HPC?) instead of Slurm and Lustre. If your biggest challenge in delivering infrastructure to support AI workloads is figuring out how to deploy both Kubernetes and Slurm, you haven’t even reached the starting line. This is a trivial issue in cloud environments, where entire AI clusters can be built up and torn down in minutes. Again, this is evidence that the scientific computing community isn’t ready to keep pace with the AI industry.

I jotted down a few of the questions and comments that I heard during this BOF that seem to reflect the level of familiarity the average ISC attendee has with AI:

"Would be nice if there were more models for science." I wasn't sure what this means. All the leading LLMs are pretty good at "science," and domain-specific models aren't readily transferable between different science domains or problems.
Scientific problems "have to validate outputs for correctness, unlike LLMs." I think the speaker was making a sidelong reference to hallucinations, but like with any model (large language or physics-based), validating outputs for correctness is certainly necessary and readily possible.
"The demands of inference of LLMs are completely different from those for training. How do you buy inference infrastructure?" I wonder where this notion came from. If your infrastructure can train a model, it can definitely inference that model. Cost-optimizing infrastructure for inferencing is a separate matter (you can cut corners for inferencing that you wouldn't want to cut for training), as is building the service infrastructure around inferencing to deliver inferencing as a service. But I don't think that's what this question was about.
"Working safely with sensitive data / isolating workloads on big shared clusters." This is a problem that arises only when you try to wedge AI workloads into infrastructure designed for traditional physics-based simulation. If you have sensitive data, don't use big shared clusters. Provision separate clusters for each security domain on a shared, zero-trust infrastructure.
"How different are the files and filesystem access while training for LLMs, image generation models, reinforcement learning?" This question reflects a general misunderstanding of data and storage in HPC overall; how data is organized into files and how that data is accessed by a workload is an arbitrary decision made by the application developer. You can organize piles of text into one giant file or a million little files.
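To make that last point concrete, here is a minimal Python sketch of the same pile of text stored two ways: packed into one file with an offset index, and spread across one file per record. The file names, record contents, and helper functions are all made up for illustration and do not come from any real training framework. The workload reads back identical records either way, so the file layout is a choice made by the application developer rather than a property of the model being trained.

```python
import tempfile
from pathlib import Path


def write_one_file(records, path):
    """Pack every record into a single file; return an (offset, length) index."""
    index = []
    with open(path, "wb") as f:
        for text in records:
            data = text.encode("utf-8")
            index.append((f.tell(), len(data)))
            f.write(data)
    return index


def read_one_file(path, index, i):
    """Fetch record i from the packed file with a seek and a bounded read."""
    offset, length = index[i]
    with open(path, "rb") as f:
        f.seek(offset)
        return f.read(length).decode("utf-8")


def write_many_files(records, directory):
    """Store each record in its own small file (hypothetical naming scheme)."""
    directory.mkdir(parents=True, exist_ok=True)
    for i, text in enumerate(records):
        (directory / f"{i:08d}.txt").write_text(text, encoding="utf-8")


def read_many_files(directory, i):
    """Fetch record i by opening its individual file."""
    return (directory / f"{i:08d}.txt").read_text(encoding="utf-8")


# Illustrative stand-in for a training corpus.
records = [f"document {i}: some training text" for i in range(1000)]

with tempfile.TemporaryDirectory() as tmp:
    tmp = Path(tmp)
    index = write_one_file(records, tmp / "corpus.bin")
    write_many_files(records, tmp / "corpus")
    # Both layouts hand back exactly the same records to the workload.
    same = all(
        read_one_file(tmp / "corpus.bin", index, i)
        == read_many_files(tmp / "corpus", i)
        == records[i]
        for i in range(len(records))
    )

print(same)
```

What differs between the two layouts is the I/O pattern the storage system sees (one large file with seeks versus many small opens), and that is decided by whoever wrote the data loader.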

There were a few questions that came up that touched on deeper issues on which the HPC community should reflect:

"What are the first steps for scientific groups wanting to get ready for using AI in the future?" This is probably the purest question raised in the entire session, and I think this is something the scientific computing community as a whole needs to figure out. What does "using AI" really mean for scientific groups? Is it training models? Fine-tuning models? Inferencing using pre-trained models on HPC infrastructure? Is it integrating simulation applications with separately managed inferencing services? Who manages those inferencing services? Does inferencing even require HPC resources, or can suitable models run on a few CPU cores? I think the first step to answering this question is ensuring that the scientific computing community reaches a common baseline level of understanding of what "using AI" means. And a lot of that probably means ignoring what some self-professed AI experts in the HPC community claim is the future.
"Care to predict what that ChatGPT moment will be for AI for Science? Had it already happened?" This question was addressed directly by panelist Séverine Habert who rightly pointed out that the ChatGPT moment occurred when a complex and esoteric topic was suddenly put in the hands of hundreds of millions of laypeople across the world. It was the moment that the common person walking on the street could suddenly interact with the most cutting-edge technology that had been previously understandable only to the headiest of researchers in industry and academia. That will likely never happen in AI for science because science, by definition, requires a higher baseline of education and understanding than the average layperson has.
"How to effectively train the existing workforce when we are already struggling to retain talent in research/academia?" This question strikes at the same theme that Kathy Yelick's opening keynote confronted: what is the role of the scientific computing community now that it turns out that you don't need decades of institutional experience to deploy and use HPC resources at leadership scale? As offensive as it may sound, perhaps the public-sector HPC community should accept that their role is not training future researchers and academics, but training future practitioners of AI in industry. This is how the wider tech industry generally works; neither startups nor tech giants make hires assuming those people will still be around in ten years. Why does the public-sector HPC industry think otherwise?

Finally, I was also struck by how fiercely the discourse clung to the idea that large language models are the answer to all AI problems in science. I get that this panel was focused on exascale, and LLM training is one of the rare cases where AI requires exascale computing capabilities. But there was no acknowledgment that trillion-parameter models are not actually a good idea for most scientific applications.

AI Systems for Science and Zettascale

This singular focus on creating massive LLMs for science was front-and-center in a talk given by Rick Stevens titled "The Decade Ahead: Building Frontier AI Systems for Science and the Path to Zettascale." The overall thesis that I heard was something like...

Science needs its own trillion-parameter foundation models
Training trillion-parameter foundation models requires a lot of GPUs
We need $25 billion from the U.S. government

However, Stevens never answered a very basic question: what does a foundation model for science do that any other foundation model cannot do?

He showed slides like this, which really don't sound like foundation models for science as much as generic AI assistants:

Is the scientific computing HPC community really the most qualified bunch to reinvent what existing foundation models like GPT-4 or Claude 3 have already done? Even if you argue that these proprietary models aren't as good at "science" as they could be, who would have a better chance of addressing this with a billion dollars of federal funding: the companies who developed GPT or Claude, or a collection of government scientists starting from scratch?

I think the answer to this question was in other parts of Stevens' talk. For example, he started with this slide:

While robust requirements are good when there's no urgency, this slide is also a tacit admission that the government takes years to generate a perspective on AI. Do you think the creators of Llama-3 or Mistral Large gathered wide community input from over 1,300 researchers before deciding to build a supercomputer and train a model? Even if science needs its own foundation models, this slide is strong evidence that, by the time the scientific HPC community agrees on a path forward, that path will be years out of date relative to what the commercial AI industry is doing.

A great example of this already happening is the basic premise that creating a foundation model with a trillion parameters is the best way to apply AI to solve science problems. This certainly was the leading thought two years ago, when transformer scaling laws were published that suggested that the best way to get better-performing LLMs was to simply add more parameters to your transformer and train on more data. But there's a reason all the leading models have stopped advertising how many parameters they use.

Dealing with massive transformers is really expensive. They're not only really expensive to train, but they're really expensive to use for inferencing too. This has led to a bunch of innovation to develop model architectures and approaches to training that result in dramatically higher quality outputs from a fixed parameter count. Dense transformer architectures with a trillion parameters have become the blunt instrument in developing foundation models since 2022, so it took me by surprise to hear Stevens put so much stock into this notion that the need for a trillion-parameter model is essential for science.

To repeat myself, I am no expert in AI. I've never been called in front of Congress to talk about AI or been invited to give talks on the topic at ISC. There might be something basic that I am missing here. But when I look at the science drivers for AI:

I know that you do not need to train your own trillion-parameter model to do most of this stuff. Even the use cases that do require generative AI, like code generation and math theory, don't actually require trillions of parameters. Small language models, such as that described in Textbooks Are All You Need (published in 2023, after the reports Stevens cited in his talk), can produce amazing results with very small models when you train them using high-quality data instead of garbage from Reddit. And when you create or fine-tune a small language model for a specific science domain, not only do you save yourself from having to buy a billion-dollar supercomputer for training, but you get a model that is much more accessible to scientists around the world because they won't need a million dollars' worth of GPUs to inference with it.

So, if there's one question that was never answered across any of the AI-themed sessions at ISC this year, it is this: Why does science need to train its own large language models? My intuition is that either fine-tuning existing large language models or training small language models for domain-specific applications would be a better investment in actually advancing science. However, if we cynically assume the real goal of LLMs-for-science is to justify buying massive GPU systems, suddenly a lot of the talks given at ISC on this topic make a lot more sense.

Real applications of generative AI for science

As frustrated as I got sitting through sessions on AI where it sometimes felt like the blind leading the blind, there was one really good session on actual applications of generative AI for science.

Mohamed Wahib of RIKEN gave an insightful presentation on the unique challenges of using generative AI in science. His summary slide touched on a lot of the key challenges:

And his actual talk focused largely on the model and data aspects of generative AI. What struck me is that the challenges he described reflected the experience of someone who has actually tried to do what many other AI experts at the conference were claiming would be the future. For example,

He recognized the importance of training scientific models with high-quality datasets, not just garbage scraped off of social media. This means not only scraping or generating high quality data for training, but curating and attributing that data and applying reinforcement learning with human feedback as the model is being trained. This is uniquely challenging when creating models for scientific applications, as managing the quality of scientific data requires deep domain expertise. This contrasts with a generic chat bot whose inputs and outputs can often be assessed by anyone with a basic education.
He also talked about the tendency of scientific data to be highly multimodal and multidimensional. Whereas multimodal chatbots may combine text and vision, scientific data often contains observations of the same phenomenon from many different sensors (for example, pressure, temperature, density, strain fields, ...), and the output of a generative model for science may require multiple modalities as well. These capabilities are not well developed in LLMs designed for human language.
Dr. Wahib also pointed out that scientific datasets tend to be huge compared to text and images, and this may require developing ways for models to have context windows that can fit multi-petabyte datasets' tokens to identify long-range correlations. Relatedly, he also pointed out that tokenization of scientific data is a new set of challenges unique to this community, since industry has been focused on tokenizing low-dimensional data such as text, audio, and images.

The good news is that industry's quest towards both commercializing generative AI and achieving AGI will touch on some of these challenges soon. For example, training domain-specific models using high-quality datasets is an essential component of the small language models I described in the previous section, and these small language models are what will enable privacy-preserving and cost-effective generative AI on laptops and phones. Effectively infinite context windows are also a major hurdle on the path to AGI, as industry is hard at work developing AI agents that can remember every conversation you've ever had with them. Finding more scalable approaches to attention that do not sacrifice accuracy is a part of this.

François Lanusse, currently at the Flatiron Institute, also gave a nice presentation that clearly explained how generative AI can be used to solve inverse problems—that is, figuring out the causes or conditions that resulted in a collection of measurements. A precise example he used applied generative AI to figure out what an image distorted by gravitational lensing might look like in the absence of those distortions. As I understood it, he trained a diffusion model to understand the relationship between images that are affected by gravitational lensing and the masses that cause lensing through simulation. He then used that model instead of an oversimplified Gaussian model as part of a larger method to solve the inverse problem of un-distorting the image.

The details of exactly what he did were a little over my head, but the key insight for me is that combining generative AI and science in practice is not as straightforward as asking ChatGPT what the undistorted version of a telescope image is. Rather, almost all of the standard, science-informed approach to solving the inverse problem remained the same; the role of generative AI was simply to replace an oversimplified part of the iterative process (the Annealed Hamiltonian Monte Carlo method) to help it converge on better answers. It really is a combination of simulation and AI, rather than an outright substitution or surrogate model.

Dr. Lanusse also showed this slide which demonstrated how this approach can be generalized to other scientific domains:

The general approach of pretraining, fine-tuning ("adapt"), and combining foundation models with other physics-based models seems reasonable, although I admit I have a difficult time wrapping my head around exactly how broadly scoped he envisions any given pretrained foundation model to be. I can see such a model trained on extensive sky survey data being useful for a number of astrophysical and cosmological tasks, but it's less clear to me how such a model might be useful in unrelated domains like, say, genomics.

You might also ask why I think this vision of foundation models for science is reasonable while Rick Stevens' vision didn't ring true; the difference is in scale! The foundation models cited on Lanusse's slide are vision transformers which have many orders of magnitude fewer parameters than the trillion-parameter models that others talk about. Whereas a trillion-parameter model might need to be distributed over dozens of H100 GPUs just to produce one inference result, the largest of the vision transformers can probably be squeezed on to a single high-end desktop GPU. Again, you don't need billion-dollar supercomputers to train these models for science.
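To put rough numbers on that difference in scale, here is a back-of-the-envelope sketch. It assumes 16-bit weights and 80 GB of memory per GPU, and it uses a hypothetical one-billion-parameter model as a stand-in for a large vision transformer; none of these figures come from the talks. It counts only the memory needed to hold the weights, so real deployments would need more.

```python
def gpus_for_weights(n_params, bytes_per_param=2, gpu_mem_bytes=80 * 10**9):
    """Minimum number of GPUs needed just to hold the model weights in memory."""
    weight_bytes = n_params * bytes_per_param
    # Integer ceiling division: round up to a whole number of GPUs.
    return -(-weight_bytes // gpu_mem_bytes)

# Assumption: 16-bit weights, 80 GB per GPU, weights only (no activations or caches).
trillion_param_gpus = gpus_for_weights(10**12)  # 2 TB of weights
billion_param_gpus = gpus_for_weights(10**9)    # 2 GB of weights (hypothetical model size)

print(trillion_param_gpus, billion_param_gpus)  # 25 1
```

Even under these generous assumptions, the trillion-parameter case needs 25 GPUs before any working memory is counted, which is consistent with the "dozens" estimate above.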

Frank Noé from Microsoft Research then talked about how generative AI can be applied to solve problems in simulating biological systems. Like the talk before his, Dr. Noé followed this pattern where a larger, physics-based framework had one statistical technique replaced by a method based on generative AI, and then a physics-based model is used to quantify the likelihood that the result is reasonable. He contrasted this with conventional approaches (to, say, protein folding) where you just simulate for really long times in the hopes that your simulation randomly wanders into a situation where you capture a rare event.

His talk wasn't about generative AI as much as the previous speakers, but he offered a litany of ways in which AI models can be useful to molecular modeling:

Markov state models provide a statistical framework that lets you replace one long simulation (that hopefully captures every possible scenario) with a bunch of short, chopped-up simulations that hopefully capture every possible scenario in parallel. He cited an example that took 20,000 GPU-days on V100 GPUs that would've otherwise taken a million GPU-years if done in one long simulation.
Coarse-grained models use machine learning to develop surrogate models to simulate the physics of relatively uninteresting parts of molecular systems. The example he used was simulating the water molecules surrounding a biomolecule; water can be very difficult to accurately model, and the example he cited led to a surrogate model that was 100x faster than directly simulating water molecules.
Boltzmann generators can generate 3D molecular structures based on a known probability distribution defined by the energy states of the system. This is another fast way to find rare but stable molecular configurations without having to throw darts at a dartboard.
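To make the Markov state model idea a little more concrete, here is a minimal sketch of the core bookkeeping: run many short trajectories, count the transitions between discrete states, and recover long-time behavior from the estimated transition matrix. The three-state system, its transition probabilities, and the function names are all hypothetical; this is a toy illustration and not the method or software Dr. Noé presented.

```python
import random

def simulate(start, steps, T, rng):
    """Generate one short trajectory of discrete states from transition matrix T."""
    traj = [start]
    for _ in range(steps):
        row = T[traj[-1]]
        traj.append(rng.choices(range(len(row)), weights=row)[0])
    return traj

def estimate_transition_matrix(trajs, n_states):
    """Count transitions across all trajectories, then row-normalize.

    Assumes every state is visited at least once as a transition source.
    """
    counts = [[0] * n_states for _ in range(n_states)]
    for traj in trajs:
        for a, b in zip(traj, traj[1:]):
            counts[a][b] += 1
    return [[c / sum(row) for c in row] for row in counts]

def stationary(T, iters=5000):
    """Approximate the long-time state populations by power iteration."""
    n = len(T)
    p = [1.0 / n] * n
    for _ in range(iters):
        p = [sum(p[i] * T[i][j] for i in range(n)) for j in range(n)]
    return p

# Hypothetical "true" dynamics; state 2 is rarely visited.
T_true = [[0.98, 0.02, 0.00],
          [0.05, 0.94, 0.01],
          [0.00, 0.10, 0.90]]

rng = random.Random(0)
# 300 independent short runs (100 seeded in each state) instead of one long one.
trajs = [simulate(s, 200, T_true, rng) for s in range(3) for _ in range(100)]
T_est = estimate_transition_matrix(trajs, 3)
pi = stationary(T_est)
print([round(p, 3) for p in pi])
```

Because each short trajectory is independent, all 300 could run at the same time on different GPUs, and seeding some of them in the rare state means it gets sampled without waiting for a single long run to wander there.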

What struck me is that, in all these cases, the AI model is never generating results that are blindly trusted. Instead, they generate molecular configurations which are then fed into physics-based models which can quantify how likely they are to be valid.
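One common way to express this "generate, then check against physics" pattern is importance reweighting, sketched below under heavy simplification. A biased Gaussian stands in for the generative model, and a one-dimensional double-well potential stands in for the physics-based energy; both are assumptions chosen for illustration and are not taken from the talk.

```python
import math
import random

def energy(x):
    """Toy double-well potential in units of kT, with minima at x = -1 and x = +1."""
    return (x * x - 1.0) ** 2

def proposal_pdf(x, mu=0.5, sigma=1.0):
    """Density of the stand-in 'generator': a Gaussian biased toward the right well."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

rng = random.Random(0)
samples = [rng.gauss(0.5, 1.0) for _ in range(20000)]

# Physics check: weight each generated sample by its Boltzmann factor
# relative to how likely the generator was to propose it.
weights = [math.exp(-energy(x)) / proposal_pdf(x) for x in samples]

# Fraction of samples in the right-hand well, before and after reweighting.
raw_right = sum(x > 0 for x in samples) / len(samples)
reweighted_right = sum(w for x, w in zip(samples, weights) if x > 0) / sum(weights)

print(round(raw_right, 2), round(reweighted_right, 2))
```

The potential is symmetric, so the physically correct answer is that half the population sits in each well. The generator on its own overstates the right-hand well, and the energy-based weights correct for that bias.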

Both Lanusse's and Noé's examples of combining AI and simulation painted a picture to me where generative AI can be really useful in solving problems where a researcher would otherwise have to make educated guesses about what physical phenomenon is really happening based on incomplete information. So long as there is a way to apply a physics-based model to check the accuracy of each guess, generative AI can be trained to predict the relationships between incomplete information and what's really going on and get to probable answers much faster than relying on physics alone.

More broadly, I couldn't help but think about the Sora video showing pirate ships battling in a cup of coffee as I left this session. Like that video, these talks demonstrated that it's possible to train generative AI models to reproduce physical phenomena (like the fluid dynamics of coffee) without explicitly embedding any laws of physics (like the Navier-Stokes equations) into the model itself and still get really compelling results. The part of this that was lacking from the Sora video—but was present in these talks—was closing the loop between generated results and the laws of physics by feeding those generated results back into the laws of physics to figure out if they are probable.

High Performance Software Foundation

ISC'24 wasn't all about AI though! I wound up attending the launch of the High Performance Software Foundation (HPSF), a new Linux Foundation effort spearheaded by Todd Gamblin and Christian Trott (from Livermore and Sandia, respectively) aimed at promoting the sustainability of the software packages relied upon within the high-performance computing community.

I haven't paid close attention to HPC software in a long time since most of my work was in platform architecture and storage systems, so a lot of the background context remains a little murky to me. That said, it seems like HPSF was formed to be like the Cloud Native Computing Foundation for the HPC community in that:

it will serve as a neutral home for software projects that aren't tied to any single university or government institution
it provides mechanisms to ensure that critical HPC software can continue to be maintained if its original author gets hit by a bus
it will help with the marketing and promotion of HPC software

Its governance seems pretty reasonable, with different levels of membership being accompanied by different levels of rights and obligations:

There is a Governing Board comprised of paying members (and predominantly those who pay the most), while the Technical Advisory Council carries out the more technical tasks of forming working groups and onboarding projects.

There are three levels of membership, and the highest (premier) has a $175,000 per year buy-in and comes with a seat on the Governing Board. Right now, the founding seats are held by AWS, HPE, LLNL, and Sandia.

Below that is a general membership tier whose cost is on a sliding scale based on the organization size, and AMD, Intel, NVIDIA, Kitware, ORNL, LANL, and Argonne have all committed at this level. The associate tier is below that, and it is free to nonprofits but comes with no voting rights.

It seemed like the exact functions that HPSF will have beyond this governing structure are not fully baked yet, though there were six "prospective" working groups that provide a general scope of what the HPSF will be doing:

My read of the description of these working groups is that

CI/testing will supply resources (GPUs) on which HPSF projects' code can be automatically tested.
Software stacks will maintain E4S.
User engagement sounds like it will figure out what users of HPSF projects' software are looking for. It sounds like this will provide some product management-like support for projects.
Facility engagement is probably like user engagement, but for the sites deploying code on behalf of their users. Again, this sounds like product management functions.
Security sounded like stewarding SBOM-like stuff for member projects' software.
Benchmarking would make a framework for benchmarking HPC applications.

That all said, it still wasn't clear what exactly HPSF would do; what would all those membership dues go towards supporting? Based on some Q&A during this BOF and follow-up afterwards, I pieced together the following:

HPSF will not be funding developers, much in the same way that OpenSFS doesn't fund Lustre development. That said, Todd Gamblin later said that not funding software development was a financial constraint more than a policy one, with the implication that if more members join, there may be opportunity for HPSF to fund projects.
HPSF likely will be hosting events and conferences (perhaps like the CNCF hosts KubeCon), providing scholarships, developing and providing training related to member projects, and "increasing collaboration" (whatever that may mean!).

HPSF also has some influence and ownership over its member projects:

HPSF will co-own its projects' GitHub repos to ensure continuity in case the other repo owner abandons it.
HPSF will own the domain for the project for the same reasons as above.
Member projects still manage their own software development, roadmaps, releases, and the like. The HPSF won't dictate the technical direction of projects.
HPSF will own the trademark and logos of its member projects so it can prevent corporations from profiting off of repackaging products without respecting trademark.

This establishes an interesting new direction for the sorts of software projects that are likely to become member projects. Historically, such projects developed by the member organizations (i.e., DOE labs) have been wholly controlled by the labs that funded the work, and those software projects lived and died at the whims of the government funding. The HPSF offers a new vehicle for software projects to live on beyond the end of the grants that created them, but at the same time, it requires that the DOE surrender control of the work that it sponsored.

I left the session still wondering a few pretty major things, likely borne out of my own ignorance of how similar organizations (like CNCF or the Apache Foundation) work:

How does a software project actually become a member project? The HPSF folks said that the Technical Advisory Council onboards new projects, but what is the bar if I have an open-source project used by the community that I no longer want to maintain myself? I assume it's not a pay-to-play arrangement since that defeats the purpose of sustaining software after its seed funding runs out.
What do stakeholders actually get out of joining HPSF? I see obvious value for organizations (like the DOE labs) who develop open-source software but may not want to be exclusively responsible for sustaining it forever. But would an HPC facility get any obvious benefit from joining and paying dues if it is simply a consumer of member projects' software? What does a cloud vendor like AWS get by being a premier member? Is HPSF just a way to get someone else to cover the overheads of maintaining open-source software that comes out of, say, R&D organizations rather than product organizations?

Hopefully the answers to these questions become clearer as the foundation gets off the ground and we get to see what member organizations contribute under the HPSF banner.

Ultimately though, I see this as a really positive direction for the HPC software community that might help resolve some uncertainty around key pieces of HPC software that have uncertain ownership. For example, I wound up as a maintainer of the IOR and mdtest benchmark because I was the last one to touch it when its previous maintainer lost interest/funding. I don't even work in I/O performance anymore, but the community still uses this benchmark in virtually every procurement of parallel file systems either directly or through IO500. It would be wonderful if such an important tool didn't rest on my shoulders and had a more concrete governance structure given how important it is.

Quantum computing

Besides AI and cloud, quantum computing was cited in Kathy Yelick's opening keynote as the third disruptor to HPC for scientific computing. At the time, I thought citing quantum was just an obligation of any opening keynote speaker, but quantum computing was particularly high-profile at ISC this year. I was surprised to see over a dozen quantum computing companies on the vendor exhibition floor, many of whom were Europe-based startups.

In addition, this year's Hans Meuer award (for best research paper) was given to a paper on quantum computing by Camps et al. This is particularly notable since this is the first time that the Meuer award has ever been given to a paper on a topic that isn't some hardcore traditional HPC like MPI or OpenMP advancements; by comparison, this award has never been given to any papers on AI topics. Granted, the winning paper was specifically about how to use conventional HPC to solve quantum problems, but this recognition of research in quantum computing makes a powerful statement: quantum computing research is high-performance computing research.

Reinvent HPC to include urgent computing?

I was invited to give a lightning talk at the Workshop on Interactive and Urgent High-Performance Computing on Thursday, and urgent/interactive HPC is not something I'd really paid attention to in the past. So as not to sound like an ignorant fool going into that workshop, I opted to sit in on a focus session titled "Urgent Computing" on Tuesday. I had two goals:

Make sure I understood the HPC problems that fall under urgent and interactive computing so I could hold an intelligent conversation on this topic at the Thursday workshop, and
See if there are any opportunities for cloud HPC to provide unique value to the challenges faced by folks working in urgent HPC

I'll describe what I came away with through these lenses.

The Urgent Computing focus session

What I learned from the focus session is that urgent computing is not a very well-defined set of application areas and challenges. Rather, it's another manifestation of reinventing HPC to include any kind of computation for scientific purposes.

Much to my surprise, this "Urgent Computing" focus session was actually a session on IoT and edge computing for science. Several speakers spoke about getting data from edge sensors on drones or telephone poles into some centralized location for lightweight data analysis, and the "urgent" part of the problem came from the hypothetical use cases of analyzing this sensor data to respond to natural disasters. There wasn't much mention of anything requiring HPC-like computing resources; at best, a few talks made unclear references to using AI models for data analysis, but it felt like grasping:

The above conclusion slide was presented by one of the speakers, and to be honest, I don't understand what any of it means. Granted, I know very little about urgent computing, IoT, or edge computing so there may be some domain jargon here that's throwing me off. But based on this, as someone working in the area of HPC and AI in the cloud, I don't think I have a role to play here. I'm sure cloud computing can help, but the challenges would be in general-purpose cloud rather than HPC.

The Interactive and Urgent HPC workshop

Fortunately for me, the Thursday workshop on Interactive and Urgent HPC was much less about edge/IoT and more about developing software infrastructure and workflows that allow scientific data analysis of large datasets to happen before the results become obsolete. It was a fascinating workshop for learning about specific science drivers that require fast access to HPC resources, and how different HPC providers are enabling that through non-traditional services and policies. Below are a few highlights.

Sam Welborn (NERSC) presented his team's efforts to convert a streaming data workflow from its current file-based approach into one that streamed directly into compute node memory. The specific use case was the initial data processing for image information coming off of a scanning transmission electron microscope at 480 Gbps, totaling 750 GB per shot. As he described it, the current technique involves streaming those data to files at the microscope, then copying those files to the parallel file system of a remote supercomputer, then reading, processing, and writing that data within the HPC environment to prepare it for downstream analysis tasks. And for what it's worth, this is how I've always seen "streaming" HPC workflows actually work; they're actually using file transfers, and the performance of the file systems at both the source and the destination is in the critical path.
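As a rough sanity check on those numbers, the arithmetic below works out how long one shot lasts at the quoted data rate and how much file I/O the file-based path implies. It assumes decimal gigabytes and a sustained line rate, and the count of four storage passes is my own reading of the steps described above rather than a figure from the talk.

```python
def seconds_to_move(n_bytes, link_bits_per_s):
    """Time to move n_bytes over a link running at the given bit rate."""
    return n_bytes * 8 / link_bits_per_s

SHOT_BYTES = 750 * 10**9   # 750 GB per shot (decimal GB assumed)
LINK_BPS = 480 * 10**9     # 480 Gbps coming off the microscope

shot_seconds = seconds_to_move(SHOT_BYTES, LINK_BPS)

# File-based path as described: write at the microscope, read it back to copy,
# write to the remote parallel file system, then read it again to process.
FILE_PASSES = 4  # assumption: one full pass over the data per step
file_io_bytes = FILE_PASSES * SHOT_BYTES

print(f"{shot_seconds} s per shot at line rate")          # 12.5 s per shot at line rate
print(f"{file_io_bytes / 1e12} TB of file I/O per shot")  # 3.0 TB of file I/O per shot
```

In other words, each 12.5-second burst of data turns into roughly 3 TB of reads and writes against two different file systems before any processing output is written.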

The problem with this approach is that parallel file systems on HPC systems tend to be super flaky, and there's no real reason to bounce data through a storage system if you're just going to pick it up and process it. So, Dr. Welborn showed a true streaming workflow that skipped this file step and used ZeroMQ push sockets at the microscope and pull sockets on the HPC compute nodes to do a direct memory-to-memory transfer:

Seeing software like ZeroMQ used to enable communication in an HPC environment instead of forcing this workflow to fit into the MPI paradigm is an encouraging sign in my eyes. ZeroMQ, despite not using purpose-built HPC technology like RDMA, is the right tool for this sort of job since it supports much better resilience characteristics than messaging libraries designed for tightly coupled HPC jobs. Workflows like this that combine beefy GPU nodes with software developed in the commercial tech space suggest that the world of HPC is willing to abandon not-invented-here ideology.
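The shape of a memory-to-memory push/pull pipeline can be sketched with nothing but Python's standard library. To be clear, this uses a plain connected socket pair as a stand-in and is not ZeroMQ or the NERSC implementation; the framing format, frame sizes, and function names are all assumptions made for illustration. A real ZeroMQ PUSH/PULL pair handles message framing itself and adds queuing and reconnection on top.

```python
import socket
import struct
import threading

def recv_exact(sock, n):
    """Read exactly n bytes from a stream socket (fewer only if the peer closes)."""
    buf = bytearray()
    while len(buf) < n:
        chunk = sock.recv(n - len(buf))
        if not chunk:
            break
        buf.extend(chunk)
    return bytes(buf)

def producer(sock, frames):
    """'Microscope' side: push length-prefixed frames, then close the socket."""
    with sock:
        for frame in frames:
            sock.sendall(struct.pack("!I", len(frame)) + frame)

def consumer(sock):
    """'Compute node' side: pull frames straight into memory, no files involved."""
    received = []
    with sock:
        while True:
            header = recv_exact(sock, 4)
            if len(header) < 4:  # producer closed the connection
                break
            (size,) = struct.unpack("!I", header)
            received.append(recv_exact(sock, size))
    return received

# Eight small fake detector frames; real frames would be far larger.
frames = [bytes([i]) * 1024 for i in range(8)]

push_end, pull_end = socket.socketpair()
sender = threading.Thread(target=producer, args=(push_end, frames))
sender.start()
received = consumer(pull_end)
sender.join()

print(len(received), "frames received in memory")
```

The point of the sketch is what is absent: at no stage does a frame touch a file system on either side, so storage performance drops out of the critical path.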

It wasn't clear to me that there's a great opportunity for cloud HPC to be uniquely useful in use cases like this; while you certainly can provision beefy CPU and GPU nodes with InfiniBand in Azure, cloud services can't obviously simplify this ZeroMQ-based workflow beyond just supplying general-purpose VMs on which the orchestration services can run. Had this team stuck with a file-based streaming mechanism, the performance SLAs on cloud storage (like object or ephemeral Lustre) would provide a more reliable experience to ensure the data transfer happened in near-real-time. But the better solution to unpredictable file system performance is to do exactly what was done here: skip the file system entirely.

Just to keep the speaker honest, I asked why this computation couldn't simply be done at the same place as the microscope generating the data. After all, if the microscope always generates 750 GB per shot, you should be able to buy a couple GPU servers that are ideally sized to process that exact workload in the time between images. There were actually two answers, one from Sam and one from an audience member:

Sam said that you can process this workflow locally, but that the goal of this work was to prepare for a future microscope (or another instrument) that could not. He also insightfully pointed out that there's tremendous value in getting the data into the HPC environment because of all the services that can be used to work on that data later. I envisioned doing things like using a Jupyter notebook to further process the data, serve it up through a web UI, and similar tasks that cannot be done if the data is stuck inside a microscope room.
An audience member also pointed out that sticking GPU nodes in the same room as electron microscopes can result in enough noise and vibration to disrupt the actual scope. This was a great point! In the days before I started working in HPC, I was training to become an electron microscopist, and I worked in a lab where we had water-cooled walls to avoid the problems that would be caused by air conditioning breezes. There's no way a loud server would've worked in there.

Toshio Endo (Tokyo Tech) gave an interesting talk on how they enable urgent/interactive compute jobs on their batch-scheduled TSUBAME4.0 supercomputer by doing, frankly, unnatural things. Rather than holding aside some nodes for interactive use as is common practice, his work found that a lot of user jobs do not completely use all resources on each compute node they reserve:

I had to do a double-take when I saw this: even though 65%-80% of the nodes on the supercomputer were allocated to user jobs, less than 7% of the GPUs were actually being utilized.

Dr. Endo's hypothesis was that if nodes were suitably subdivided and jobs were allowed to oversubscribe CPUs, GPUs, and memory on a compute node without impacting performance too much, they could deliver real-time access to HPC resources without having to create a separate pool of nodes only for interactive uses. He defined success as a shared job slowing down by no more than a factor of k when k jobs share the same node (that is, each job keeps at least 1/k of its dedicated performance); for example, if four jobs were all running on the same node, each one taking four times as long to complete would be acceptable, but any longer would not. He then went on to show that the best way to accomplish this is using Slurm's gang scheduling, where each job takes turns having exclusive access to all the CPUs and GPUs on a node. The alternative (just letting the OS context switch) was no good.
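The success criterion is easy to play with in a toy model of time slicing. The sketch below is not Slurm's implementation: it simply rotates k jobs through fixed time slices, with a hypothetical `switch_cost` parameter standing in for whatever overhead is paid when one job is swapped out for the next, and then checks each job's slowdown against the factor-of-k target.

```python
def gang_schedule(work, timeslice, switch_cost=0.0):
    """Rotate jobs through exclusive time slices on one node.

    work: seconds of exclusive node time each job needs.
    Returns the wall-clock finish time of each job.
    """
    remaining = list(work)
    finish = [0.0] * len(work)
    clock = 0.0
    while any(r > 0 for r in remaining):
        for i, r in enumerate(remaining):
            if r <= 0:
                continue  # this job already finished
            run = min(timeslice, r)
            clock += run
            remaining[i] = r - run
            if remaining[i] <= 0:
                finish[i] = clock
            clock += switch_cost  # hypothetical overhead of swapping jobs
    return finish

def meets_target(work, finish):
    """True if no job is slowed down by more than a factor of k = number of jobs."""
    k = len(work)
    return all(f / w <= k for w, f in zip(work, finish))

work = [100, 100, 100, 100]  # four jobs, 100 s each when run alone

ideal = gang_schedule(work, timeslice=10)
print(ideal, meets_target(work, ideal))      # [370.0, 380.0, 390.0, 400.0] True

costly = gang_schedule(work, timeslice=10, switch_cost=1)
print(max(costly), meets_target(work, costly))  # 439.0 False
```

With free switching, the last of four jobs finishes at exactly four times its standalone runtime, so the target is met with no margin to spare. Any per-switch overhead pushes it over, which suggests why the choice of sharing mechanism mattered so much in the study.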

While a fascinating study in how to provide zero wait time to jobs in exchange for reduced performance, this whole mechanism of using gang scheduling to exploit low resource utilization seems like jamming a square peg into a round hole. If a workload doesn't (or can't) use all the GPUs on a node, then that's not the right node for the job; I feel like a more appealing solution would simply be to offer a heterogeneous mix of nodes based on the demands of the workload mix. This is hard to do if you're buying monolithic supercomputers since you're stuck with whatever node mix you've got for five years, but there is another way to buy supercomputers!

I won't pretend like dynamically provisioning different flavors of CPU- and GPU-based nodes interconnected with InfiniBand in the cloud doesn't come with a cost; the convenience of being able to slosh a cluster makeup between CPU-heavy and GPU-heavy nodes will be more expensive than committing to use the same makeup of node flavors for multiple years. But if you're paying for GPUs that are only being used 7% of the time, surely it's cheaper to pay a higher cost for GPUs when you need them if it also allows you to not pay for them 93% of the time when they're idle.
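The break-even point for that trade is simple to compute. The sketch below takes the 7% figure at face value as the fraction of time a GPU is busy, and the hourly rates in the usage lines are hypothetical round numbers rather than real prices.

```python
def breakeven_premium(utilization):
    """Largest on-demand price multiple at which paying only when busy still wins."""
    return 1.0 / utilization

def cheaper_on_demand(utilization, committed_rate, on_demand_rate):
    """Compare cost per wall-clock hour: always-on commitment vs. pay-when-used."""
    return utilization * on_demand_rate < committed_rate

print(round(breakeven_premium(0.07), 1))   # 14.3

# Hypothetical rates: the committed GPU costs 1.0 per hour whether used or not.
print(cheaper_on_demand(0.07, 1.0, 3.0))   # True: a 3x premium is still cheaper
print(cheaper_on_demand(0.07, 1.0, 15.0))  # False: a 15x premium is not
```

At 7% utilization, on-demand GPUs could cost up to about 14 times the committed hourly rate before the always-on option comes out ahead.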

Bjoern Enders (NERSC) gave the first lightning talk where he presented the exploration they're making into enabling real-time and urgent computation. They're currently going in three parallel directions to provide this capability:

Reservations, a process by which a user can request a specific number of nodes for a specific period of time, and Slurm ensures that many nodes are available for the exclusive use of that user by the time the reservation starts. He said that implementing this at NERSC is costly and rigid because it requires a human administrator to perform manual steps to register the reservation with Slurm.
Realtime queues, where a few nodes are held from the regular batch queue and only special real-time users can submit jobs to them. Dr. Enders said that NERSC is extremely selective about who can access this queue for obvious reasons: if too many people use it, it will back up just like the regular batch queues do.
Jupyter Hub, which utilizes job preemption and backfill under the hood. If a user requests a Jupyter job, Slurm will pre-empt a job that was submitted to a preemptible queue to satisfy the Jupyter request. However, if there are no preemptible jobs running, the Jupyter job will fail to launch after waiting for ten minutes.

To provide compute resources to back up these scheduling capabilities, they also deployed a new set of compute nodes that can be dynamically attached to the different supercomputers they have, so that urgent workloads can be supported even during downtimes. Called "Perlmutter on Demand" (POD), it sounded like a separate set of Cray EX racks that can be assigned either to the Perlmutter supercomputer or, if Perlmutter is down for maintenance, to their smaller Alvarez or Muller supercomputers, which share the same Cray EX architecture. What wasn't clear to me is how the Slingshot fabrics of these nodes interact; perhaps POD has its own fabric, and only the control plane owning those racks is what changes.

He showed a slide of explorations they're doing with this POD infrastructure, but as with Dr. Endo's talk, this seemed a bit like a square peg in a round hole:

All of this sounds aligned with the strengths of what HPC in a cloud environment can deliver, and some of the big challenges (like figuring out the ideal node count to reserve for interactive jobs) are problems specific to Slurm and its mechanism for scheduling. There's a lot more flexibility to rapidly provision HPC resources in cloud environments because, unlike the case where Slurm is scheduling jobs on a single cluster, cloud resource managers can schedule across any number of clusters independently. For example, if an urgent workload needing only four GPU nodes suddenly appears, it doesn't necessarily have to be scheduled on the same InfiniBand fabric that a large hero job is running on. Since the urgent job and the hero job don't need to talk to each other, cloud resource managers can go find a GPU cluster with a little more flex in it to provision those resources quickly.

Automating the process of reservations is also a bit of a game of catch-up, though my guess is that this is more a matter of someone having a weekend to sit down and write the REST service that manages incoming reservation requests. Although there's not a direct analog for reservations like this in Azure, AWS has a feature called AWS Capacity Blocks that does exactly this: if you know you'll want a certain number of GPU nodes sometime in the future, Capacity Blocks let you reserve them ahead of time through an API.

Finally, I represented Microsoft and gave a lightning talk that riffed on a lot of what I've been writing about in this blog post: HPC seems to be reinventing a lot of things that the cloud has already figured out how to do. The illustrious Nick Brown was kind enough to snap a photo of one of my slides and post it on Twitter:

My thesis was that the way urgent HPC workflows are triggered, scheduled, run, and reported on follows the same pattern by which inferencing-as-a-service offerings (like Copilot and ChatGPT) are implemented under the hood, right down to executing multi-node jobs on InfiniBand clusters. The difference is that these cloud workflows are built on the foundation of really nice cloud services that provide security, scalability, monitoring, and hands-free management that were originally developed for commercial (not HPC!) customers. My argument was that, even if you don't want to pay cloud providers to run urgent HPC workflows as a managed service, you can use these services (and the software infrastructure on which they're built) as a blueprint for how to build these capabilities in your own HPC environments.

Concluding thoughts

The ISC'24 conference was fantastic, and I am glad it has not lost the unique elements that made me want to attend in the years prior to the pandemic. It's still that smaller, intimate, and focused HPC conference that brings the community together. Although a lot of my synopsis above may sound critical of the content presented over the four days I attended, the fact that I've had so much to write down in this blog post is a testament to the value I really get out of attending: it makes me sit down and think critically about the way the HPC community is evolving, what the leading minds in the field are thinking, and where I might be able to contribute the most in the coming year.

I never much paid attention to the annual taglines of conferences like ISC, but this year's "Reinvent HPC" really resonated. The HPC community is at a crossroads. Exascale computing for science is now in the rear-view mirror, and large-scale AI is all the rage across the computing industry at large. But for the first time ever, this new direction in at-scale computing is happening without the inclusion of the people and organizations who've historically driven innovation in HPC. Whereas institutions like Oak Ridge, RIKEN, Cray, and Fujitsu defined the future of computing for decades, hundred-person startups like OpenAI and Anthropic are now paving the way in partnership with companies like Microsoft and Amazon.

HPC needs to be reinvented, if for no other reason than to decide whether the HPC community wants to be inclusive of new frontiers in computing that they do not lead. Does the HPC community want AI to be considered a part of HPC?

Judging from many speakers and panelists, the answer may be "no." To many, it sounded like AI is just another industry that's sucking all the air (and GPUs) out of the room; it's a distraction that is pulling funding and public interest away from solving real problems. It's not something worth understanding, it's not something that uses the familiar tools and libraries, and it's not the product of decades of steady, government-funded improvements. AI is "them" and HPC is "us."

Personally, I'd like the answer to be "yes" though. Now that I'm on the other side of the table, supporting AI for a cloud provider, I can say that the technical challenges I face at Microsoft are the same technical challenges I faced in the DOE. The desire to deeply understand systems, optimize applications, and put world-class computing infrastructure in the hands of people who do amazing things is the same. And as the days go by, many of the faces I see are the same; instead of wearing DOE or Cray badges, my lifelong colleagues are now wearing NVIDIA or Microsoft badges.

All this applies equally to whether cloud is HPC or not. The HPC community needs to reinvent itself to be inclusive of everyone working towards solving the same problems of computing at scale. Stop talking about people who work on commercial AI in cloud-based supercomputers as if they aren't in the room. They are in the room. Often near the front row, snapping photos, and angrily posting commentary on Twitter about how you're getting it all wrong.

HPC has historically been used to solve scientific problems, whether to expand our understanding of the universe, to find the next best place to drill an oil well, or to model the safety of aging nuclear weapons. The fact that HPC is now being used to solve squishier problems related to natural language or image generation does not change the essence of HPC. And whether that HPC is delivered through physical nodes and networks or virtualized nodes and networks is irrelevant, as long as those resources are still delivering high performance. AI is just as much HPC as scientific computing is, and cloud is just as much HPC as OLCF, R-CCS, or CSCS is.

So perhaps HPC doesn't need to be reinvented as much as the mindset of its community does.

That all said, I am genuinely impressed by how quickly ISC has been reinventing itself in recent years. It wasn't too long ago that all its keynote speakers were greybeards from a predictable pool of public HPC centers all saying the same things year after year. It's wonderful to see a greater diversity of perspectives on the main stage and torches passing on to the next generation of leading figures in the field. And it was not lost on me that, for the first time in the history of this conference, Thomas Sterling did not deliver the closing keynote. As much fun as I had poking fun at his meandering and often-off-the-mark conjectures every year, it was delightful to be exposed to something new this year.

I'm hopeful that ISC will continue to get better year over year, and ISC'25 will feel more inclusive of me despite the fact that I am now one of those hyperscale cloud AI people. So long as I still feel like it's my community, though, I will keep showing up in Germany every summer.

Centralized system and LSF logging on a Turing Pi system

2024-04-05T12:34:38-06:00

Logs are one of those indispensable things in IT when things go wrong. Having worked in technical support for software products in a past life, I’ve likely looked at hundreds (or more) logs over the years, helping to identify issues. So, I really appreciate the importance of logs, but I can honestly say that I never really thought about a logging strategy for the systems on my home network - primarily those running Linux.

One of my longtime friends, Peter Czanik, who also works in IT, happens to be a logging guru as well as an IBM Champion for Power Systems (yeah!). So it’s only natural that we get to talking about logging. He often complains that even at IT security conferences people are unaware of the importance of central logging. So, why is it so important? For security it’s obvious: logs are stored independently from the compromised system, so they cannot be modified or deleted by the attacker. But central logging is beneficial for the HPC operator as well. First of all, there’s availability. You can read the logs even if one of your nodes becomes unreachable. Instead of trying to breathe life into the failed node, you can just take a look at the logs and see a broken hard drive, or a similarly deadly problem. There’s also convenience, as all logs are available at a single location. Logging into each node on a 3 node cluster to check locally saved logs is inconvenient but doable. On a 10 node cluster it takes a long time. On a 100 node cluster it takes a couple of working days. If your logs are collected in a central location, however, it may take just a single grep command, or a search in Kibana or a similar web interface.

Those who follow my blog will know that I’ve been tinkering with a Turing Pi V1 system lately. You can read my latest post here. For me, the Turing Pi has always been a cluster in a box. My Turing Pi is fully populated with 7 compute modules. I’ve designated Node 1 to be the NFS server and LSF manager for the cluster. LSF is a workload scheduler for high-performance computing (HPC) from IBM. Naturally I turned to Peter for his guidance on this, and the result is this blog. Peter recommended that I use syslog-ng for log aggregation and also helped me through some of my first steps with syslog-ng. The goal was to aggregate both the system (syslog) and LSF logs on Node 1. TL;DR it was easy to get it all working. But I encourage you to read on to better understand the nuances and the configuration that both syslog-ng and LSF needed.

The environment

The following software has been deployed on the Turing Pi:

Raspberry Pi OS (2023-02-21-raspios-bullseye-arm64-lite.img)
syslog-ng 3 – (3.28.1 as supplied with Raspberry Pi OS)
IBM LSF Standard Edition V10.1.0.13

The Turing Pi system is configured as follows:

Node 1 (turingpi) is the manager node of this cluster in a box and has by far the most storage. Naturally we want to use that as the centralized logging server.

Node	Hostname	Hardware	Notes
1	turingpi	CM3+	LSF manager, NFS server, 128GB SDcard
2	kemeny	CM3	4GB eMMC flash
3	neumann	CM3+	8GB SDcard
4	szilard	CM3+	8GB SDcard
5	teller	CM3+	8GB SDcard
6	vonkarman	CM3+	8GB SDcard
7	wigner	CM3+	8GB SDcard

Syslog-ng & LSF setup

Raspberry Pi OS configures rsyslog out of the box. The first step is to install syslog-ng on Node 1 in the environment. Note that installing syslog-ng automatically removes the rsyslog package from the node.

Output of apt update; apt-get install syslog-ng -y.

root@turingpi:~# apt update; apt-get install syslog-ng -y 
Hit:1 http://security.debian.org/debian-security bullseye-security InRelease
Hit:2 http://deb.debian.org/debian bullseye InRelease                                                        
Hit:3 http://deb.debian.org/debian bullseye-updates InRelease                                                
Hit:4 https://repos.influxdata.com/debian stable InRelease                                                   
Hit:5 https://repos.influxdata.com/debian bullseye InRelease                                                 
Hit:6 http://archive.raspberrypi.org/debian bullseye InRelease                                  
Hit:7 https://packagecloud.io/ookla/speedtest-cli/debian bullseye InRelease                     
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
All packages are up to date.
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
The following additional packages will be installed:
  libbson-1.0-0 libdbi1 libesmtp6 libhiredis0.14 libivykis0 libmaxminddb0 libmongoc-1.0-0 libmongocrypt0
  libnet1 libprotobuf-c1 librabbitmq4 librdkafka1 libriemann-client0 libsnappy1v5 libsnmp-base libsnmp40
  syslog-ng-core syslog-ng-mod-add-contextual-data syslog-ng-mod-amqp syslog-ng-mod-examples
  syslog-ng-mod-extra syslog-ng-mod-geoip2 syslog-ng-mod-getent syslog-ng-mod-graphite syslog-ng-mod-http
  syslog-ng-mod-map-value-pairs syslog-ng-mod-mongodb syslog-ng-mod-python syslog-ng-mod-rdkafka
  syslog-ng-mod-redis syslog-ng-mod-riemann syslog-ng-mod-slog syslog-ng-mod-smtp syslog-ng-mod-snmp
  syslog-ng-mod-sql syslog-ng-mod-stardate syslog-ng-mod-stomp syslog-ng-mod-xml-parser
Suggested packages:
  mmdb-bin snmp-mibs-downloader rabbitmq-server graphite-web mongodb-server libdbd-mysql libdbd-pgsql
  libdbd-sqlite3 activemq
The following packages will be REMOVED:
  rsyslog
The following NEW packages will be installed:
  libbson-1.0-0 libdbi1 libesmtp6 libhiredis0.14 libivykis0 libmaxminddb0 libmongoc-1.0-0 libmongocrypt0
  libnet1 libprotobuf-c1 librabbitmq4 librdkafka1 libriemann-client0 libsnappy1v5 libsnmp-base libsnmp40
  syslog-ng syslog-ng-core syslog-ng-mod-add-contextual-data syslog-ng-mod-amqp syslog-ng-mod-examples
  syslog-ng-mod-extra syslog-ng-mod-geoip2 syslog-ng-mod-getent syslog-ng-mod-graphite syslog-ng-mod-http
  syslog-ng-mod-map-value-pairs syslog-ng-mod-mongodb syslog-ng-mod-python syslog-ng-mod-rdkafka
  syslog-ng-mod-redis syslog-ng-mod-riemann syslog-ng-mod-slog syslog-ng-mod-smtp syslog-ng-mod-snmp
  syslog-ng-mod-sql syslog-ng-mod-stardate syslog-ng-mod-stomp syslog-ng-mod-xml-parser
0 upgraded, 39 newly installed, 1 to remove and 0 not upgraded.
Need to get 7,015 kB of archives.
After this operation, 15.1 MB of additional disk space will be used.
Get:1 http://deb.debian.org/debian bullseye/main arm64 libbson-1.0-0 arm64 1.17.6-1 [69.7 kB]
Get:2 http://deb.debian.org/debian bullseye/main arm64 libmongocrypt0 arm64 1.1.0-1 [114 kB]
Get:3 http://deb.debian.org/debian bullseye/main arm64 libsnappy1v5 arm64 1.1.8-1 [17.2 kB]
Get:4 http://deb.debian.org/debian bullseye/main arm64 libmongoc-1.0-0 arm64 1.17.6-1 [257 kB]
Get:5 http://deb.debian.org/debian bullseye/main arm64 libivykis0 arm64 0.42.4-1 [25.3 kB]
Get:6 http://deb.debian.org/debian bullseye/main arm64 libnet1 arm64 1.1.6+dfsg-3.1 [56.8 kB]
Get:7 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-core arm64 3.28.1-2+deb11u1 [591 kB]
Get:8 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-mongodb arm64 3.28.1-2+deb11u1 [37.9 kB]
Get:9 http://deb.debian.org/debian bullseye/main arm64 libdbi1 arm64 0.9.0-6 [27.8 kB]
Get:10 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-sql arm64 3.28.1-2+deb11u1 [41.5 kB]
Get:11 http://deb.debian.org/debian bullseye/main arm64 libesmtp6 arm64 1.0.6-4.3 [52.0 kB]
Get:12 http://deb.debian.org/debian bullseye/main arm64 libhiredis0.14 arm64 0.14.1-1 [33.7 kB]
Get:13 http://deb.debian.org/debian bullseye/main arm64 libmaxminddb0 arm64 1.5.2-1 [29.6 kB]
Get:14 http://deb.debian.org/debian bullseye/main arm64 libprotobuf-c1 arm64 1.3.3-1+b2 [26.8 kB]
Get:15 http://deb.debian.org/debian bullseye/main arm64 librabbitmq4 arm64 0.10.0-1 [39.7 kB]
Get:16 http://deb.debian.org/debian bullseye/main arm64 librdkafka1 arm64 1.6.0-1 [515 kB]
Get:17 http://deb.debian.org/debian bullseye/main arm64 libriemann-client0 arm64 1.10.4-2+b2 [21.9 kB]
Get:18 http://deb.debian.org/debian bullseye/main arm64 libsnmp-base all 5.9+dfsg-4+deb11u1 [1,736 kB]
Get:19 http://deb.debian.org/debian bullseye/main arm64 libsnmp40 arm64 5.9+dfsg-4+deb11u1 [2,497 kB]
Get:20 http://deb.debian.org/debian bullseye/main arm64 syslog-ng all 3.28.1-2+deb11u1 [25.9 kB]
Get:21 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-add-contextual-data arm64 3.28.1-2+deb11u1 [40.5 kB]
Get:22 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-amqp arm64 3.28.1-2+deb11u1 [48.8 kB]
Get:23 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-examples arm64 3.28.1-2+deb11u1 [57.3 kB]
Get:24 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-extra all 3.28.1-2+deb11u1 [35.7 kB]
Get:25 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-geoip2 arm64 3.28.1-2+deb11u1 [36.9 kB]
Get:26 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-graphite arm64 3.28.1-2+deb11u1 [29.4 kB]
Get:27 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-http arm64 3.28.1-2+deb11u1 [50.5 kB]
Get:28 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-python arm64 3.28.1-2+deb11u1 [69.9 kB]
Get:29 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-rdkafka arm64 3.28.1-2+deb11u1 [41.5 kB]
Get:30 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-redis arm64 3.28.1-2+deb11u1 [37.6 kB]
Get:31 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-riemann arm64 3.28.1-2+deb11u1 [40.1 kB]
Get:32 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-slog arm64 3.28.1-2+deb11u1 [63.3 kB]
Get:33 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-smtp arm64 3.28.1-2+deb11u1 [38.0 kB]
Get:34 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-snmp arm64 3.28.1-2+deb11u1 [42.5 kB]
Get:35 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-stomp arm64 3.28.1-2+deb11u1 [39.1 kB]
Get:36 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-xml-parser arm64 3.28.1-2+deb11u1 [34.7 kB]
Get:37 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-getent arm64 3.28.1-2+deb11u1 [29.5 kB]
Get:38 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-map-value-pairs arm64 3.28.1-2+deb11u1 [34.0 kB]
Get:39 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-stardate arm64 3.28.1-2+deb11u1 [28.6 kB]
Fetched 7,015 kB in 5s (1,311 kB/s)           
Extracting templates from packages: 100%
(Reading database ... 90182 files and directories currently installed.)
Removing rsyslog (8.2102.0-2+deb11u1) ...
Selecting previously unselected package libbson-1.0-0.
(Reading database ... 90124 files and directories currently installed.)
Preparing to unpack .../00-libbson-1.0-0_1.17.6-1_arm64.deb ...
Unpacking libbson-1.0-0 (1.17.6-1) ...
Selecting previously unselected package libmongocrypt0:arm64.
Preparing to unpack .../01-libmongocrypt0_1.1.0-1_arm64.deb ...
Unpacking libmongocrypt0:arm64 (1.1.0-1) ...
Selecting previously unselected package libsnappy1v5:arm64.
Preparing to unpack .../02-libsnappy1v5_1.1.8-1_arm64.deb ...
Unpacking libsnappy1v5:arm64 (1.1.8-1) ...
Selecting previously unselected package libmongoc-1.0-0.
Preparing to unpack .../03-libmongoc-1.0-0_1.17.6-1_arm64.deb ...
Unpacking libmongoc-1.0-0 (1.17.6-1) ...
Selecting previously unselected package libivykis0:arm64.
Preparing to unpack .../04-libivykis0_0.42.4-1_arm64.deb ...
Unpacking libivykis0:arm64 (0.42.4-1) ...
Selecting previously unselected package libnet1:arm64.
Preparing to unpack .../05-libnet1_1.1.6+dfsg-3.1_arm64.deb ...
Unpacking libnet1:arm64 (1.1.6+dfsg-3.1) ...
Selecting previously unselected package syslog-ng-core.
Preparing to unpack .../06-syslog-ng-core_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-core (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-mongodb.
Preparing to unpack .../07-syslog-ng-mod-mongodb_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-mongodb (3.28.1-2+deb11u1) ...
Selecting previously unselected package libdbi1:arm64.
Preparing to unpack .../08-libdbi1_0.9.0-6_arm64.deb ...
Unpacking libdbi1:arm64 (0.9.0-6) ...
Selecting previously unselected package syslog-ng-mod-sql.
Preparing to unpack .../09-syslog-ng-mod-sql_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-sql (3.28.1-2+deb11u1) ...
Selecting previously unselected package libesmtp6.
Preparing to unpack .../10-libesmtp6_1.0.6-4.3_arm64.deb ...
Unpacking libesmtp6 (1.0.6-4.3) ...
Selecting previously unselected package libhiredis0.14:arm64.
Preparing to unpack .../11-libhiredis0.14_0.14.1-1_arm64.deb ...
Unpacking libhiredis0.14:arm64 (0.14.1-1) ...
Selecting previously unselected package libmaxminddb0:arm64.
Preparing to unpack .../12-libmaxminddb0_1.5.2-1_arm64.deb ...
Unpacking libmaxminddb0:arm64 (1.5.2-1) ...
Selecting previously unselected package libprotobuf-c1:arm64.
Preparing to unpack .../13-libprotobuf-c1_1.3.3-1+b2_arm64.deb ...
Unpacking libprotobuf-c1:arm64 (1.3.3-1+b2) ...
Selecting previously unselected package librabbitmq4:arm64.
Preparing to unpack .../14-librabbitmq4_0.10.0-1_arm64.deb ...
Unpacking librabbitmq4:arm64 (0.10.0-1) ...
Selecting previously unselected package librdkafka1:arm64.
Preparing to unpack .../15-librdkafka1_1.6.0-1_arm64.deb ...
Unpacking librdkafka1:arm64 (1.6.0-1) ...
Selecting previously unselected package libriemann-client0:arm64.
Preparing to unpack .../16-libriemann-client0_1.10.4-2+b2_arm64.deb ...
Unpacking libriemann-client0:arm64 (1.10.4-2+b2) ...
Selecting previously unselected package libsnmp-base.
Preparing to unpack .../17-libsnmp-base_5.9+dfsg-4+deb11u1_all.deb ...
Unpacking libsnmp-base (5.9+dfsg-4+deb11u1) ...
Selecting previously unselected package libsnmp40:arm64.
Preparing to unpack .../18-libsnmp40_5.9+dfsg-4+deb11u1_arm64.deb ...
Unpacking libsnmp40:arm64 (5.9+dfsg-4+deb11u1) ...
Selecting previously unselected package syslog-ng.
Preparing to unpack .../19-syslog-ng_3.28.1-2+deb11u1_all.deb ...
Unpacking syslog-ng (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-add-contextual-data.
Preparing to unpack .../20-syslog-ng-mod-add-contextual-data_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-add-contextual-data (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-amqp.
Preparing to unpack .../21-syslog-ng-mod-amqp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-amqp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-examples.
Preparing to unpack .../22-syslog-ng-mod-examples_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-examples (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-extra.
Preparing to unpack .../23-syslog-ng-mod-extra_3.28.1-2+deb11u1_all.deb ...
Unpacking syslog-ng-mod-extra (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-geoip2.
Preparing to unpack .../24-syslog-ng-mod-geoip2_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-geoip2 (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-graphite.
Preparing to unpack .../25-syslog-ng-mod-graphite_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-graphite (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-http.
Preparing to unpack .../26-syslog-ng-mod-http_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-http (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-python.
Preparing to unpack .../27-syslog-ng-mod-python_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-python (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-rdkafka.
Preparing to unpack .../28-syslog-ng-mod-rdkafka_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-rdkafka (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-redis.
Preparing to unpack .../29-syslog-ng-mod-redis_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-redis (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-riemann.
Preparing to unpack .../30-syslog-ng-mod-riemann_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-riemann (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-slog.
Preparing to unpack .../31-syslog-ng-mod-slog_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-slog (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-smtp.
Preparing to unpack .../32-syslog-ng-mod-smtp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-smtp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-snmp.
Preparing to unpack .../33-syslog-ng-mod-snmp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-snmp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-stomp.
Preparing to unpack .../34-syslog-ng-mod-stomp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-stomp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-xml-parser.
Preparing to unpack .../35-syslog-ng-mod-xml-parser_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-xml-parser (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-getent.
Preparing to unpack .../36-syslog-ng-mod-getent_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-getent (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-map-value-pairs.
Preparing to unpack .../37-syslog-ng-mod-map-value-pairs_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-map-value-pairs (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-stardate.
Preparing to unpack .../38-syslog-ng-mod-stardate_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-stardate (3.28.1-2+deb11u1) ...
Setting up librabbitmq4:arm64 (0.10.0-1) ...
Setting up libdbi1:arm64 (0.9.0-6) ...
Setting up libsnmp-base (5.9+dfsg-4+deb11u1) ...
Setting up libmaxminddb0:arm64 (1.5.2-1) ...
Setting up libesmtp6 (1.0.6-4.3) ...
Setting up libnet1:arm64 (1.1.6+dfsg-3.1) ...
Setting up libprotobuf-c1:arm64 (1.3.3-1+b2) ...
Setting up libsnappy1v5:arm64 (1.1.8-1) ...
Setting up libsnmp40:arm64 (5.9+dfsg-4+deb11u1) ...
Setting up libbson-1.0-0 (1.17.6-1) ...
Setting up libivykis0:arm64 (0.42.4-1) ...
Setting up libriemann-client0:arm64 (1.10.4-2+b2) ...
Setting up librdkafka1:arm64 (1.6.0-1) ...
Setting up libhiredis0.14:arm64 (0.14.1-1) ...
Setting up libmongocrypt0:arm64 (1.1.0-1) ...
Setting up libmongoc-1.0-0 (1.17.6-1) ...
Setting up syslog-ng-core (3.28.1-2+deb11u1) ...
Created symlink /etc/systemd/system/multi-user.target.wants/syslog-ng.service → /lib/systemd/system/syslog-ng.service.
Setting up syslog-ng-mod-examples (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-xml-parser (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-stomp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-riemann (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-stardate (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-geoip2 (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-getent (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-amqp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-python (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-smtp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-snmp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-extra (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-rdkafka (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-graphite (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-add-contextual-data (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-mongodb (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-http (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-slog (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-map-value-pairs (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-sql (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-redis (3.28.1-2+deb11u1) ...
Setting up syslog-ng (3.28.1-2+deb11u1) ...
Processing triggers for man-db (2.9.4-2) ...
Processing triggers for libc-bin (2.31-13+rpt2+rpi1+deb11u8) ...
Scanning processes...                                                                                         
Scanning processor microcode...                                                                               
Scanning linux images...                                                                                      

Running kernel seems to be up-to-date.

Failed to check for processor microcode upgrades.

No services need to be restarted.

No containers need to be restarted.

No user sessions are running outdated binaries.

With syslog-ng installed, it’s now time to build its configuration. A new configuration file, fromnet.conf, is shown below. It defines a syslog-ng source that listens on port 601 and a destination that aggregates logs from the Turing Pi nodes in /var/log/fromnet in plain text format. Additionally, the logs are written in JSON format to the file /var/log/fromnet.json.

root@turingpi:~# cat /etc/syslog-ng/fromnet.conf 
# source
source s_fromnet {
  syslog(port(601));
};
# destination 
destination d_fromnet {
  file("/var/log/fromnet");
  file("/var/log/fromnet.json" template("$(format-json --scope rfc5424 --scope dot-nv-pairs
        --rekey .* --shift 1 --scope nv-pairs)\n") );
};
# log path
log {
  source(s_fromnet);
  destination(d_fromnet);
};
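
One benefit of keeping a JSON copy is that the aggregated logs become easy to process with a script. The following is a minimal sketch, not part of the setup above, that counts messages per host and program. It assumes each line of /var/log/fromnet.json is one JSON object carrying HOST and PROGRAM keys; those key names and the sample records are assumptions on my part, so check a few lines of your own file first.

```python
import json
from collections import Counter


def count_by_host_and_program(lines):
    """Count log records per (host, program) pair from JSON-per-line input."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip partial or corrupted lines rather than abort
        if not isinstance(record, dict):
            continue
        host = record.get("HOST", "unknown")        # assumed key name
        program = record.get("PROGRAM", "unknown")  # assumed key name
        counts[(host, program)] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical records for illustration only; real ones have more fields.
    sample = [
        '{"HOST": "kemeny", "PROGRAM": "lim", "MESSAGE": "example one"}',
        '{"HOST": "kemeny", "PROGRAM": "lim", "MESSAGE": "example two"}',
        '{"HOST": "teller", "PROGRAM": "sshd", "MESSAGE": "example three"}',
    ]
    for (host, program), n in sorted(count_by_host_and_program(sample).items()):
        print(host, program, n)
```

To run it against the real file, pass an open file object instead of the sample list, for example `count_by_host_and_program(open("/var/log/fromnet.json"))`.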

Unless we only want to see source IP addresses in the collected logs, it’s necessary to update the syslog-ng configuration file /etc/syslog-ng/syslog-ng.conf to record the hostnames from which the log messages have originated. This is done by adding the keep_hostname(yes) parameter to the options section as follows:

....
....
# First, set some global options. 
options { chain_hostnames(off); flush_lines(0); use_dns(no); use_fqdn(no);          
        keep_hostname(yes);dns_cache(no); owner("root"); group("adm"); perm(0640); 
        stats_freq(0); bad_hostname("^gconfd$"); 
};
....
....

Next, the IBM LSF configuration is updated to prevent the creation of local logfiles for the LSF daemons. This is done by commenting out the LSF_LOGDIR option in the configuration file $LSF_ENVDIR/lsf.conf. At the same time, we also set LSF_LOG_MASK=LOG_DEBUG for testing purposes to enable verbose logging for the LSF daemons.

....
....
# Daemon log messages
# LSF_LOGDIR=/opt/ibm/lsf/log
LSF_LOG_MASK=LOG_DEBUG
....
....
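The same two changes can be scripted rather than made by hand. The sketch below is one possible way to do it, assuming GNU sed (for the -i option); the helper name disable_lsf_logdir is hypothetical, and the example runs against a scratch file with sample contents rather than the live $LSF_ENVDIR/lsf.conf.

```shell
#!/bin/sh
# disable_lsf_logdir FILE: comment out LSF_LOGDIR and set LSF_LOG_MASK=LOG_DEBUG
# in an lsf.conf-style file (hypothetical helper; GNU sed -i assumed).
disable_lsf_logdir() {
  conf=$1
  # Prefix an uncommented LSF_LOGDIR line with "# ".
  sed -i 's/^LSF_LOGDIR=/# LSF_LOGDIR=/' "$conf"
  if grep -q '^LSF_LOG_MASK=' "$conf"; then
    # Replace an existing mask setting.
    sed -i 's/^LSF_LOG_MASK=.*/LSF_LOG_MASK=LOG_DEBUG/' "$conf"
  else
    # Otherwise append one.
    echo 'LSF_LOG_MASK=LOG_DEBUG' >> "$conf"
  fi
}

# Try it on a scratch file with sample contents.
scratch=$(mktemp)
printf '%s\n' '# Daemon log messages' 'LSF_LOGDIR=/opt/ibm/lsf/log' 'LSF_LOG_MASK=LOG_WARNING' > "$scratch"
disable_lsf_logdir "$scratch"
cat "$scratch"
rm -f "$scratch"
```

Running the function a second time leaves the file unchanged, which makes it safe to repeat.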

Finally, to make the changes take effect, both syslog-ng and LSF are restarted.

root@turingpi:~# systemctl restart syslog-ng 
root@turingpi:~# . /opt/ibm/lsf/conf/profile.lsf  
root@turingpi:~# lsf_daemons restart 
Stopping the LSF subsystem 
Starting the LSF subsystem

With the configuration ready on the centralized logging server, host turingpi, we now turn our attention to Nodes 2-7 in the cluster. Here we’ll use the parallel-ssh tool to streamline some operations. We start with the installation of syslog-ng across Nodes 2-7. Note that the output of the installation of syslog-ng across the compute nodes has been truncated.

Truncated output of parallel-ssh -h /opt/workers -i "apt-get install syslog-ng -y":

root@turingpi:~# parallel-ssh -h /opt/workers -i "apt-get install syslog-ng -y" 
[1] 13:57:07 [SUCCESS] kemeny
Reading package lists...
Building dependency tree...
Reading state information...
The following additional packages will be installed:
  libbson-1.0-0 libdbi1 libesmtp6 libhiredis0.14 libivykis0 libmaxminddb0
  libmongoc-1.0-0 libmongocrypt0 libnet1 libprotobuf-c1 librabbitmq4
  librdkafka1 libriemann-client0 libsensors-config libsensors5 libsnappy1v5
  libsnmp-base libsnmp40 syslog-ng-core syslog-ng-mod-add-contextual-data
  syslog-ng-mod-amqp syslog-ng-mod-examples syslog-ng-mod-extra
  syslog-ng-mod-geoip2 syslog-ng-mod-getent syslog-ng-mod-graphite
  syslog-ng-mod-http syslog-ng-mod-map-value-pairs syslog-ng-mod-mongodb
  syslog-ng-mod-python syslog-ng-mod-rdkafka syslog-ng-mod-redis
  syslog-ng-mod-riemann syslog-ng-mod-slog syslog-ng-mod-smtp
  syslog-ng-mod-snmp syslog-ng-mod-sql syslog-ng-mod-stardate
  syslog-ng-mod-stomp syslog-ng-mod-xml-parser
Suggested packages:
  mmdb-bin lm-sensors snmp-mibs-downloader rabbitmq-server graphite-web
  mongodb-server libdbd-mysql libdbd-pgsql libdbd-sqlite3 activemq
The following packages will be REMOVED:
  rsyslog
The following NEW packages will be installed:
  libbson-1.0-0 libdbi1 libesmtp6 libhiredis0.14 libivykis0 libmaxminddb0
  libmongoc-1.0-0 libmongocrypt0 libnet1 libprotobuf-c1 librabbitmq4
  librdkafka1 libriemann-client0 libsensors-config libsensors5 libsnappy1v5
  libsnmp-base libsnmp40 syslog-ng syslog-ng-core
  syslog-ng-mod-add-contextual-data syslog-ng-mod-amqp syslog-ng-mod-examples
  syslog-ng-mod-extra syslog-ng-mod-geoip2 syslog-ng-mod-getent
  syslog-ng-mod-graphite syslog-ng-mod-http syslog-ng-mod-map-value-pairs
  syslog-ng-mod-mongodb syslog-ng-mod-python syslog-ng-mod-rdkafka
  syslog-ng-mod-redis syslog-ng-mod-riemann syslog-ng-mod-slog
  syslog-ng-mod-smtp syslog-ng-mod-snmp syslog-ng-mod-sql
  syslog-ng-mod-stardate syslog-ng-mod-stomp syslog-ng-mod-xml-parser
0 upgraded, 41 newly installed, 1 to remove and 0 not upgraded.
Need to get 7,098 kB of archives.
After this operation, 15.3 MB of additional disk space will be used.
Get:1 http://deb.debian.org/debian bullseye/main arm64 libbson-1.0-0 arm64 1.17.6-1 [69.7 kB]
Get:2 http://deb.debian.org/debian bullseye/main arm64 libmongocrypt0 arm64 1.1.0-1 [114 kB]
Get:3 http://deb.debian.org/debian bullseye/main arm64 libsnappy1v5 arm64 1.1.8-1 [17.2 kB]
Get:4 http://deb.debian.org/debian bullseye/main arm64 libmongoc-1.0-0 arm64 1.17.6-1 [257 kB]
Get:5 http://deb.debian.org/debian bullseye/main arm64 libivykis0 arm64 0.42.4-1 [25.3 kB]
Get:6 http://deb.debian.org/debian bullseye/main arm64 libnet1 arm64 1.1.6+dfsg-3.1 [56.8 kB]
Get:7 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-core arm64 3.28.1-2+deb11u1 [591 kB]
Get:8 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-mongodb arm64 3.28.1-2+deb11u1 [37.9 kB]
Get:9 http://deb.debian.org/debian bullseye/main arm64 libdbi1 arm64 0.9.0-6 [27.8 kB]
Get:10 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-sql arm64 3.28.1-2+deb11u1 [41.5 kB]
Get:11 http://deb.debian.org/debian bullseye/main arm64 libesmtp6 arm64 1.0.6-4.3 [52.0 kB]
Get:12 http://deb.debian.org/debian bullseye/main arm64 libhiredis0.14 arm64 0.14.1-1 [33.7 kB]
Get:13 http://deb.debian.org/debian bullseye/main arm64 libmaxminddb0 arm64 1.5.2-1 [29.6 kB]
Get:14 http://deb.debian.org/debian bullseye/main arm64 libprotobuf-c1 arm64 1.3.3-1+b2 [26.8 kB]
Get:15 http://deb.debian.org/debian bullseye/main arm64 librabbitmq4 arm64 0.10.0-1 [39.7 kB]
Get:16 http://deb.debian.org/debian bullseye/main arm64 librdkafka1 arm64 1.6.0-1 [515 kB]
Get:17 http://deb.debian.org/debian bullseye/main arm64 libriemann-client0 arm64 1.10.4-2+b2 [21.9 kB]
Get:18 http://deb.debian.org/debian bullseye/main arm64 libsensors-config all 1:3.6.0-7 [32.3 kB]
Get:19 http://deb.debian.org/debian bullseye/main arm64 libsensors5 arm64 1:3.6.0-7 [51.2 kB]
Get:20 http://deb.debian.org/debian bullseye/main arm64 libsnmp-base all 5.9+dfsg-4+deb11u1 [1,736 kB]
Get:21 http://deb.debian.org/debian bullseye/main arm64 libsnmp40 arm64 5.9+dfsg-4+deb11u1 [2,497 kB]
Get:22 http://deb.debian.org/debian bullseye/main arm64 syslog-ng all 3.28.1-2+deb11u1 [25.9 kB]
Get:23 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-add-contextual-data arm64 3.28.1-2+deb11u1 [40.5 kB]
Get:24 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-amqp arm64 3.28.1-2+deb11u1 [48.8 kB]
Get:25 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-examples arm64 3.28.1-2+deb11u1 [57.3 kB]
Get:26 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-extra all 3.28.1-2+deb11u1 [35.7 kB]
Get:27 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-geoip2 arm64 3.28.1-2+deb11u1 [36.9 kB]
Get:28 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-graphite arm64 3.28.1-2+deb11u1 [29.4 kB]
Get:29 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-http arm64 3.28.1-2+deb11u1 [50.5 kB]
Get:30 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-python arm64 3.28.1-2+deb11u1 [69.9 kB]
Get:31 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-rdkafka arm64 3.28.1-2+deb11u1 [41.5 kB]
Get:32 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-redis arm64 3.28.1-2+deb11u1 [37.6 kB]
Get:33 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-riemann arm64 3.28.1-2+deb11u1 [40.1 kB]
Get:34 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-slog arm64 3.28.1-2+deb11u1 [63.3 kB]
Get:35 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-smtp arm64 3.28.1-2+deb11u1 [38.0 kB]
Get:36 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-snmp arm64 3.28.1-2+deb11u1 [42.5 kB]
Get:37 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-stomp arm64 3.28.1-2+deb11u1 [39.1 kB]
Get:38 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-xml-parser arm64 3.28.1-2+deb11u1 [34.7 kB]
Get:39 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-getent arm64 3.28.1-2+deb11u1 [29.5 kB]
Get:40 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-map-value-pairs arm64 3.28.1-2+deb11u1 [34.0 kB]
Get:41 http://deb.debian.org/debian bullseye/main arm64 syslog-ng-mod-stardate arm64 3.28.1-2+deb11u1 [28.6 kB]
Fetched 7,098 kB in 2s (3,566 kB/s)
(Reading database ... 37650 files and directories currently installed.)
Removing rsyslog (8.2102.0-2+deb11u1) ...
Selecting previously unselected package libbson-1.0-0.
(Reading database ... 37592 files and directories currently installed.)
Preparing to unpack .../00-libbson-1.0-0_1.17.6-1_arm64.deb ...
Unpacking libbson-1.0-0 (1.17.6-1) ...
Selecting previously unselected package libmongocrypt0:arm64.
Preparing to unpack .../01-libmongocrypt0_1.1.0-1_arm64.deb ...
Unpacking libmongocrypt0:arm64 (1.1.0-1) ...
Selecting previously unselected package libsnappy1v5:arm64.
Preparing to unpack .../02-libsnappy1v5_1.1.8-1_arm64.deb ...
Unpacking libsnappy1v5:arm64 (1.1.8-1) ...
Selecting previously unselected package libmongoc-1.0-0.
Preparing to unpack .../03-libmongoc-1.0-0_1.17.6-1_arm64.deb ...
Unpacking libmongoc-1.0-0 (1.17.6-1) ...
Selecting previously unselected package libivykis0:arm64.
Preparing to unpack .../04-libivykis0_0.42.4-1_arm64.deb ...
Unpacking libivykis0:arm64 (0.42.4-1) ...
Selecting previously unselected package libnet1:arm64.
Preparing to unpack .../05-libnet1_1.1.6+dfsg-3.1_arm64.deb ...
Unpacking libnet1:arm64 (1.1.6+dfsg-3.1) ...
Selecting previously unselected package syslog-ng-core.
Preparing to unpack .../06-syslog-ng-core_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-core (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-mongodb.
Preparing to unpack .../07-syslog-ng-mod-mongodb_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-mongodb (3.28.1-2+deb11u1) ...
Selecting previously unselected package libdbi1:arm64.
Preparing to unpack .../08-libdbi1_0.9.0-6_arm64.deb ...
Unpacking libdbi1:arm64 (0.9.0-6) ...
Selecting previously unselected package syslog-ng-mod-sql.
Preparing to unpack .../09-syslog-ng-mod-sql_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-sql (3.28.1-2+deb11u1) ...
Selecting previously unselected package libesmtp6.
Preparing to unpack .../10-libesmtp6_1.0.6-4.3_arm64.deb ...
Unpacking libesmtp6 (1.0.6-4.3) ...
Selecting previously unselected package libhiredis0.14:arm64.
Preparing to unpack .../11-libhiredis0.14_0.14.1-1_arm64.deb ...
Unpacking libhiredis0.14:arm64 (0.14.1-1) ...
Selecting previously unselected package libmaxminddb0:arm64.
Preparing to unpack .../12-libmaxminddb0_1.5.2-1_arm64.deb ...
Unpacking libmaxminddb0:arm64 (1.5.2-1) ...
Selecting previously unselected package libprotobuf-c1:arm64.
Preparing to unpack .../13-libprotobuf-c1_1.3.3-1+b2_arm64.deb ...
Unpacking libprotobuf-c1:arm64 (1.3.3-1+b2) ...
Selecting previously unselected package librabbitmq4:arm64.
Preparing to unpack .../14-librabbitmq4_0.10.0-1_arm64.deb ...
Unpacking librabbitmq4:arm64 (0.10.0-1) ...
Selecting previously unselected package librdkafka1:arm64.
Preparing to unpack .../15-librdkafka1_1.6.0-1_arm64.deb ...
Unpacking librdkafka1:arm64 (1.6.0-1) ...
Selecting previously unselected package libriemann-client0:arm64.
Preparing to unpack .../16-libriemann-client0_1.10.4-2+b2_arm64.deb ...
Unpacking libriemann-client0:arm64 (1.10.4-2+b2) ...
Selecting previously unselected package libsensors-config.
Preparing to unpack .../17-libsensors-config_1%3a3.6.0-7_all.deb ...
Unpacking libsensors-config (1:3.6.0-7) ...
Selecting previously unselected package libsensors5:arm64.
Preparing to unpack .../18-libsensors5_1%3a3.6.0-7_arm64.deb ...
Unpacking libsensors5:arm64 (1:3.6.0-7) ...
Selecting previously unselected package libsnmp-base.
Preparing to unpack .../19-libsnmp-base_5.9+dfsg-4+deb11u1_all.deb ...
Unpacking libsnmp-base (5.9+dfsg-4+deb11u1) ...
Selecting previously unselected package libsnmp40:arm64.
Preparing to unpack .../20-libsnmp40_5.9+dfsg-4+deb11u1_arm64.deb ...
Unpacking libsnmp40:arm64 (5.9+dfsg-4+deb11u1) ...
Selecting previously unselected package syslog-ng.
Preparing to unpack .../21-syslog-ng_3.28.1-2+deb11u1_all.deb ...
Unpacking syslog-ng (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-add-contextual-data.
Preparing to unpack .../22-syslog-ng-mod-add-contextual-data_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-add-contextual-data (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-amqp.
Preparing to unpack .../23-syslog-ng-mod-amqp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-amqp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-examples.
Preparing to unpack .../24-syslog-ng-mod-examples_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-examples (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-extra.
Preparing to unpack .../25-syslog-ng-mod-extra_3.28.1-2+deb11u1_all.deb ...
Unpacking syslog-ng-mod-extra (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-geoip2.
Preparing to unpack .../26-syslog-ng-mod-geoip2_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-geoip2 (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-graphite.
Preparing to unpack .../27-syslog-ng-mod-graphite_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-graphite (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-http.
Preparing to unpack .../28-syslog-ng-mod-http_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-http (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-python.
Preparing to unpack .../29-syslog-ng-mod-python_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-python (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-rdkafka.
Preparing to unpack .../30-syslog-ng-mod-rdkafka_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-rdkafka (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-redis.
Preparing to unpack .../31-syslog-ng-mod-redis_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-redis (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-riemann.
Preparing to unpack .../32-syslog-ng-mod-riemann_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-riemann (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-slog.
Preparing to unpack .../33-syslog-ng-mod-slog_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-slog (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-smtp.
Preparing to unpack .../34-syslog-ng-mod-smtp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-smtp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-snmp.
Preparing to unpack .../35-syslog-ng-mod-snmp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-snmp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-stomp.
Preparing to unpack .../36-syslog-ng-mod-stomp_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-stomp (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-xml-parser.
Preparing to unpack .../37-syslog-ng-mod-xml-parser_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-xml-parser (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-getent.
Preparing to unpack .../38-syslog-ng-mod-getent_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-getent (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-map-value-pairs.
Preparing to unpack .../39-syslog-ng-mod-map-value-pairs_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-map-value-pairs (3.28.1-2+deb11u1) ...
Selecting previously unselected package syslog-ng-mod-stardate.
Preparing to unpack .../40-syslog-ng-mod-stardate_3.28.1-2+deb11u1_arm64.deb ...
Unpacking syslog-ng-mod-stardate (3.28.1-2+deb11u1) ...
Setting up librabbitmq4:arm64 (0.10.0-1) ...
Setting up libdbi1:arm64 (0.9.0-6) ...
Setting up libsnmp-base (5.9+dfsg-4+deb11u1) ...
Setting up libmaxminddb0:arm64 (1.5.2-1) ...
Setting up libsensors-config (1:3.6.0-7) ...
Setting up libesmtp6 (1.0.6-4.3) ...
Setting up libnet1:arm64 (1.1.6+dfsg-3.1) ...
Setting up libprotobuf-c1:arm64 (1.3.3-1+b2) ...
Setting up libsnappy1v5:arm64 (1.1.8-1) ...
Setting up libbson-1.0-0 (1.17.6-1) ...
Setting up libivykis0:arm64 (0.42.4-1) ...
Setting up libriemann-client0:arm64 (1.10.4-2+b2) ...
Setting up libsensors5:arm64 (1:3.6.0-7) ...
Setting up librdkafka1:arm64 (1.6.0-1) ...
Setting up libhiredis0.14:arm64 (0.14.1-1) ...
Setting up libmongocrypt0:arm64 (1.1.0-1) ...
Setting up libsnmp40:arm64 (5.9+dfsg-4+deb11u1) ...
Setting up libmongoc-1.0-0 (1.17.6-1) ...
Setting up syslog-ng-core (3.28.1-2+deb11u1) ...
Created symlink /etc/systemd/system/multi-user.target.wants/syslog-ng.service → /lib/systemd/system/syslog-ng.service.
Setting up syslog-ng-mod-examples (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-xml-parser (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-stomp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-riemann (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-stardate (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-geoip2 (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-getent (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-amqp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-python (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-smtp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-snmp (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-extra (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-rdkafka (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-graphite (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-add-contextual-data (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-mongodb (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-http (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-slog (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-map-value-pairs (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-sql (3.28.1-2+deb11u1) ...
Setting up syslog-ng-mod-redis (3.28.1-2+deb11u1) ...
Setting up syslog-ng (3.28.1-2+deb11u1) ...
Processing triggers for man-db (2.9.4-2) ...
Processing triggers for libc-bin (2.31-13+rpt2+rpi1+deb11u8) ...
Stderr: debconf: unable to initialize frontend: Dialog
debconf: (TERM is not set, so the dialog frontend is not usable.)
debconf: falling back to frontend: Readline
debconf: unable to initialize frontend: Readline
debconf: (This frontend requires a controlling tty.)
debconf: falling back to frontend: Teletype
dpkg-preconfigure: unable to re-open stdin: 
....
....

7. Following the installation of syslog-ng across Nodes 2-7, we verify that the installation was successful by checking the syslog-ng service status.

Output of parallel-ssh -h /opt/workers -i "systemctl status syslog-ng":

root@turingpi:~# parallel-ssh -h /opt/workers -i "systemctl status syslog-ng" 
[1] 14:03:46 [SUCCESS] kemeny
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:01 EDT; 6min ago
       Docs: man:syslog-ng(8)
   Main PID: 28694 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 40.228s
     CGroup: /system.slice/syslog-ng.service
             └─28694 /usr/sbin/syslog-ng -F

Mar 28 13:57:00 kemeny systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:01 kemeny syslog-ng[28694]: DIGEST-MD5 common mech free
Mar 28 13:57:01 kemeny systemd[1]: Started System Logger Daemon.
[2] 14:03:50 [SUCCESS] vonkarman
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:49 EDT; 5min ago
       Docs: man:syslog-ng(8)
   Main PID: 27486 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 2min 5.540s
     CGroup: /system.slice/syslog-ng.service
             └─27486 /usr/sbin/syslog-ng -F

Mar 28 13:57:44 vonkarman systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:46 vonkarman syslog-ng[27486]: DIGEST-MD5 common mech free
Mar 28 13:57:49 vonkarman systemd[1]: Started System Logger Daemon.
[3] 14:03:51 [SUCCESS] teller
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:39 EDT; 6min ago
       Docs: man:syslog-ng(8)
   Main PID: 24821 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 2min 262ms
     CGroup: /system.slice/syslog-ng.service
             └─24821 /usr/sbin/syslog-ng -F

Mar 28 13:57:38 teller systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:38 teller syslog-ng[24821]: DIGEST-MD5 common mech free
Mar 28 13:57:39 teller systemd[1]: Started System Logger Daemon.
[4] 14:03:53 [SUCCESS] neumann
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:39 EDT; 6min ago
       Docs: man:syslog-ng(8)
   Main PID: 27734 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 1min 43.504s
     CGroup: /system.slice/syslog-ng.service
             └─27734 /usr/sbin/syslog-ng -F

Mar 28 13:57:38 neumann systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:38 neumann syslog-ng[27734]: DIGEST-MD5 common mech free
Mar 28 13:57:39 neumann systemd[1]: Started System Logger Daemon.
[5] 14:03:53 [SUCCESS] wigner
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:37 EDT; 6min ago
       Docs: man:syslog-ng(8)
   Main PID: 27512 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 1min 49.643s
     CGroup: /system.slice/syslog-ng.service
             └─27512 /usr/sbin/syslog-ng -F

Mar 28 13:57:36 wigner systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:36 wigner syslog-ng[27512]: DIGEST-MD5 common mech free
Mar 28 13:57:37 wigner systemd[1]: Started System Logger Daemon.
[6] 14:03:57 [SUCCESS] szilard
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 13:57:35 EDT; 6min ago
       Docs: man:syslog-ng(8)
   Main PID: 24136 (syslog-ng)
      Tasks: 5 (limit: 779)
        CPU: 2min 10.257s
     CGroup: /system.slice/syslog-ng.service
             └─24136 /usr/sbin/syslog-ng -F

Mar 28 13:57:34 szilard systemd[1]: Starting System Logger Daemon...
Mar 28 13:57:34 szilard syslog-ng[24136]: DIGEST-MD5 common mech free
Mar 28 13:57:35 szilard systemd[1]: Started System Logger Daemon.

8. Create the configuration file send.conf in /opt on host turingpi. Note that /opt is an NFS export on turingpi and is NFS-mounted by all of the compute nodes. This file sets the HOST field of outgoing log messages to the local hostname. This is done in the subsequent steps, where the string “placeholder” is replaced with the local hostname using sed. Additionally, a source s_hpc is defined which watches /opt/ibm/lsf/log for LSF daemon logfiles.

 
root@turingpi:/# cat /opt/send.conf
rewrite r_host { set("placeholder", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_src);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

On Nodes 2-7, copy the file /opt/send.conf to /etc/syslog-ng/conf.d/send.conf.

 
root@turingpi:/# parallel-ssh -h /opt/workers -i "cp /opt/send.conf /etc/syslog-ng/conf.d" 
[1] 14:19:29 [SUCCESS] kemeny
[2] 14:19:30 [SUCCESS] vonkarman
[3] 14:19:30 [SUCCESS] wigner
[4] 14:19:30 [SUCCESS] szilard
[5] 14:19:30 [SUCCESS] teller
[6] 14:19:31 [SUCCESS] neumann

Using sed, replace the “placeholder” string in /etc/syslog-ng/conf.d/send.conf with the local hostname, then double-check that the change was made correctly.

 
root@turingpi:/# parallel-ssh -h /opt/workers -i 'HOST=`hostname`; sed -i "s/placeholder/$HOST/g" /etc/syslog-ng/conf.d/send.conf' 
[1] 14:38:09 [SUCCESS] kemeny
[2] 14:38:09 [SUCCESS] teller
[3] 14:38:09 [SUCCESS] vonkarman
[4] 14:38:09 [SUCCESS] wigner
[5] 14:38:09 [SUCCESS] neumann
[6] 14:38:09 [SUCCESS] szilard

Output of parallel-ssh -h /opt/workers -i "cat /etc/syslog-ng/conf.d/send.conf":

root@turingpi:/# parallel-ssh -h /opt/workers -i "cat /etc/syslog-ng/conf.d/send.conf"
[1] 14:38:33 [SUCCESS] kemeny
rewrite r_host { set("kemeny", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

[2] 14:38:33 [SUCCESS] teller
rewrite r_host { set("teller", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

[3] 14:38:33 [SUCCESS] neumann
rewrite r_host { set("neumann", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

[4] 14:38:33 [SUCCESS] szilard
rewrite r_host { set("szilard", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

[5] 14:38:33 [SUCCESS] wigner
rewrite r_host { set("wigner", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};

[6] 14:38:33 [SUCCESS] vonkarman
rewrite r_host { set("vonkarman", value("HOST")); };

destination d_net {
  syslog("turingpi" port(601));
};
source s_hpc {
  wildcard-file(
      base-dir("/opt/ibm/lsf/log")
      filename-pattern("*.log.*")
      recursive(no)
      follow-freq(1)
  );
};
log {
  source(s_sys);
  source(s_hpc);
  rewrite(r_host); 
  destination(d_net);
};
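The placeholder substitution and the follow-up check can be rehearsed on a scratch copy before touching the nodes. The sketch below is only an illustration: the helper name set_host_in_conf is hypothetical, GNU sed is assumed for -i, and uname -n is used as a portable stand-in for the hostname command.

```shell
#!/bin/sh
# set_host_in_conf FILE NAME: replace the literal string "placeholder" with NAME
# and return non-zero if any occurrence is left behind
# (hypothetical helper; GNU sed -i assumed).
set_host_in_conf() {
  sed -i "s/placeholder/$2/g" "$1"
  ! grep -q 'placeholder' "$1"
}

# Rehearse on a scratch file containing the rewrite rule from send.conf.
scratch=$(mktemp)
echo 'rewrite r_host { set("placeholder", value("HOST")); };' > "$scratch"
set_host_in_conf "$scratch" "$(uname -n)" && cat "$scratch"
rm -f "$scratch"
```

Because the function returns a failure status when the string survives, it could be chained with && in front of a service restart so that a node with an unmodified file is not restarted.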

11. Finally, syslog-ng is restarted on Nodes 2-7 and the status of the service is checked to ensure that there are no errors.

 
root@turingpi:/opt# parallel-ssh -h /opt/workers -i "systemctl restart syslog-ng" 
[1] 14:49:03 [SUCCESS] kemeny
[2] 14:49:05 [SUCCESS] szilard
[3] 14:49:06 [SUCCESS] vonkarman
[4] 14:49:06 [SUCCESS] neumann
[5] 14:49:06 [SUCCESS] teller
[6] 14:49:07 [SUCCESS] wigner
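With only six nodes the status tags are easy to read by eye, but on a larger cluster it can help to filter the parallel-ssh summary lines. The sketch below prints the hosts whose tag is anything other than [SUCCESS]. The helper name pssh_failures is hypothetical, and the [FAILURE] line in the example is invented sample input, not output from this cluster.

```shell
#!/bin/sh
# pssh_failures: read parallel-ssh output on stdin and print the host name from
# every summary line whose status tag is not [SUCCESS] (hypothetical helper).
# Summary lines are assumed to look like: [N] HH:MM:SS [STATUS] host ...
pssh_failures() {
  awk '/^\[[0-9]+\] [0-9:]+ \[[A-Z]+\]/ && $3 != "[SUCCESS]" { print $4 }'
}

# Invented sample input: one success and one failure.
printf '%s\n' \
  '[1] 14:49:03 [SUCCESS] kemeny' \
  '[2] 14:49:05 [FAILURE] szilard Exited with error code 255' | pssh_failures
```

In practice the output of a command such as the restart above would be piped into the function; an empty result means every node reported success.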

Output of parallel-ssh -h /opt/workers -i "systemctl status syslog-ng":

root@turingpi:/opt# parallel-ssh -h /opt/workers -i "systemctl status syslog-ng" 
[1] 14:49:31 [SUCCESS] kemeny
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:03 EDT; 28s ago
       Docs: man:syslog-ng(8)
   Main PID: 34982 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 398ms
     CGroup: /system.slice/syslog-ng.service
             └─34982 /usr/sbin/syslog-ng -F

Mar 28 14:49:02 kemeny systemd[1]: Starting System Logger Daemon...
Mar 28 14:49:02 kemeny syslog-ng[34982]: DIGEST-MD5 common mech free
Mar 28 14:49:03 kemeny systemd[1]: Started System Logger Daemon.
[2] 14:49:33 [SUCCESS] vonkarman
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:06 EDT; 25s ago
       Docs: man:syslog-ng(8)
   Main PID: 33710 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 934ms
     CGroup: /system.slice/syslog-ng.service
             └─33710 /usr/sbin/syslog-ng -F

Mar 28 14:49:03 vonkarman systemd[1]: Starting System Logger Daemon...
Mar 28 14:49:03 vonkarman syslog-ng[33710]: DIGEST-MD5 common mech free
Mar 28 14:49:06 vonkarman systemd[1]: Started System Logger Daemon.
[3] 14:49:33 [SUCCESS] neumann
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:06 EDT; 25s ago
       Docs: man:syslog-ng(8)
   Main PID: 34000 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 959ms
     CGroup: /system.slice/syslog-ng.service
             └─34000 /usr/sbin/syslog-ng -F

Mar 28 14:49:03 neumann systemd[1]: Starting System Logger Daemon...
Mar 28 14:49:03 neumann syslog-ng[34000]: DIGEST-MD5 common mech free
Mar 28 14:49:06 neumann systemd[1]: Started System Logger Daemon.
[4] 14:49:33 [SUCCESS] wigner
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:07 EDT; 25s ago
       Docs: man:syslog-ng(8)
   Main PID: 33941 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 1.115s
     CGroup: /system.slice/syslog-ng.service
             └─33941 /usr/sbin/syslog-ng -F

Mar 28 14:49:03 wigner systemd[1]: Starting System Logger Daemon...
Mar 28 14:49:04 wigner syslog-ng[33941]: DIGEST-MD5 common mech free
Mar 28 14:49:07 wigner systemd[1]: Started System Logger Daemon.
[5] 14:49:34 [SUCCESS] szilard
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:05 EDT; 26s ago
       Docs: man:syslog-ng(8)
   Main PID: 30348 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 816ms
     CGroup: /system.slice/syslog-ng.service
             └─30348 /usr/sbin/syslog-ng -F

Mar 28 14:49:03 szilard systemd[1]: Starting System Logger Daemon...
Mar 28 14:49:03 szilard syslog-ng[30348]: DIGEST-MD5 common mech free
Mar 28 14:49:05 szilard systemd[1]: Started System Logger Daemon.
[6] 14:49:34 [SUCCESS] teller
● syslog-ng.service - System Logger Daemon
     Loaded: loaded (/lib/systemd/system/syslog-ng.service; enabled; vendor preset: enabled)
     Active: active (running) since Thu 2024-03-28 14:49:06 EDT; 25s ago
       Docs: man:syslog-ng(8)
   Main PID: 31034 (syslog-ng)
      Tasks: 2 (limit: 779)
        CPU: 965ms
     CGroup: /system.slice/syslog-ng.service
             └─31034 /usr/sbin/syslog-ng -F

Does it work?

The answer to this question is an emphatic YES!

Let’s begin with a simple test running the logger command on all of the compute nodes, while monitoring /var/log/fromnet on host turingpi.

 
root@turingpi:/home/lsfadmin# date; parallel-ssh -h /opt/workers -i 'HOST=`hostname`; logger This is a test from node $HOST. Do not panic!' 
Wed  3 Apr 21:41:45 EDT 2024 
[1] 21:41:46 [SUCCESS] teller 
[2] 21:41:46 [SUCCESS] neumann 
[3] 21:41:46 [SUCCESS] wigner 
[4] 21:41:46 [SUCCESS] kemeny 
[5] 21:41:46 [SUCCESS] szilard 
[6] 21:41:46 [SUCCESS] vonkarman

root@turingpi:/var/log# tail -f fromnet |grep panic 
Apr  3 21:41:46 szilard root[10918]: This is a test from node szilard. Do not panic! 
Apr  3 21:41:46 wigner root[11011]: This is a test from node wigner. Do not panic! 
Apr  3 21:41:46 neumann root[11121]: This is a test from node neumann. Do not panic! 
Apr  3 21:41:46 kemeny root[11029]: This is a test from node kemeny. Do not panic! 
Apr  3 21:41:46 teller root[10875]: This is a test from node teller. Do not panic! 
Apr  3 21:41:46 vonkarman root[10805]: This is a test from node vonkarman. Do not panic!
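Checking by eye that all six nodes reported works here, but the comparison can also be automated. The following sketch lists any host from a workers file that has no matching line in the collected log. The helper name missing_hosts is hypothetical, the match is a plain substring test rather than a parse of the syslog fields, and the example uses scratch files in place of /opt/workers and /var/log/fromnet.

```shell
#!/bin/sh
# missing_hosts WORKERS LOG PATTERN: print every host listed in WORKERS (one per
# line) that has no line in LOG containing both " <host> " and PATTERN.
# Hypothetical helper; it matches plain substrings, not parsed syslog fields.
missing_hosts() {
  while IFS= read -r host; do
    [ -n "$host" ] || continue
    grep -F -- "$3" "$2" | grep -qF -- " $host " || echo "$host"
  done < "$1"
}

# Scratch stand-ins for /opt/workers and /var/log/fromnet.
workers=$(mktemp); log=$(mktemp)
printf '%s\n' kemeny teller neumann > "$workers"
printf '%s\n' \
  'Apr  3 21:41:46 kemeny root[11029]: This is a test from node kemeny. Do not panic!' \
  'Apr  3 21:41:46 teller root[10875]: This is a test from node teller. Do not panic!' > "$log"
missing_hosts "$workers" "$log" 'Do not panic'
rm -f "$workers" "$log"
```

In this example only neumann is printed, because the scratch log deliberately omits its message.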

Next, let’s look at whether the LSF logging is also captured. Here we simply restart the LSF daemons on Nodes 2-7 and monitor the /var/log/fromnet file. The full output can be viewed below.

Output of tail -f /var/log/fromnet:

root@turingpi:/var/log# tail -f fromnet 
Apr  3 21:41:57 vonkarman systemd[10786]: systemd-exit.service: Succeeded. 
Apr  3 21:41:57 vonkarman systemd[10786]: Finished Exit the Session. 
Apr  3 21:41:57 vonkarman systemd[10786]: Reached target Exit the Session. 
Apr  3 21:41:57 vonkarman systemd[1]: user@0.service: Succeeded. 
Apr  3 21:41:57 vonkarman systemd[1]: Stopped User Manager for UID 0. 
Apr  3 21:41:57 vonkarman systemd[1]: Stopping User Runtime Directory /run/user/0... 
Apr  3 21:41:57 vonkarman systemd[1]: run-user-0.mount: Succeeded. 
Apr  3 21:41:57 vonkarman systemd[1]: user-runtime-dir@0.service: Succeeded. 
Apr  3 21:41:57 vonkarman systemd[1]: Stopped User Runtime Directory /run/user/0. 
Apr  3 21:41:57 vonkarman systemd[1]: Removed slice User Slice of UID 0. 
Apr  3 21:44:30 wigner dhcpcd[493]: eth0: Router Advertisement from fe80::da58:d7ff:fe00:6d83 
Apr  3 21:44:57 szilard sshd[11234]: Accepted publickey for root from 192.168.1.172 port 52600 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:57 szilard sshd[11234]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:58 szilard systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:58 szilard systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 szilard systemd-logind[382]: New session 30 of user root. 
Apr  3 21:44:58 szilard systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 szilard systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 szilard systemd[11237]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:57 wigner sshd[11342]: Accepted publickey for root from 192.168.1.172 port 60388 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:57 wigner sshd[11342]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:58 wigner systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:58 wigner systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 wigner systemd-logind[383]: New session 30 of user root. 
Apr  3 21:44:58 wigner systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 wigner systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 wigner systemd[11345]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:57 neumann sshd[11436]: Accepted publickey for root from 192.168.1.172 port 55144 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:57 neumann sshd[11436]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:57 neumann systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:57 neumann systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 neumann systemd-logind[398]: New session 30 of user root. 
Apr  3 21:44:58 neumann systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 neumann systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 neumann systemd[11439]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:57 kemeny sshd[11345]: Accepted publickey for root from 192.168.1.172 port 59830 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:57 kemeny sshd[11345]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:58 kemeny systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:58 kemeny systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 kemeny systemd-logind[386]: New session 30 of user root. 
Apr  3 21:44:58 kemeny systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 kemeny systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 kemeny systemd[11348]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:57 teller sshd[11189]: Accepted publickey for root from 192.168.1.172 port 35310 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:57 teller sshd[11189]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:58 teller systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:58 teller systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 teller systemd-logind[382]: New session 30 of user root. 
Apr  3 21:44:58 teller systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 teller systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 teller systemd[11192]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:57 vonkarman sshd[11118]: Accepted publickey for root from 192.168.1.172 port 48654 ssh2: ED25519 SHA256:xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
Apr  3 21:44:58 vonkarman sshd[11118]: pam_unix(sshd:session): session opened for user root(uid=0) by (uid=0) 
Apr  3 21:44:58 vonkarman systemd[1]: Created slice User Slice of UID 0. 
Apr  3 21:44:58 vonkarman systemd[1]: Starting User Runtime Directory /run/user/0... 
Apr  3 21:44:58 vonkarman systemd-logind[382]: New session 29 of user root. 
Apr  3 21:44:58 vonkarman systemd[1]: Finished User Runtime Directory /run/user/0. 
Apr  3 21:44:58 vonkarman systemd[1]: Starting User Manager for UID 0... 
Apr  3 21:44:58 vonkarman systemd[11121]: pam_unix(systemd-user:session): session opened for user root(uid=0) by (uid=0)
Apr  3 21:44:58 neumann systemd[11439]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 neumann systemd[11439]: Created slice User Application Slice. 
Apr  3 21:44:58 neumann systemd[11439]: Reached target Paths. 
Apr  3 21:44:58 neumann systemd[11439]: Reached target Timers. 
Apr  3 21:44:58 neumann systemd[11439]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 neumann systemd[11439]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 neumann systemd[11439]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 neumann systemd[11439]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 neumann systemd[11439]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 neumann systemd[11439]: Reached target Sockets. 
Apr  3 21:44:58 neumann systemd[11439]: Reached target Basic System. 
Apr  3 21:44:58 neumann systemd[11439]: Reached target Main User Target. 
Apr  3 21:44:58 neumann systemd[11439]: Startup finished in 379ms. 
Apr  3 21:44:58 neumann systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 neumann systemd[1]: Started Session 30 of user root. 
Apr  3 21:44:58 teller systemd[11192]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 teller systemd[11192]: Created slice User Application Slice. 
Apr  3 21:44:58 teller systemd[11192]: Reached target Paths. 
Apr  3 21:44:58 teller systemd[11192]: Reached target Timers. 
Apr  3 21:44:58 teller systemd[11192]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 teller systemd[11192]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 teller systemd[11192]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 teller systemd[11192]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 teller systemd[11192]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 teller systemd[11192]: Reached target Sockets. 
Apr  3 21:44:58 teller systemd[11192]: Reached target Basic System. 
Apr  3 21:44:58 teller systemd[11192]: Reached target Main User Target. 
Apr  3 21:44:58 teller systemd[11192]: Startup finished in 373ms. 
Apr  3 21:44:58 teller systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 teller systemd[1]: Started Session 30 of user root. 
Apr  3 21:44:58 vonkarman systemd[11121]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 vonkarman systemd[11121]: Created slice User Application Slice. 
Apr  3 21:44:58 vonkarman systemd[11121]: Reached target Paths. 
Apr  3 21:44:58 vonkarman systemd[11121]: Reached target Timers. 
Apr  3 21:44:58 vonkarman systemd[11121]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 vonkarman systemd[11121]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 vonkarman systemd[11121]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 vonkarman systemd[11121]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 vonkarman systemd[11121]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 vonkarman systemd[11121]: Reached target Sockets. 
Apr  3 21:44:58 vonkarman systemd[11121]: Reached target Basic System. 
Apr  3 21:44:58 vonkarman systemd[11121]: Reached target Main User Target. 
Apr  3 21:44:58 vonkarman systemd[11121]: Startup finished in 392ms. 
Apr  3 21:44:58 vonkarman systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 vonkarman systemd[1]: Started Session 29 of user root. 
Apr  3 21:44:58 szilard systemd[11237]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 szilard systemd[11237]: Created slice User Application Slice. 
Apr  3 21:44:58 szilard systemd[11237]: Reached target Paths. 
Apr  3 21:44:58 szilard systemd[11237]: Reached target Timers. 
Apr  3 21:44:58 szilard systemd[11237]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 szilard systemd[11237]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 szilard systemd[11237]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 szilard systemd[11237]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 szilard systemd[11237]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 szilard systemd[11237]: Reached target Sockets. 
Apr  3 21:44:58 szilard systemd[11237]: Reached target Basic System. 
Apr  3 21:44:58 szilard systemd[11237]: Reached target Main User Target. 
Apr  3 21:44:58 szilard systemd[11237]: Startup finished in 385ms. 
Apr  3 21:44:58 szilard systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 szilard systemd[1]: Started Session 30 of user root. 
Apr  3 21:44:58 wigner systemd[11345]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 wigner systemd[11345]: Created slice User Application Slice. 
Apr  3 21:44:58 wigner systemd[11345]: Reached target Paths. 
Apr  3 21:44:58 wigner systemd[11345]: Reached target Timers. 
Apr  3 21:44:58 wigner systemd[11345]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 wigner systemd[11345]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 wigner systemd[11345]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 wigner systemd[11345]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 wigner systemd[11345]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 wigner systemd[11345]: Reached target Sockets. 
Apr  3 21:44:58 wigner systemd[11345]: Reached target Basic System. 
Apr  3 21:44:58 wigner systemd[11345]: Reached target Main User Target. 
Apr  3 21:44:58 wigner systemd[11345]: Startup finished in 375ms. 
Apr  3 21:44:58 wigner systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 wigner systemd[1]: Started Session 30 of user root. 
Apr  3 21:44:58 kemeny systemd[11348]: Queued start job for default target Main User Target. 
Apr  3 21:44:58 kemeny systemd[11348]: Created slice User Application Slice. 
Apr  3 21:44:58 kemeny systemd[11348]: Reached target Paths. 
Apr  3 21:44:58 kemeny systemd[11348]: Reached target Timers. 
Apr  3 21:44:58 kemeny systemd[11348]: Listening on GnuPG network certificate management daemon. 
Apr  3 21:44:58 kemeny systemd[11348]: Listening on GnuPG cryptographic agent and passphrase cache (access for web browsers).
Apr  3 21:44:58 kemeny systemd[11348]: Listening on GnuPG cryptographic agent and passphrase cache (restricted).
Apr  3 21:44:58 kemeny systemd[11348]: Listening on GnuPG cryptographic agent (ssh-agent emulation). 
Apr  3 21:44:58 kemeny systemd[11348]: Listening on GnuPG cryptographic agent and passphrase cache. 
Apr  3 21:44:58 kemeny systemd[11348]: Reached target Sockets. 
Apr  3 21:44:58 kemeny systemd[11348]: Reached target Basic System. 
Apr  3 21:44:58 kemeny systemd[11348]: Reached target Main User Target. 
Apr  3 21:44:58 kemeny systemd[11348]: Startup finished in 400ms. 
Apr  3 21:44:58 kemeny systemd[1]: Started User Manager for UID 0. 
Apr  3 21:44:58 kemeny systemd[1]: Started Session 30 of user root. 
Apr  3 21:44:59 kemeny res[691]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 kemeny lim[688]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 kemeny sbatchd[693]: Daemon on host <kemeny> received signal <15>; exiting 
Apr  3 21:44:59 kemeny lsf_daemons[11434]: Stopping the LSF subsystem 
Apr  3 21:44:59 kemeny systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 kemeny systemd[1]: lsfd.service: Consumed 11min 56.744s CPU time. 
Apr  3 21:44:59 szilard lim[685]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 szilard res[687]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 szilard sbatchd[689]: Daemon on host <szilard> received signal <15>; exiting 
Apr  3 21:44:59 vonkarman lim[686]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 vonkarman sbatchd[690]: Daemon on host <vonkarman> received signal <15>; exiting 
Apr  3 21:44:59 vonkarman res[688]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 teller lim[683]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 teller res[689]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 teller sbatchd[691]: Daemon on host <teller> received signal <15>; exiting 
Apr  3 21:44:59 teller lsf_daemons[11294]: Stopping the LSF subsystem 
Apr  3 21:44:59 wigner lim[719]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 wigner res[722]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 wigner sbatchd[724]: Daemon on host <wigner> received signal <15>; exiting 
Apr  3 21:44:59 wigner lsf_daemons[11438]: Stopping the LSF subsystem 
Apr  3 21:44:59 neumann res[713]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 neumann sbatchd[715]: Daemon on host <neumann> received signal <15>; exiting 
Apr  3 21:44:59 neumann lim[711]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 neumann lsf_daemons[11540]: Stopping the LSF subsystem 
Apr  3 21:44:59 neumann sshd[11436]: Received disconnect from 192.168.1.172 port 55144:11: disconnected by user
Apr  3 21:44:59 neumann sshd[11436]: Disconnected from user root 192.168.1.172 port 55144
Apr  3 21:44:59 szilard lsf_daemons[11331]: Stopping the LSF subsystem
Apr  3 21:44:59 szilard sshd[11234]: Received disconnect from 192.168.1.172 port 52600:11: disconnected by user
Apr  3 21:44:59 szilard sshd[11234]: Disconnected from user root 192.168.1.172 port 52600 
Apr  3 21:44:59 szilard sshd[11234]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 szilard res[11357]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 szilard systemd-logind[382]: Session 30 logged out. Waiting for processes to exit.
Apr  3 21:44:59 szilard res[11357]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 szilard systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 szilard systemd[1]: lsfd.service: Consumed 1h 17min 44.040s CPU time. 
Apr  3 21:44:59 neumann sshd[11436]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 neumann systemd-logind[398]: Session 30 logged out. Waiting for processes to exit. 
Apr  3 21:44:59 neumann res[11559]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 neumann res[11559]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 neumann systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 neumann systemd[1]: lsfd.service: Consumed 1h 17min 21.135s CPU time. 
Apr  3 21:44:59 teller sshd[11189]: Received disconnect from 192.168.1.172 port 35310:11: disconnected by user 
Apr  3 21:44:59 teller sshd[11189]: Disconnected from user root 192.168.1.172 port 35310 
Apr  3 21:44:59 teller sshd[11189]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 teller systemd-logind[382]: Session 30 logged out. Waiting for processes to exit. 
Apr  3 21:44:59 teller res[11307]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 teller res[11307]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 teller res[11307]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 teller lim[11305]: term_handler: Received signal 15, exiting 
Apr  3 21:44:59 teller systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 teller systemd[1]: lsfd.service: Consumed 1h 17min 47.675s CPU time. 
Apr  3 21:44:59 teller sbatchd[11309]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 kemeny sshd[11345]: Received disconnect from 192.168.1.172 port 59830:11: disconnected by user 
Apr  3 21:44:59 kemeny sshd[11345]: Disconnected from user root 192.168.1.172 port 59830 
Apr  3 21:44:59 kemeny sshd[11345]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 kemeny systemd-logind[386]: Session 30 logged out. Waiting for processes to exit. 
Apr  3 21:44:59 kemeny res[11467]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 kemeny res[11467]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 vonkarman lsf_daemons[11215]: Stopping the LSF subsystem 
Apr  3 21:44:59 vonkarman sshd[11118]: Received disconnect from 192.168.1.172 port 48654:11: disconnected by user
Apr  3 21:44:59 vonkarman sshd[11118]: Disconnected from user root 192.168.1.172 port 48654 
Apr  3 21:44:59 vonkarman sshd[11118]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 vonkarman systemd-logind[382]: Session 29 logged out. Waiting for processes to exit. 
Apr  3 21:44:59 vonkarman res[11241]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 vonkarman res[11241]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 vonkarman systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 vonkarman systemd[1]: lsfd.service: Consumed 1h 17min 34.650s CPU time. 
Apr  3 21:44:59 wigner sshd[11342]: Received disconnect from 192.168.1.172 port 60388:11: disconnected by user 
Apr  3 21:44:59 wigner sshd[11342]: Disconnected from user root 192.168.1.172 port 60388 
Apr  3 21:44:59 wigner sshd[11342]: pam_unix(sshd:session): session closed for user root 
Apr  3 21:44:59 wigner res[11464]: res/get_hostInfo: ls_gethostinfo() failed. Server host LIM configuration is not ready yet.
Apr  3 21:44:59 wigner systemd-logind[383]: Session 30 logged out. Waiting for processes to exit.
Apr  3 21:44:59 wigner res[11464]: cg_load_hierarchies: Please use the LSF package with higher glibc version to enable LSF cgroup v2 support.
Apr  3 21:44:59 wigner systemd[1]: lsfd.service: Succeeded. 
Apr  3 21:44:59 wigner systemd[1]: lsfd.service: Consumed 1h 17min 44.610s CPU time.

As expected, the LSF log messages are written to the fromnet file. Importantly, each entry contains the hostname, so we can identify the origin of each message.
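Because the hostname sits in a fixed position in every entry, a quick per-node tally is easy to produce. This is a rough sketch rather than anything syslog-ng provides: the function name count_by_host is hypothetical, and it assumes the classic syslog layout seen above, where the third field is the time and the fourth is the hostname.

```shell
# Count log entries per originating host in a syslog-style file.
# Argument: $1 = path to the log file.
count_by_host() {
    # Only lines whose third field looks like HH:MM:SS are counted, which
    # skips continuation fragments left behind by wrapped lines.
    awk '$3 ~ /^[0-9][0-9]:[0-9][0-9]:[0-9][0-9]$/ { count[$4]++ }
         END { for (h in count) print h, count[h] }' "$1" | sort
}
```

Running count_by_host /var/log/fromnet prints one line per node with its message count, which makes a silent node or an unusually chatty one stand out.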

Conclusion

What started out as a chat about logging grew into an idea for a blog, and I am thankful to Peter for his collaboration. We’ve illustrated here how to set up centralized logging on a Turing Pi system with syslog-ng to collect system and LSF logs.

Of course, collecting log messages centrally is just the start of a journey. It is an important step, as it allows for significantly easier debugging and troubleshooting. You can store logs in databases for easier searching. And once you better understand which log messages are important, you can even parse those and generate alerts or dashboards from them. All of these help you to make sure that your HPC system runs smoothly and with minimal downtime. For me this was a learning experience, and I’ll be looking at how I can implement centralized logging more broadly in my home network.

4 turning and 7 chilling

2024-03-21T18:09:30-06:00

How to keep your cool

I’m back again and revisiting the Turing Pi V1 board. This time the focus isn’t on software, but rather on cooling. In my previous write-up, Pi in the sky? A compute cluster in mini ITX form factor, I used a USB fan I had at hand to keep the temperature of the compute modules in check during the Linpack runs. Although the fan was a seriously sketchy one, it did the job and prevented throttling of the compute modules under high load, albeit with much noise. Clearly not content with this mediocre setup, I pondered what other solutions I could quickly come up with.

Looking in my electronics spare parts bin, I came across 2 spare Noctua 40x40x20mm fans, part number NF-A4x20 PWM. I found that these fans fit well on the Turing Pi board perpendicular to the compute modules. I measured that for full cooling coverage of the compute modules I’d need 4 such fans, side by side. However, before investing in two more fans, I needed to confirm that they had enough oomph (yes, that’s a technical term) to keep things cool.

So my plan was to first test two fans cooling half of the modules. However, to test these fans out, I first needed to get hold of some USB to 3/4-pin fan power adapter cables. Once I had these adapters, I used a thick elastic band to bind the 2 fans together, connected them to USB for power using the adapters, and gave them a whirl - pun intended. Of course, I fell back on Linpack to get the compute modules busy.

The results were promising enough that I immediately ordered two more fans and adapters to complete the setup which is shown in the photo below. A thick elastic band was used again to fasten the remaining 2 fans together. Of course, the setup will be made more robust to ensure that fans will stay in place. And I’ll do a bit of work on cable management.

Totally chill

The view of the dashboard (see below) speaks for itself. Under heavy load running Linpack, the compute modules don’t exceed 50C. This is about 10 degrees cooler than what I saw with that USB desk fan. So I’d consider that a result. Plus the Noctua fans are so much quieter and will be much more durable in the long run.

Conclusion

So where does the title of my blog come from? It’s inspired by the slogan “6 turning and 4 burning” of the B-36 Peacemaker strategic bomber! You can see the B-36 in all its glory in this short excerpt from the 1955 film Strategic Air Command starring Jimmy Stewart. You could say I have eclectic taste in films. Plus, the B-36 has always fascinated me with its combination of jet and piston engines. As for this blog, 4 turning obviously refers to the 4 Noctua fans turning. And 7 chilling refers to the 7 CM3 modules that now keep their cool under pressure. With a more suitable cooling solution in place, especially as the warmer days arrive, I can now refocus my attention on the software side of things. And as always, stay cool!

Advanced LSF resource connector configuration on IBM Cloud - part II

2024-03-19T20:40:52-06:00

Overview

Back in November 2023 I authored a blog titled Advanced LSF resource connector configuration on IBM Cloud - part I. As I signed off in that post, I mentioned that there would be a follow-on post to cover some more advanced configuration topics on LSF resource connector. And that’s the topic of this article today.

To recap, the IBM LSF resource connector functionality enables LSF clusters to dynamically spin-up cloud instances from supported resource providers in the cloud based on workload demand, and to destroy those instances when no longer required.

The LSF resource connector intelligently chooses the most appropriate cloud instance type for a given job from the templates that have been defined by the administrator. This is done automatically and is transparent from the end user perspective. But what if a job needs to run on a very specific instance type? In this post we’ll show you how this is feasible by defining an LSF string resource, along with the necessary configuration of LSF resource connector and a supporting script. This will allow users to submit jobs to LSF with a resource requirement string specifying the desired cloud instance type.

For the example below, we’ll be using an IBM LSF environment which has been deployed on IBM Cloud using the extensive deployment automation that is available via the IBM Cloud catalog for IBM LSF. Using this automation, you can deploy an LSF cluster in about 10 minutes time including the creation of the virtual private cloud (VPC), networking, security, bastion node, NFS server node, LSF management nodes and optionally LSF Application Center.

What is user_data.sh?

We’ll start with a brief description of the LSF resource connector user_data.sh script. This script will play an important part in the configuration of the compute servers, as we’ll see. The user_data.sh script is used to start up the LSF daemons on the compute instances launched by LSF resource connector. Crucially, it also enables admins to configure settings, including LSF settings, which is what we’ll be using in the example below.

Specifying the cloud instance type

By default, the LSF resource connector intelligently chooses the cloud instance profile type based upon the job submission parameters. For example, it considers things like the number of processors requested and the memory requested, just to name a few. It will then start up the compute instance or instances from the available configured templates that most closely match the job requirements.

What if you need to request a very specific compute instance type for the work that you’ve submitted based upon other, site-specific needs? Here we will show exactly how you can achieve this.

Let the configuration begin!

We begin with updating the LSF configuration to create a new string resource called profile. In the configuration file $LSF_ENVDIR/lsf.shared, define the new string resource profile in the Resource section.

….
….
Begin Resource
RESOURCENAME	TYPE	    INTERVAL	INCREASING	DESCRIPTION        # Keywords
profile 	  String   ()       ()             (IBM Cloud Gen2 profile type)
End Resource
….
….

To make the change take effect, reconfigure the LSF cluster with the LSF command lsadmin reconfig.

# lsadmin reconfig -v

Checking configuration files ...


EGO 3.4.0 build 1599999, Jan 04 2023
Copyright International Business Machines Corp. 1992, 2016.
US Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp.

  binary type: linux3.10-glibc2.17-x86_64
Reading configuration from /opt/ibm/lsf/conf/lsf.conf
Mar 13 16:57:25 2024 1478621 6 3.4.0 Lim starting...
Mar 13 16:57:25 2024 1478621 6 3.4.0 LIM is running in advanced workload execution mode.
Mar 13 16:57:25 2024 1478621 6 3.4.0 Master LIM is not running in EGO_DISABLE_UNRESOLVABLE_HOST mode.
Mar 13 16:57:25 2024 1478621 5 3.4.0 /opt/ibm/lsf/10.1/linux3.10-glibc2.17-x86_64/etc/lim -C
Mar 13 16:57:26 2024 1478621 6 3.4.0 LIM is running as IBM Spectrum LSF Standard Edition.
Mar 13 16:57:26 2024 1478621 6 3.4.0 reCheckClass: numhosts 1 so reset exchIntvl to 15.00
Mar 13 16:57:26 2024 1478621 6 3.4.0 Checking Done.
---------------------------------------------------------
No errors found.

Restart only the master candidate hosts? [y/n] n
Do you really want to restart LIMs on all hosts? [y/n] y
Restart LIM on  ...... done

Now, check that the profile variable has been set up properly. This can be done using the LSF lsinfo command.

# lsinfo |grep profile
profile        String   N/A   IBM Cloud Gen2 profile type

Now we’re ready to update the LSF resource connector templates to add the profile string variable. For this example, there are two templates defined for IBM Cloud profile types bx2-4x16 and mx2-16x128 in the configuration file ibmcloudgen2_templates.json, located in $LSF_ENVDIR/resource_connector/ibmcloudgen2/conf. Within each template definition, the profile string variable is defined and set to a value for the respective profile type. Note that the “-” character cannot be used in LSF string variables, so the “_” character is used in its place. The profile string defined in each template is used as the selection criteria by LSF resource connector. The userData field then ensures that this value gets passed to, and set in, the compute instance started by the LSF resource connector when the user_data.sh script is run.

Instance type	LSF profile variable string value
bx2-4x16	bx2_4x16
mx2-16x128	mx2_16x128
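The hyphen-to-underscore mapping in the table is mechanical, so it can be expressed as a small shell helper when there are many templates to maintain. This is only an illustrative sketch. The function name to_lsf_profile is hypothetical and is not part of LSF or the resource connector.

```shell
# Convert an IBM Cloud instance profile name (e.g. bx2-4x16) into the
# form used for the LSF string resource value (e.g. bx2_4x16).
to_lsf_profile() {
    # Replace every "-" with "_", since "-" is not allowed in the string value.
    printf '%s\n' "$1" | sed 's/-/_/g'
}
```

For instance, to_lsf_profile mx2-16x128 prints mx2_16x128, matching the second row of the table.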

ibmcloudgen2_templates.json, with profile configured (obfuscated)

{
    "templates": [
        {
            "templateId": "Template-1",
            "maxNumber": 2,
            "attributes": {
                "type": ["String", "X86_64"],
                "ncores": ["Numeric", "2"],
                "ncpus": ["Numeric", "4"],
                "mem": ["Numeric", "16384"],
                "icgen2host": ["Boolean", "1"], 
                "profile":["String","bx2_4x16"]
            },
            "imageId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "subnetId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "vpcId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "vmType": "bx2-4x16",
            "userData":"profile=bx2_4x16",
            "securityGroupIds": ["aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff"],
            "resourceGroupId": "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa",
            "sshkey_id": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "priority": "10", 
            "region": "us-east",
            "zone": "us-east-1" 
        },

        {
            "templateId": "Template-2",
            "maxNumber": 2,
            "attributes": {
                "type": ["String", "X86_64"],
                "ncores": ["Numeric", "8"],
                "ncpus": ["Numeric", "16"],
                "mem": ["Numeric", "131072"],
                "icgen2host": ["Boolean", "1"],
                "profile":["String","mx2_16x128"]
            },
            "imageId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "subnetId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "vpcId": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "vmType": "mx2-16x128",
            "userData":"profile=mx2_16x128",
            "securityGroupIds": ["aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff"],
            "resourceGroupId": "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa",
            "sshkey_id": "aaaa-bbbbbbbb-cccc-dddd-eeee-ffffffffffff",
            "priority": "5",
            "region": "us-east",
            "zone": "us-east-1"
        }

    ]
}

Next, the user_data.sh script must be updated to set the value of the profile variable in the LSF resourcemap based upon what was requested by the user. This will be added to the LSF configuration during the bootup of the dynamic cloud instances. For more information about the LSF resourcemap, see the LSF documentation.

user_data.sh script portion

….
….
# Set value of profile variable in the LSF resourcemap. This is based
# on the profile value requested at job submission time. 
if [ -n "$profile" ]; then
    sed -i "s/\(LSF_LOCAL_RESOURCES=.*\)\"/\1 [resourcemap $profile*profile]\"/" "$LSF_CONF_FILE"
    echo "update LSF_LOCAL_RESOURCES in $LSF_CONF_FILE successfully, add [resourcemap ${profile}*profile]" >> "$logfile"
else
    echo "profile doesn't exist in environment variable" >> "$logfile"
fi
….
….
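
To see what the sed expression above does without booting an instance, it can be run against a scratch file. This is a minimal sketch, not part of the actual user_data.sh: the function name append_profile_resourcemap, the temporary file, and the starting value of LSF_LOCAL_RESOURCES are all assumptions made for illustration, and GNU sed is assumed for the -i option.

```shell
#!/bin/bash
# Sketch only: exercises the sed expression from user_data.sh on a scratch file.

# Append "[resourcemap <profile>*profile]" just inside the closing quote of the
# LSF_LOCAL_RESOURCES line in the given file (GNU sed -i assumed).
append_profile_resourcemap() {
    local conf_file="$1" profile="$2"
    sed -i "s/\(LSF_LOCAL_RESOURCES=.*\)\"/\1 [resourcemap ${profile}*profile]\"/" "$conf_file"
}

# Hypothetical starting content; a real lsf.conf holds many more settings.
demo_conf="$(mktemp)"
echo 'LSF_LOCAL_RESOURCES="[resource icgen2host]"' > "$demo_conf"

append_profile_resourcemap "$demo_conf" "bx2_4x16"
cat "$demo_conf"
# prints: LSF_LOCAL_RESOURCES="[resource icgen2host] [resourcemap bx2_4x16*profile]"
```

Because the profile value is interpolated straight into the sed expression, a value containing / or & would break the substitution. The profile names used in the templates here (bx2_4x16, mx2_16x128) contain neither.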

With all of the configuration in place, it’s now time to test things out. Initially, a stress job requesting 4 cores is submitted without requesting a specific compute profile. In this case, the LSF resource connector will choose the most appropriate instance type from the configured templates. In our configuration, templates for the instance types bx2-4x16 and mx2-16x128 are defined. Given this, we expect the LSF resource connector to start up a bx2-4x16 instance to satisfy the requirements of this example job.

$ bsub -n 4 -q normal -o /mnt/data/%J.out /usr/bin/stress --cpu 4 --vm-bytes 8192MB --timeout 60s 
Job <3811> is submitted to queue .

After a few moments, we see that a new host, icgen2host-XXX-YYY-ZZZ-44, joins the LSF cluster and the job enters the run state. Note that this host has 4 cores and 16 GB of RAM, which matches the bx2-4x16 instance type.
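
The same check can be scripted by filtering lshosts -w output for rows whose server column reads Dyn. That is the value shown for the dynamically added host in the output below. The following is a rough sketch that works on captured text rather than a live cluster: the function name list_dynamic_hosts is made up for illustration, the sample rows are copied from the lshosts -w output in this walkthrough, and the field positions assume that same column layout.

```shell
#!/bin/bash
# Sketch only: filters captured `lshosts -w` text; it does not call LSF.

# Print name, ncpus and maxmem for hosts whose "server" column (field 8) is Dyn.
list_dynamic_hosts() {
    awk 'NR > 1 && $8 == "Dyn" { print $1, $5, $6 }'
}

# Sample rows taken from the lshosts -w output in this walkthrough.
sample_lshosts='HOST_NAME type model cpuf ncpus maxmem maxswp server RESOURCES
icgen2host-XXX-YYY-ZZZ-37 X86_64 Intel_E5 12.5 4 15.4G - Yes (mg docker ParaView)
icgen2host-XXX-YYY-ZZZ-44 X86_64 Intel_E5 12.5 4 15.5G - Dyn (icgen2host docker)'

printf '%s\n' "$sample_lshosts" | list_dynamic_hosts
# prints: icgen2host-XXX-YYY-ZZZ-44 4 15.5G
```

On a live cluster the function would read from the command directly, for example lshosts -w | list_dynamic_hosts.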

$ lsload -w
HOST_NAME               status  r15s   r1m  r15m   ut    pg  ls    it   tmp   swp   mem
icgen2host-XXX-YYY-ZZZ-37      ok   0.4   0.1   0.1   2%   0.0   1    16   40G    0
icgen2host-XXX-YYY-ZZZ-44      ok   0.9   0.2   0.1  19%   0.0   0     0   88G    0M 14.9G

$ lshosts -w
HOST_NAME                       type       model  cpuf ncpus maxmem maxswp server RESOURCES
icgen2host-XXX-YYY-ZZZ-37        X86_64    Intel_E5  12.5     4  15.4G      -    Yes (mg docker ParaView)
icgen2host-XXX-YYY-ZZZ-44        X86_64    Intel_E5  12.5     4  15.5G      -    Dyn (icgen2host docker)

$ bjobs -l -r

Job <3811>, User , Project , Status , Queue , C
                     ommand , Share group charged 
Mon Mar 18 19:42:22: Submitted from host , CWD <$HOME>,
                      Output File , 4 Task(s);
Mon Mar 18 19:45:11: Started 4 Task(s) on Host(s)    , Allocated 4 Slot(s) on Host(s)    , Execution Home , Execution CWD ;
Mon Mar 18 19:45:47: Resource usage collected.
                     The CPU time used is 142 seconds.
                     MEM: 3 Mbytes;  SWAP: 0 Mbytes;  NTHREAD: 8
                     PGID: 2169;  PIDs: 2169 2170 2172 2173 2174 2175 2176 


 MEMORY USAGE:
 MAX MEM: 3 Mbytes;  AVG MEM: 3 Mbytes; MEM Efficiency: 0.00%

 CPU USAGE:
 CPU PEAK: 0.00 ;  CPU PEAK DURATION: 0 second(s)
 CPU AVERAGE EFFICIENCY: 0.00% ;  CPU PEAK EFFICIENCY: 0.00%

 SCHEDULING PARAMETERS:
           r15s   r1m  r15m   ut      pg    io   ls    it    tmp    swp    mem
 loadSched   -     -     -     -       -     -    -     -     -      -      -  
 loadStop    -     -     -     -       -     -    -     -     -      -      -  

 RESOURCE REQUIREMENT DETAILS:
 Combined: select[type == local] order[r15s:pg]
 Effective: select[type == local] order[r15s:pg] 

$ bhist -l 3811

Job <3811>, User , Project , Command 
Mon Mar 18 19:42:22: Submitted from host , to Queue , CWD <$HOME>, Output File , 4 Task
                     (s);
Mon Mar 18 19:45:11: Dispatched 4 Task(s) on Host(s)  <
                     icgen2host-XXX-YYY-ZZZ-44>  , Allocated 4 Slot(s) on Host(s)    , Effective RES_REQ