GigaIO Awarded Testbed Contract at TACC

2022-03-11 08:46:58 By : Ms. Catherine Chen

Since 1987 - Covering the Fastest Computers in the World and the People Who Run Them

SAN DIEGO, March 10, 2022 – GigaIO, creator of next-gen data center rack-scale architecture for Artificial Intelligence (AI) and High Performance Computing (HPC) solutions, today announced that production has begun on its CDI testbed in the Lonestar6 system at the Texas Advanced Computing Center (TACC) at The University of Texas at Austin. Lonestar6 is a 600-node system utilizing Milan-based AMD servers from Dell Technologies and A100 GPUs from NVIDIA, and is the first platform at TACC to incorporate Composable Disaggregated Infrastructure (CDI) in order to benefit from decentralized server infrastructure.

CDI pools compute and hardware accelerators over a software-defined PCIe-based memory fabric, thus providing access to more processing power and storage when needed, and allowing for easy sharing of those resources, thereby increasing their utilization. GigaIO’s composable infrastructure platform pairs this unlimited flexibility with the agility of the cloud, allowing researchers to build completely customized and otherwise impossible servers for their AI and HPC workflows. GigaIO is the only CDI vendor that can transform each server into an entire rack using only PCIe for the absolute lowest latency and highest bandwidth throughout.
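As a rough illustration of the pooling idea described above, the following Python sketch models a shared pool of accelerators that can be attached to and released from nodes on demand. It is a toy model only: the class `AcceleratorPool`, its methods, and the node names are hypothetical and are not part of FabreX or any GigaIO software.

```python
from dataclasses import dataclass, field


@dataclass
class AcceleratorPool:
    """Toy model of a shared accelerator pool (hypothetical, not a vendor API)."""

    total: int  # number of accelerators in the pool
    attached: dict = field(default_factory=dict)  # node name -> attached count

    def free(self) -> int:
        """Return how many accelerators are currently unattached."""
        return self.total - sum(self.attached.values())

    def compose(self, node: str, count: int) -> None:
        """Attach `count` pooled accelerators to `node`, if enough are free."""
        if count < 0:
            raise ValueError("count must be non-negative")
        if count > self.free():
            raise RuntimeError(f"only {self.free()} accelerators free")
        self.attached[node] = self.attached.get(node, 0) + count

    def release(self, node: str) -> None:
        """Return all of a node's accelerators to the pool."""
        self.attached.pop(node, None)

    def utilization(self) -> float:
        """Fraction of the pool currently attached to some node."""
        return sum(self.attached.values()) / self.total


# Two jobs with different accelerator needs share one pool of eight.
pool = AcceleratorPool(total=8)
pool.compose("node-a", 6)
pool.compose("node-b", 2)
util_busy = pool.utilization()

# When the first job ends, its accelerators become available to another node.
pool.release("node-a")
pool.compose("node-c", 3)
```

In this sketch the accelerators freed by one node are immediately available to another, which is the sharing behavior the paragraph above credits with raising utilization. A real fabric would also have to handle device discovery, hot-plug, and scheduling, none of which is modeled here.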

GigaIO does this through its universal composable fabric, FabreX, a unique and highly disruptive technology that can transform a rack of servers into a true rack-scale system without proprietary architecture lock-in. Making this happen requires both the ability to disaggregate and compose resources to servers, and the ability to run internode communications on the same network. FabreX orchestrates workloads by configuring any resource on the fly and integrating networking, storage, memory, and specialized accelerators into a single-system cluster fabric. Composing resources with FabreX dramatically lowers OpEx and CapEx costs due to increased resource efficiency, while improving both serviceability and upgradeability.

TACC chose GigaIO as the CDI solution for Lonestar6 for a number of reasons, chief among them the flexibility to seamlessly choose the best hardware accelerator (or number of accelerators) for each workload; the green savings on cooling, power, and footprint available through increased utilization and minimized server requirements; and most importantly, GigaIO’s truly open platform for heterogeneous architectures. GigaIO offers the only composable solution that does not require any proprietary orchestration software. Instead, the company is committed to being an open standards platform, and to working with leading northbound integration software vendors to integrate natively for ease of deployment.

“GigaIO allows us to mix the type and number of accelerators attached to each node, varying them with the particular mix of jobs at any given moment in time,” said Dan Stanzione, Executive Director of TACC. “GigaIO’s ability to efficiently scale resources across all open standards software and hardware is just one of the reasons we selected them and will be continuing to work with them on future cluster enhancements.”

TACC designs and operates some of the world’s most powerful computing resources. The center’s mission is to enable discoveries that advance science and society through the application of advanced computing technologies. Lonestar6 is the latest cluster environment that will be utilized for this work, including computational fluid dynamics, material science, and climate science.

GigaIO is providing Lonestar6’s fabric infrastructure, including switches, cards, cables, JBOGs (Just a Bunch Of GPUs), and composition software. “GigaIO’s composable infrastructure solution democratizes access to expensive specialized resources such as accelerators, which it shares across users and workloads in a way that is simple to implement for IT managers,” said Alan Benjamin, CEO of GigaIO. “FabreX will allow TACC to use any accelerator for any job without limitation, in a future-proof solution that can continue to grow as their processing needs grow.”

Headquartered in Carlsbad, Calif., GigaIO democratizes AI and HPC architectures by delivering the elasticity of the cloud at a fraction of the TCO (Total Cost of Ownership). With its universal dynamic infrastructure fabric, FabreX, and its innovative open architecture using industry-standard PCI Express (and soon CXL) technology, GigaIO breaks the constraints of the server box, liberating resources to shorten time to results. Data centers can scale up or scale out the performance of their systems, enabling their existing investment to flex as workloads and business needs change over time. For more information, contact info@gigaio.com or visit www.gigaio.com.

The Texas Advanced Computing Center (TACC) at The University of Texas at Austin is one of the leading supercomputing centers in the world. TACC’s mission is to enable discoveries that advance science and society through the application of advanced computing technologies. Tens of thousands of scientists and students use TACC’s supercomputers each year to answer complex questions in every field of science. TACC staff also encourage, educate, and train the next generation of researchers, empowering them to make discoveries that change the world.

© 2022 HPCwire. All Rights Reserved. A Tabor Communications Publication
