
NVIDIA’s NCCL 2.24 Enhances Networking Reliability and Observability

by Blockchain
March 14, 2025

Joerg Hiller Mar 14, 2025 02:22

NVIDIA’s NCCL 2.24 release adds features for multi-GPU and multinode communication in deep learning training, including a RAS subsystem, NIC Fusion, and FP8 support.


NVIDIA has released version 2.24 of the NVIDIA Collective Communications Library (NCCL), bringing advancements in networking reliability and observability for multi-GPU and multinode (MGMN) communication. As reported by the NVIDIA Developer Blog, the library is optimized specifically for NVIDIA GPUs and networking, making it an essential component of multi-GPU deep learning training.

NCCL 2.24 New Features

The update includes several new features aimed at enhancing performance and reliability:

  • Reliability, Availability, and Serviceability (RAS) subsystem
  • User Buffer (UB) registration for multinode collectives
  • NIC Fusion
  • Optional receive completions
  • FP8 support
  • Strict enforcement of NCCL_ALGO and NCCL_PROTO

The RAS Subsystem

The RAS subsystem is one of the standout additions in NCCL 2.24. It is designed to assist users in diagnosing application issues like crashes and hangs, particularly in large-scale deployments. This low-overhead infrastructure offers a global view of running applications, enabling the detection of anomalies such as unresponsive nodes or lagging processes. It operates by creating a network of threads across NCCL processes that monitor each other’s health through regular keep-alive messages.
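The keep-alive idea can be illustrated with a small, self-contained Python sketch. This is not NCCL's implementation or API: the `KeepAliveMonitor` class, the `timeout_s` threshold, and the rank names are all hypothetical, and the clock is passed in explicitly so the example is deterministic. It only shows how tracking the time of each peer's last message lets a process flag peers that have gone quiet.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class KeepAliveMonitor:
    """Toy model of one process tracking keep-alive messages from its peers."""

    timeout_s: float  # hypothetical threshold, not an NCCL setting
    last_seen: Dict[str, float] = field(default_factory=dict)

    def record_keep_alive(self, peer: str, now: float) -> None:
        # Remember when this peer was last heard from.
        self.last_seen[peer] = now

    def unresponsive_peers(self, now: float) -> List[str]:
        # A peer is flagged once its last message is older than the timeout.
        return sorted(
            peer
            for peer, seen in self.last_seen.items()
            if now - seen > self.timeout_s
        )


monitor = KeepAliveMonitor(timeout_s=5.0)
monitor.record_keep_alive("rank0", now=0.0)
monitor.record_keep_alive("rank1", now=0.0)
monitor.record_keep_alive("rank0", now=4.0)  # rank1 stays silent after t=0
print(monitor.unresponsive_peers(now=6.0))  # prints ['rank1']
```

In a real deployment the timestamps would come from a monotonic clock and the messages from the network; both are left out here to keep the sketch runnable anywhere.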

Enhancements in User Buffer Registration

NCCL 2.24 introduces user buffer (UB) registration for multinode collectives, allowing more efficient data transfer and reduced GPU resource consumption. The library now supports UB registration both for collective networking with multiple ranks per node and for standard peer-to-peer networks, offering significant performance gains, particularly for operations like AllGather and Broadcast.

NIC Fusion

As systems with many NICs become more common, NCCL has adapted to optimize network communication on them. The new NIC Fusion feature logically merges multiple NICs into a single entity, ensuring efficient use of network resources. This capability is particularly beneficial for systems with more than one NIC per GPU, where it addresses issues such as crashes and inefficient resource allocation.
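As a rough illustration of what "logically merging" NICs means, the Python sketch below groups physical NICs by the GPU they serve and presents each group as one logical device. It is a toy model under stated assumptions, not NCCL's algorithm: the `Nic` type, its `gpu` affinity field, the device names, and the link speeds are all hypothetical, and NCCL's actual merging decisions are made by the library itself.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Nic:
    name: str  # hypothetical device name
    gpu: int   # GPU this NIC is closest to (hypothetical affinity field)
    gbps: int  # link speed in Gbit/s


def fuse_nics(nics):
    """Group NICs that serve the same GPU into one logical device."""
    groups = defaultdict(list)
    for nic in nics:
        groups[nic.gpu].append(nic)
    # One logical entry per GPU: joined name plus aggregate bandwidth.
    return {
        gpu: ("+".join(n.name for n in members), sum(n.gbps for n in members))
        for gpu, members in sorted(groups.items())
    }


nics = [Nic("mlx5_0", 0, 200), Nic("mlx5_1", 0, 200), Nic("mlx5_2", 1, 400)]
for gpu, (name, gbps) in fuse_nics(nics).items():
    print(gpu, name, gbps)
```

With the sample data, GPU 0 ends up with a single logical device backed by two physical NICs, while GPU 1 keeps its one NIC unchanged.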

Additional Features and Fixes

The update also introduces optional receive completions for the LL and LL128 protocols, reducing overhead and congestion. NCCL 2.24 adds native FP8 reductions on NVIDIA Hopper and newer architectures. Additionally, the release enforces NCCL_ALGO and NCCL_PROTO more strictly, giving users more precise tuning and clearer error handling.
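Because NCCL_ALGO and NCCL_PROTO are environment variables, one way to work with stricter enforcement is to validate the values before launching a job. The sketch below is a hedged example rather than an official recipe: `nccl_tuning_env` is a hypothetical helper, and the sets of accepted values are an assumption (a subset of commonly documented ones) that should be checked against the NCCL documentation for the version in use.

```python
import os

# Values believed to be accepted by NCCL; treat these sets as assumptions
# and confirm them against the NCCL documentation for your version.
KNOWN_ALGOS = {"Ring", "Tree"}
KNOWN_PROTOS = {"LL", "LL128", "Simple"}


def nccl_tuning_env(algo, proto, base_env=None):
    """Return a copy of the environment with NCCL_ALGO and NCCL_PROTO set."""
    if algo not in KNOWN_ALGOS:
        raise ValueError(f"unrecognized NCCL_ALGO value: {algo!r}")
    if proto not in KNOWN_PROTOS:
        raise ValueError(f"unrecognized NCCL_PROTO value: {proto!r}")
    # Copy so the caller's (or the process's) environment is not mutated.
    env = dict(os.environ if base_env is None else base_env)
    env["NCCL_ALGO"] = algo
    env["NCCL_PROTO"] = proto
    return env


env = nccl_tuning_env("Ring", "Simple", base_env={})
print(env)  # prints {'NCCL_ALGO': 'Ring', 'NCCL_PROTO': 'Simple'}
```

The returned dictionary can be passed as the `env` argument of `subprocess.run` when starting a training script, so a typo in either setting fails before any GPUs are allocated.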

The release also includes various bug fixes and minor improvements, such as adjustments to PAT tuning and improved memory allocation functions, which make the NCCL library more robust and efficient overall.

Image source: Shutterstock. Read the original article on Blockchain.News.

Tags: Deep Learning, NCCL, Networking, News, NVIDIA
