FinanceLane
  • Funding
    • Equity Funding
    • Debt Funding
    • Crowdfunding
    • Real Estate Funding
  • Investing
    • Stocks
    • Bonds
    • Mutual Funds
    • Commodities
    • Forex
    • Private Equity
    • Real Estate
    • Crypto Investing
  • Lending
    • Personal Loan
    • Business Loan
    • Mortgage
    • Credit Card
    • Microfinance
    • Peer-to-Peer Lending
  • Insurance
    • Life Insurance
    • Health Insurance
    • Auto Insurance
    • Education Insurance
    • General Insurance
  • Banking
    • Individual Banking
    • Business Banking
    • Investment Banking
    • Neo Banking
    • Payments Bank
  • Wealth
    • Earning
    • Savings
    • Investments
    • Budgeting
    • Credit Management
    • Tax Planning
    • Retirement
  • Fintech
    • Payments
    • Digital Banks
    • Alternative Financing
    • Asset Management
    • Softwares
  • Startup
    • Startup Ecosystem
    • Merging & Acquisition
    • Equity Investing
    • Franchising
    • Business Offers
  • Crypto
    • Crypto Coins
    • Crypto Trading
    • Bitcoin
    • Blockchain
    • DAPP
    • Crypto Investing
  • Login
No Result
View All Result
FinanceLane
  • Home
  • Funding
  • Investing
  • Lending
  • Insurance
  • Banking
  • Wealth
  • Crypto
  • Newsletters
  • Feedback
Home News Feed Blockchain News

NVIDIA Delves into RAPIDS cuVS IVF-PQ for Accelerated Vector Search

Blockchainby Blockchain
July 18, 2024

Zach Anderson Jul 18, 2024 20:12

NVIDIA explores the RAPIDS cuVS IVF-PQ algorithm, enhancing vector search performance through compression and GPU acceleration.

NVIDIA Delves into RAPIDS cuVS IVF-PQ for Accelerated Vector Search

In a detailed blog post, NVIDIA has provided insights into their RAPIDS cuVS IVF-PQ algorithm, which aims to accelerate vector search by leveraging GPU technology and advanced compression techniques. This is part one of a two-part series that continues from their previous exploration of the IVF-Flat algorithm.

IVF-PQ Algorithm Introduction

The blog post introduces IVF-PQ (Inverted File Index with Product Quantization), an algorithm designed to enhance search performance and reduce memory usage by storing data in a compressed form. This method, however, comes at the cost of some accuracy, a trade-off that will be further explored in the second part of the series.

IVF-PQ builds upon the concepts of IVF-Flat, which uses an inverted file index to limit the search complexity to a smaller subset of data through clustering. Product quantization (PQ) adds another layer of compression by encoding database vectors, making the process more efficient for large datasets.

Performance Benchmarks

NVIDIA shared benchmarks using the DEEP dataset, which contains a billion records and 96 dimensions, amounting to 360 GiB in size. A typical IVF-PQ configuration compresses this into an index of 54 GiB without significantly impacting search performance, or as small as 24 GiB with a slight slowdown. This compression allows the index to fit into GPU memory.

Comparisons with the popular CPU algorithm HNSW on a 100-million subset of the DEEP dataset show that cuVS IVF-PQ can significantly accelerate both index building and vector search.

Algorithm Overview

IVF-PQ follows a two-step process: a coarse search and a fine search. The coarse search is identical to IVF-Flat, while the fine search involves calculating distances between query points and vectors in probed clusters, but with the vectors stored in a compressed format.

This compression is achieved through PQ, which approximates a vector using two-level quantization. This allows IVF-PQ to fit more data into GPU memory, enhancing memory bandwidth utilization and speeding up the search process.

Optimizations and Performance

NVIDIA has implemented various optimizations in cuVS to ensure the IVF-PQ algorithm performs efficiently on GPUs. These include:

  • Fusing operations to reduce output size and optimize memory bandwidth utilization.
  • Storing the lookup table (LUT) in GPU shared memory when possible for faster access.
  • Using a custom 8-bit floating point data type in the LUT for faster data conversion.
  • Aligning data in 16-byte chunks to optimize data transfers.
  • Implementing an “early stop” check to avoid unnecessary distance computations.

NVIDIA’s benchmarks on a 100-million scale dataset show that IVF-PQ outperforms IVF-Flat, particularly with larger batch sizes, achieving up to 3-4 times the number of queries per second.

Conclusion

IVF-PQ is a robust ANN search algorithm that leverages clustering and compression to enhance search performance and throughput. The first part of NVIDIA’s blog series provides a comprehensive overview of the algorithm’s workings and its advantages on GPU platforms. For more detailed performance tuning recommendations, NVIDIA encourages readers to explore the second part of their series.

For more information, visit the NVIDIA Technical Blog.

Image source: Shutterstock Read The Original Article on Blockchain.News

Tags: GPUNewsNvidiaRAPIDSVECTOR SEARCH

Related Topics

Advisory

Big relief proposed by RBI for account holders in activating an inoperative bank account or claiming unclaimed deposits

Advisory

Periodic KYC update in bank account to become easier; RBI proposes new draft rules, allows time till June 30, 2026, to do KYC for these customers

Prev Next

You May Like

Advisory

Big relief proposed by RBI for account holders in activating an inoperative bank account or claiming unclaimed deposits

Advisory

Periodic KYC update in bank account to become easier; RBI proposes new draft rules, allows time till June 30, 2026, to do KYC for these customers

Advisory

ICICI Bank discontinues its PayLater credit line on UPI for all customers; The bank answers what happens with customers

Advisory

Savings Ki Vidya’ campaign by Federal Bank: A fresh approach to savings

Advisory

Saturday bank holiday: Are banks open or closed on May 24, 2025?

Blockchain

Ava Protocol Revolutionizes Agent-Driven Workflows with Verifiable Execution

Blockchain News

NVIDIA Surpasses 1,000 TPS/User with Llama 4 Maverick and Blackwell GPUs

Blockchain News

Gala Games Launches ‘VEXI at Work’ Leaderboard Event with $GALA Rewards

Financial News

Advisory

Removal of indexation benefit on sale of property: Majority of taxpayers to have substantial tax savings, says tax department

FinanceLane
by FinanceLane
Advisory

Exciting new UPI features that you can use in 2025: International travel, UPI one world wallet, cashback, payment delegation, and more

FinanceLane
by FinanceLane
Blockchain News

Vitalik Buterin’s Vision of Techno-Optimism and AI’s Future

Blockchain
by Blockchain
Bitcoin

Bank Clients Just Dipped Their Toes Into Bitcoin ETFs, but Q4 Could See a FOMO Spike

CoinDesk
by CoinDesk
Blockchain News

John Deaton Files Amicus Brief in Support of Coinbase’s Appeal Against SEC

Blockchain
by Blockchain
Blockchain News

Alpaca Debuts Private Equity Real Estate

Blockchain
by Blockchain
Blockchain News

Canaan Inc. Secures Major Order for Avalon Miner A1566 from Cipher Mining

Blockchain
by Blockchain
Advisory

Thinking of a holiday? Here’s why Singapore is a go-to destination

FinanceLane
by FinanceLane
Bitcoin

Bitcoin Hits Record High Against BlackRock’s U.S. Treasury ETF as Investors Search for Returns: Van Straten

CoinDesk
by CoinDesk
Blockchain News

Gresini Racing and Partners Launch Innovative Fan-Powered Motorsport Sponsorship

Blockchain
by Blockchain
Blockchain News

Highlights from Real World Crypto 2025 Conference: SNARKs, Digital Euros, and AI Agents

Blockchain
by Blockchain
Bitcoin

Riot Platforms’ Major Expansion: Acquiring 66,560 Bitcoin Mining Rigs from MicroBT

Blockchain
by Blockchain
Load More
FinanceLane.com
  • Disclaimer
  • Privacy Policy
  • Terms of use
  • Subscribe
  • Contact

Subscribe to get the latest updates

Follow us on

© 2022 FinanceLane.com. All rights reserved.

Welcome Back!

Sign In with Facebook
Sign In with Google
Sign In with Linked In
OR

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
  • Home
  • Funding
    • Equity Funding
    • Debt Funding
    • Real Estate Funding
    • Crowdfunding
  • Investing
    • Stocks
    • Bonds
    • Mutual Funds
    • Private Equity
    • Merging & Acquisition
    • Real Estate
  • Lending
    • Personal Loan
    • Business Loan
    • Credit Card
    • Microfinance
    • Peer-to-Peer Lending
  • Insurance
    • Life Insurance
    • Auto Insurance
    • Education Insurance
    • Health Insurance
  • Banking
    • Business Banking
    • Payments Bank
    • Investment Banking
    • Individual Banking
  • Wealth
    • Earning
    • Savings
    • Investments
    • Budgeting
    • Credit Management
    • Tax Planning
    • Retirement
  • Fintech
    • Alternative Financing
    • Payments
    • Asset Management
    • Digital Banks
    • Softwares
  • Fintech
    • Alternative Financing
    • Asset Management
    • Digital Banks
    • Softwares
    • Payments
  • Crypto
    • Crypto Investing
    • Crypto Trading
    • Crypto Coins
    • Bitcoin
    • Blockchain
    • DAPP
  • Subscribe
  • Contact
  • Login

© 2022 FinanceLane - Terms and Conditions | Disclaimer | Privacy Policy

This website uses cookies. By continuing to use this website you are giving consent to cookies being used. Visit our Privacy and Cookie Policy.