Close Menu
StreamLineCrypto.comStreamLineCrypto.com
  • Home
  • Crypto News
  • Bitcoin
  • Altcoins
  • NFT
  • Defi
  • Blockchain
  • Metaverse
  • Regulations
  • Trading
What's Hot

Dogecoin Triangle Support Test Maps Out Recovery Roadmap And When To Sell

December 13, 2025

Solana ETFs Buck The Trend With 7-Day Positive Inflow Streak

December 13, 2025

LDO Price Prediction: Targeting $0.75-$1.27 Recovery Within 4-6 Weeks

December 13, 2025
Facebook X (Twitter) Instagram
Saturday, December 13 2025
  • Contact Us
  • Privacy Policy
  • Cookie Privacy Policy
  • Terms of Use
  • DMCA
Facebook X (Twitter) Instagram
StreamLineCrypto.comStreamLineCrypto.com
  • Home
  • Crypto News
  • Bitcoin
  • Altcoins
  • NFT
  • Defi
  • Blockchain
  • Metaverse
  • Regulations
  • Trading
StreamLineCrypto.comStreamLineCrypto.com

NVIDIA’s RAPIDS cuDF Enhances pandas Performance by 30x on Large Datasets

August 10, 2024Updated:August 11, 2024No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
NVIDIA’s RAPIDS cuDF Enhances pandas Performance by 30x on Large Datasets
Share
Facebook Twitter LinkedIn Pinterest Email
ad


Felix Pinkston
Aug 10, 2024 02:42

NVIDIA releases RAPIDS cuDF unified reminiscence, boosting pandas efficiency as much as 30x on massive and text-heavy datasets.





NVIDIA has unveiled new options in RAPIDS cuDF, considerably bettering the efficiency of the pandas library when dealing with massive and text-heavy datasets. In line with NVIDIA Technical Weblog, the enhancements allow knowledge scientists to speed up their workloads by as much as 30x.

RAPIDS cuDF and pandas

RAPIDS is a collection of open-source GPU-accelerated knowledge science and AI libraries, and cuDF is its Python GPU DataFrame library designed for knowledge loading, becoming a member of, aggregating, and filtering. pandas, a widely-used knowledge evaluation and manipulation library for Python, has struggled with processing velocity and effectivity as dataset sizes develop, significantly on CPU-only techniques.

At GTC 2024, NVIDIA introduced that RAPIDS cuDF may speed up pandas almost 150x with out requiring code adjustments. Google later revealed that RAPIDS cuDF is on the market by default on Google Colab, making it extra accessible to knowledge scientists.

Tackling Limitations

Person suggestions on the preliminary launch of cuDF highlighted a number of limitations, significantly with the dimensions and kind of datasets that might profit from acceleration:

  • To maximise acceleration, datasets wanted to suit inside GPU reminiscence, limiting the info measurement and complexity of operations that could possibly be carried out.
  • Textual content-heavy datasets confronted constraints, with the unique cuDF launch supporting solely as much as 2.1 billion characters in a column.

To handle these points, the newest launch of RAPIDS cuDF consists of:

  • Optimized CUDA unified reminiscence, permitting for as much as 30x speedups of bigger datasets and extra complicated workloads.
  • Expanded string assist from 2.1 billion characters in a column to 2.1 billion rows of tabular textual content knowledge.

Accelerated Information Processing with Unified Reminiscence

cuDF depends on CPU fallback to make sure a seamless expertise. When reminiscence necessities exceed GPU capability, cuDF transfers knowledge into CPU reminiscence and makes use of pandas for processing. Nevertheless, to keep away from frequent CPU fallback, datasets ought to ideally match inside GPU reminiscence.

With CUDA unified reminiscence, cuDF can now scale pandas workloads past GPU reminiscence. Unified reminiscence supplies a single deal with area spanning CPUs and GPUs, enabling digital reminiscence allocations bigger than obtainable GPU reminiscence and migrating knowledge as wanted. This helps maximize efficiency, though datasets ought to nonetheless be sized to slot in GPU reminiscence for peak acceleration.

Benchmarks present that utilizing cuDF for knowledge joins on a ten GB dataset with a 16 GB reminiscence GPU can obtain as much as 30x speedups in comparison with CPU-only pandas. It is a important enchancment, particularly for processing datasets bigger than 4 GB, which beforehand confronted efficiency points as a result of GPU reminiscence constraints.

Processing Tabular Textual content Information at Scale

The unique cuDF launch’s 2.1 billion character restrict in a column posed challenges for big datasets. With the brand new launch, cuDF can now deal with as much as 2.1 billion rows of tabular textual content knowledge, making pandas a viable software for knowledge preparation in generative AI pipelines.

These enhancements make pandas code execution a lot quicker, particularly for text-heavy datasets like product opinions, customer support logs, and datasets with substantial location or person ID knowledge.

Get Began

All these options can be found with RAPIDS 24.08, which might be downloaded from the RAPIDS Set up Information. Observe that the unified reminiscence characteristic is just supported on Linux-based techniques.

Picture supply: Shutterstock


ad
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Related Posts

Dogecoin Triangle Support Test Maps Out Recovery Roadmap And When To Sell

December 13, 2025

Solana ETFs Buck The Trend With 7-Day Positive Inflow Streak

December 13, 2025

LDO Price Prediction: Targeting $0.75-$1.27 Recovery Within 4-6 Weeks

December 13, 2025

Bitcoin’s $55 billion options market is now obsessing over one specific date that forces a $100k showdown

December 13, 2025
Add A Comment
Leave A Reply Cancel Reply

ad
What's New Here!
Dogecoin Triangle Support Test Maps Out Recovery Roadmap And When To Sell
December 13, 2025
Solana ETFs Buck The Trend With 7-Day Positive Inflow Streak
December 13, 2025
LDO Price Prediction: Targeting $0.75-$1.27 Recovery Within 4-6 Weeks
December 13, 2025
Bitcoin Is A ‘Digital Labubu’ With No Economic Value: Vanguard Quant Head
December 13, 2025
Bitcoin’s $55 billion options market is now obsessing over one specific date that forces a $100k showdown
December 13, 2025
Facebook X (Twitter) Instagram Pinterest
  • Contact Us
  • Privacy Policy
  • Cookie Privacy Policy
  • Terms of Use
  • DMCA
© 2025 StreamlineCrypto.com - All Rights Reserved!

Type above and press Enter to search. Press Esc to cancel.