• CONTACT
  • Privacy Policy
  • Blog
  • Terms & Conditions
  • About Us
Crypto Tag News
  • Home
  • Blockchain
  • Crypto
    • Bitcoin
    • Ethereum
    • Forex
    • Tether
  • Market
    • Binance
    • Business
    • Investor
    • Money
    • Trading
Reading: Enhancing Kubernetes with NVIDIA’s NIM Microservices Autoscaling
Share
  • bitcoinBitcoin(BTC)$107,166.00
  • ethereumEthereum(ETH)$2,490.42
  • tetherTether(USDT)$1.00
  • rippleXRP(XRP)$2.25
  • binancecoinBNB(BNB)$657.63
  • solanaSolana(SOL)$155.51
  • usd-coinUSDC(USDC)$1.00
  • tronTRON(TRX)$0.280548
  • dogecoinDogecoin(DOGE)$0.165168
  • staked-etherLido Staked Ether(STETH)$2,490.05
Crypto Tag NewsCrypto Tag News
Aa
  • Home
  • Blockchain
  • Crypto
  • Market
Search
  • Home
  • Blockchain
  • Crypto
    • Bitcoin
    • Ethereum
    • Forex
    • Tether
  • Market
    • Binance
    • Business
    • Investor
    • Money
    • Trading
Have an existing account? Sign In
Follow US
© Crypto Tag NEWS. All Rights Reserved.
Crypto Tag News > Blog > Market > Enhancing Kubernetes with NVIDIA’s NIM Microservices Autoscaling
Market

Enhancing Kubernetes with NVIDIA’s NIM Microservices Autoscaling

snifferius
Last updated: 2025/01/24 at 10:44 PM
snifferius Published January 24, 2025
Share


Contents
Understanding NVIDIA NIM MicroservicesSetting Up AutoscalingDeploying NIM MicroservicesImplementing Horizontal Pod AutoscalingFuture Prospects


Terrill Dicki
Jan 24, 2025 14:36

Explore NVIDIA’s approach to horizontal autoscaling of NIM microservices on Kubernetes, utilizing custom metrics for efficient resource management.



Enhancing Kubernetes with NVIDIA's NIM Microservices Autoscaling

NVIDIA has introduced a comprehensive approach to horizontally autoscale its NIM microservices on Kubernetes, as detailed by Juana Nakfour on the NVIDIA Developer Blog. This method leverages Kubernetes Horizontal Pod Autoscaling (HPA) to dynamically adjust resources based on custom metrics, optimizing compute and memory usage.

Understanding NVIDIA NIM Microservices

NVIDIA NIM microservices serve as model inference containers deployable on Kubernetes, crucial for managing large-scale machine learning models. These microservices necessitate a clear understanding of their compute and memory profiles in a production environment to ensure efficient autoscaling.

Setting Up Autoscaling

The process begins with setting up a Kubernetes cluster equipped with essential components such as the Kubernetes Metrics Server, Prometheus, Prometheus Adapter, and Grafana. These tools are integral for scraping and displaying metrics required for the HPA service.

The Kubernetes Metrics Server collects resource metrics from Kubelets and exposes them via the Kubernetes API Server. Prometheus and Grafana are employed to scrape metrics from pods and create dashboards, while the Prometheus Adapter allows HPA to utilize custom metrics for scaling strategies.

Deploying NIM Microservices

NVIDIA provides a detailed guide for deploying NIM microservices, specifically using the NIM for LLMs model. This involves setting up the necessary infrastructure and ensuring the NIM for LLMs microservice is ready for scaling based on GPU cache usage metrics.

Grafana dashboards visualize these custom metrics, facilitating the monitoring and adjustment of resource allocation based on traffic and workload demands. The deployment process includes generating traffic with tools like genai-perf, which helps in assessing the impact of varying concurrency levels on resource utilization.

Implementing Horizontal Pod Autoscaling

To implement HPA, NVIDIA demonstrates creating an HPA resource focused on the gpu_cache_usage_perc metric. By running load tests at different concurrency levels, the HPA automatically adjusts the number of pods to maintain optimal performance, demonstrating its effectiveness in handling fluctuating workloads.

Future Prospects

NVIDIA’s approach opens avenues for further exploration, such as scaling based on multiple metrics like request latency or GPU compute utilization. Additionally, leveraging Prometheus Query Language (PromQL) to create new metrics can enhance the autoscaling capabilities.

For more detailed insights, visit the NVIDIA Developer Blog.

Image source: Shutterstock


You Might Also Like

$12,000/Month Cash Flow by Cracking the Rental “Formula”

20 Companies With Permanent Remote Jobs

5 Financial Things You Should Always Keep Private

Artificial Intelligence Optimization (AIO): Enhancing AI System Performance

Bitcoin (BTC) Faces Limited Momentum Amid On-Chain Activity Slowdown

TAGGED: Autoscaling, Enhancing, Kubernetes, microservices, NIM, Nvidias

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share this Article
Facebook Twitter Email Copy Link Print
Previous Article Ethereum Consolidates But Open Interest Points to Potential Breakout
Next Article Did the US Secretary of State just cutoff aid to Ukraine?
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Follow US

Find US on Socials
Facebook Like
Twitter Follow
Youtube Subscribe
Telegram Follow

Subscribe to our newslettern

Get Newest Articles Instantly!

- Advertisement -
Ad image
Popular News
$12,000/Month Cash Flow by Cracking the Rental “Formula”
Understanding Bitcoin: A Beginner’s Guide to the World of Cryptocurrency
Exploring the Impact of Cryptocurrency Regulations on Global Finance

Follow Us on Socials

We use social media to react to breaking news, update supporters and share information

Twitter Youtube Telegram Linkedin
Crypto Tag News

We influence 20 million users and is the number one business blockchain and crypto news network on the planet.

Subscribe to our newsletter

You can be the first to find out the latest news and tips about trading, markets...

Ad image

© Crypto Tag NEWS. All Rights Reserved.

Removed from reading list

Undo
Welcome Back!

Sign in to your account

Lost your password?