Skip to content
Tiatra, LLCTiatra, LLC
Tiatra, LLC
Information Technology Solutions for Washington, DC Government Agencies
  • Home
  • About Us
  • Services
    • IT Engineering and Support
    • Software Development
    • Information Assurance and Testing
    • Project and Program Management
  • Clients & Partners
  • Careers
  • News
  • Contact
 
  • Home
  • About Us
  • Services
    • IT Engineering and Support
    • Software Development
    • Information Assurance and Testing
    • Project and Program Management
  • Clients & Partners
  • Careers
  • News
  • Contact

AI and load balancing

Just as cloud computing led to the emergence of software-defined (SD) load balancing, the artificial intelligence (AI) revolution is taking us a step farther, to AI-defined architectures. This transformation represents a significant shift in how enterprises approach their infrastructure to support modern AI workloads and bring AI benefits to existing workloads.

AI applications present significant challenges with respect to load balancing. AI workloads, including agentic workloads, demand extreme performance: terabits/second, not the gigabits/second that’s been required for traditional applications. As a result, organizations need load balancers with extraordinary throughput capabilities and the scalability to support elastic operations.

“When you build modern AI applications for enterprises, there has to be a very high level of performance, resilience, security, and elasticity,” says Chris Wolf, global head of AI and advanced services, VCF Division at Broadcom. “Load balancers in the AI era must be able to manage services and fulfill enterprise requirements across multiple servers and clusters, because of the distributed nature of large inference and training jobs in private AI environments.”

Additionally, enterprise AI applications are almost exclusively built on Kubernetes with a microservices architecture. That means organizations need load balancers that can autoscale, autoheal, and operate “as code,” with built-in capabilities including global server load balancing (GSLB), web application firewalls (WAFs), and application programming interface (API) security.

AI applications exchange vast amounts of sensitive data through APIs, requiring robust protection against attacks and data leakage through comprehensive web app and API security. Thresholding with anomaly detection and traffic pattern recognition should be employed to optimize resource allocation.

AI-defined load balancing

It’s only fitting that load balancing in the AI era employs AI to get the job done, and it does so across three key dimensions.

First, predictive intelligence enables high resilience, by leveraging health score monitoring and dynamic thresholds that scale in real time as needed to accommodate bursts. In this environment, static thresholds aren’t feasible, because traffic is too dynamic and overprovisioning for max load would be prohibitively expensive. Active-active high-availability configurations ensure continuous operation, and autoscaling capabilities coupled with autohealing recognize traffic patterns and remediate most issues without an admin getting deeply involved, if at all.

Second, generative AI (genAI) can dramatically improve operational efficiency by acting as copilots to assist teams in several ways. Admins can ask questions by using natural language, and the AI tools provide answers, analytics, and contextual insights based on information found in application health scores, application latency measurements, design guides, and knowledge base (KB) documentation. AI tools can also provide correlated analytics, contextual insights, and multifactor inference within admins’ work streams. Infrastructure-as-code capabilities reduce manual work, because configurations can be changed programmatically in software. Capacity management and performance troubleshooting assistance can flag emerging issues for admins to address long before they affect users, all of which dramatically improves productivity.

Finally, AI-powered self-service capabilities create load balancing interfaces for DevOps teams that require zero training, because AI can provide intuitive assistance for engineers to follow. The result is faster deployment and configuration without sacrificing quality or security.

A solution that meets all of these AI era requirements, such as Broadcom’s VMware Avi Load Balancer, delivers big dividends. Rigorous studies have shown that enterprise IT can achieve 43% OpEx savings, 90% faster app delivery provisioning, and a 27% DevOps productivity boost with this solution.

Software-defined load balancing principles remain—ensuring scale-out performance, dynamic availability, and application-level security—and the AI era dramatically amplifies these requirements while infusing AI principles. Organizations that embrace AI-defined load balancing will not only support their AI and non-AI workloads more effectively but will also benefit from the intelligence embedded within their infrastructure.

To learn more about how Broadcom can help your organization bring load balancing into the AI era, visit us here.

>
>About the author:

>Umesh Mahajan is Vice President and General Manager of Broadcom’s Application Networking and Security Division. He joined Broadcom from VMware, where he led the Networking and Security Business Unit and was responsible for the NSX software-defined network virtualization platform, which encompassed network connectivity, security, and load balancing. With more than three decades of experience in multi-cloud networking and networking services, Mr. Mahajan holds over 30 patents. Prior to joining VMware, he founded Avi Networks, which built the disruptive software-defined advanced load balancer. Earlier, he held senior leadership positions at Cisco, including Vice President and General Manager of the data center switching business, and was responsible for Nexus 7000 & MDS 9000 platforms, and the NX-OS operating system. Mr. Mahajan holds a Master of Science in computer science from Duke University and a Bachelor of Technology from IIT Delhi.

LinkedIn: https://www.linkedin.com/in/umeshmahajan/

>


Read More from This Article: AI and load balancing
Source: News

Category: NewsMay 21, 2025
Tags: art

Post navigation

PreviousPrevious post:M&S says it will respond to April cyberattack by accelerating digital transformation plansNextNext post:Basis Technologies launches Klario to help automate SAP change management

Related posts

PwCのCITO(最高情報技術責任者)が語る「CIOの魅力」とは
May 21, 2025
M&S says it will respond to April cyberattack by accelerating digital transformation plans
May 21, 2025
Basis Technologies launches Klario to help automate SAP change management
May 21, 2025
The AI-native generation is here. Don’t get left behind
May 21, 2025
Synthetic data’s fine line between reward and disaster
May 21, 2025
IBM’s massive SAP S/4HANA migration pays off
May 21, 2025
Recent Posts
  • PwCのCITO(最高情報技術責任者)が語る「CIOの魅力」とは
  • M&S says it will respond to April cyberattack by accelerating digital transformation plans
  • AI and load balancing
  • Basis Technologies launches Klario to help automate SAP change management
  • The AI-native generation is here. Don’t get left behind
Recent Comments
    Archives
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • December 2023
    • November 2023
    • October 2023
    • September 2023
    • August 2023
    • July 2023
    • June 2023
    • May 2023
    • April 2023
    • March 2023
    • February 2023
    • January 2023
    • December 2022
    • November 2022
    • October 2022
    • September 2022
    • August 2022
    • July 2022
    • June 2022
    • May 2022
    • April 2022
    • March 2022
    • February 2022
    • January 2022
    • December 2021
    • November 2021
    • October 2021
    • September 2021
    • August 2021
    • July 2021
    • June 2021
    • May 2021
    • April 2021
    • March 2021
    • February 2021
    • January 2021
    • December 2020
    • November 2020
    • October 2020
    • September 2020
    • August 2020
    • July 2020
    • June 2020
    • May 2020
    • April 2020
    • January 2020
    • December 2019
    • November 2019
    • October 2019
    • September 2019
    • August 2019
    • July 2019
    • June 2019
    • May 2019
    • April 2019
    • March 2019
    • February 2019
    • January 2019
    • December 2018
    • November 2018
    • October 2018
    • September 2018
    • August 2018
    • July 2018
    • June 2018
    • May 2018
    • April 2018
    • March 2018
    • February 2018
    • January 2018
    • December 2017
    • November 2017
    • October 2017
    • September 2017
    • August 2017
    • July 2017
    • June 2017
    • May 2017
    • April 2017
    • March 2017
    • February 2017
    • January 2017
    Categories
    • News
    Meta
    • Log in
    • Entries feed
    • Comments feed
    • WordPress.org
    Tiatra LLC.

    Tiatra, LLC, based in the Washington, DC metropolitan area, proudly serves federal government agencies, organizations that work with the government and other commercial businesses and organizations. Tiatra specializes in a broad range of information technology (IT) development and management services incorporating solid engineering, attention to client needs, and meeting or exceeding any security parameters required. Our small yet innovative company is structured with a full complement of the necessary technical experts, working with hands-on management, to provide a high level of service and competitive pricing for your systems and engineering requirements.

    Find us on:

    FacebookTwitterLinkedin

    Submitclear

    Tiatra, LLC
    Copyright 2016. All rights reserved.