ScaleOps Introduces an AI Infrastructure Resource Manager to Accelerate Self‑Hosted AI Adoption

ScaleOps, the market leader in cloud resource management, announced the launch of its AI Infra Product. The new product extends the platform's proven capabilities to self-hosted AI models and GPU-based applications at scale, redefining how enterprises manage and optimize AI infrastructure.

The ScaleOps platform automatically manages production environments in real time for industry leaders including Wiz, DocuSign, Rubrik, Coupa, Alkami, Vantor, Grubhub, Island, Chewy, and Fortune 500 companies. With the AI Infra Product launch, ScaleOps extends these capabilities to help AIOps and DevOps teams run self-hosted LLMs and AI models, enabling organizations to improve GPU efficiency, eliminate waste, and scale their AI workloads.

As companies increasingly deploy self-hosted AI models at scale, engineering teams face major challenges. Wasted GPU spend is a primary pain point: GPUs frequently run at low utilization while being billed at full capacity, producing substantial wasted cloud spend.[1] Performance issues compound the problem: large models mean long load times and high latency during demand spikes, prompting teams to overprovision GPUs and incur even higher costs. Engineers also lose valuable time to manual tuning, constantly adjusting workloads to maintain performance.
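To make the waste concrete, here is a back-of-the-envelope sketch; the fleet size, hourly rate, and utilization figure below are illustrative assumptions, not ScaleOps data:

```python
# Illustrative arithmetic only: fleet size, price, and utilization
# are hypothetical assumptions, not figures from ScaleOps.
GPU_COUNT = 64            # provisioned GPUs
HOURLY_RATE_USD = 4.00    # assumed on-demand price per GPU-hour
AVG_UTILIZATION = 0.30    # assumed average GPU utilization

hours_per_month = 24 * 30
monthly_spend = GPU_COUNT * HOURLY_RATE_USD * hours_per_month
idle_waste = monthly_spend * (1 - AVG_UTILIZATION)

print(f"Monthly GPU spend:   ${monthly_spend:>10,.0f}")  # $184,320
print(f"Idle-capacity waste: ${idle_waste:>10,.0f}")     # $129,024
```

At that assumed utilization, roughly two-thirds of the monthly bill pays for idle capacity, which is the gap this class of product targets.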

The ScaleOps AI Infra Product provides a complete resource management solution for self-hosted GenAI models and GPU-based applications in cloud-native environments. It intelligently allocates and scales GPU resources in real time, increases utilization, accelerates model load times, and continuously adapts to dynamic demand. By combining application context-awareness with real-time continuous automation, ScaleOps keeps self-hosted AI models running optimally, eliminating GPU waste, driving substantial cost savings, and freeing engineering teams from repeated manual tuning.
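ScaleOps has not published the internals of this automation, but a minimal sketch of the kind of hand-rolled control loop it replaces might look like the following. The deployment name, namespace, thresholds, and the get_gpu_utilization stub are hypothetical assumptions; in practice, utilization would come from a metrics pipeline such as Prometheus scraping the NVIDIA DCGM exporter.

```python
"""Illustrative only: a naive GPU-aware scaling loop of the sort teams
otherwise hand-roll. All names, thresholds, and the metrics source are
hypothetical assumptions; this is not the ScaleOps API."""
from kubernetes import client, config


def get_gpu_utilization(deployment: str, namespace: str) -> float:
    """Hypothetical stub: a real cluster would query a metrics backend,
    e.g. Prometheus scraping the NVIDIA DCGM exporter."""
    raise NotImplementedError("wire this up to your metrics pipeline")


def scale_on_utilization(deployment: str, namespace: str,
                         low: float = 0.30, high: float = 0.80,
                         min_replicas: int = 1, max_replicas: int = 8) -> None:
    """Nudge the replica count toward a target GPU-utilization band."""
    config.load_kube_config()  # use load_incluster_config() inside a pod
    apps = client.AppsV1Api()

    current = apps.read_namespaced_deployment(deployment, namespace)
    replicas = current.spec.replicas or 1
    util = get_gpu_utilization(deployment, namespace)

    # Scale out under sustained pressure, scale in on idle capacity.
    if util > high and replicas < max_replicas:
        replicas += 1
    elif util < low and replicas > min_replicas:
        replicas -= 1
    else:
        return  # already within the target band

    apps.patch_namespaced_deployment(
        name=deployment, namespace=namespace,
        body={"spec": {"replicas": replicas}},
    )
```

Even this toy loop ignores load-time-aware scale-up, bin-packing across GPU types, and demand forecasting; maintaining such logic by hand is precisely the repeated manual tuning ScaleOps says it eliminates.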

"Cloud-native AI infrastructure is reaching a breaking point," said Yodar Shafrir, CEO and Co-Founder of ScaleOps. "Cloud-native architectures unlocked great flexibility and control, but they also introduced a new level of complexity. Managing GPU resources at scale has become chaotic - waste, performance issues, and skyrocketing costs are now the norm. The ScaleOps platform was built to fix this. It delivers the complete solution for managing and optimizing GPU resources in cloud-native environments, enabling enterprises to run LLMs and AI applications efficiently, cost-effectively, and while improving performance."

Already deployed in customers' production environments, the ScaleOps AI Infra Product has driven GPU cost savings of 50-70%, with large enterprises projecting tens of millions of dollars in annual savings as they modernize their GPU operations.

"ScaleOps provides enterprises with a complete, holistic solution that brings together every aspect of cloud resource management - enabling them to manage all their cloud workloads at scale." said Shafrir.