Artificial intelligence platform provider Clarifai has unveiled a new compute orchestration capability that promises to help enterprises optimise their AI workloads in any computing environment, reduce costs and avoid vendor lock-in.
Announced on December 3, 2024, the public preview release lets organisations orchestrate AI workloads through a unified control plane, whether those workloads are running in the cloud, on-premises, or in air-gapped infrastructure. The platform can work with any AI model and hardware accelerator, including GPUs, CPUs, and TPUs.
“Clarifai has always been ahead of the curve, with over a decade of experience supporting large enterprise and mission-critical government needs with the full stack of AI tools to create custom AI workloads,” said Matt Zeiler, founder and CEO of Clarifai. “Now, we’re opening up capabilities we built internally to optimise our compute costs as we scale to serve millions of models simultaneously.”
The company claims its platform can reduce compute usage by 3.7x through model packing optimisations while supporting over 1.6 million inference requests per second with 99.9997% reliability. According to Clarifai, the optimisations can potentially cut costs by 60-90%, depending on configuration.
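As a rough sanity check on how those two figures relate, a 3.7x reduction in compute usage means the same workload needs about 1/3.7 of the original compute. The short sketch below assumes, purely for illustration, that cost scales linearly with compute used; Clarifai's announcement does not say how the 60-90% range was derived, and the function name is hypothetical.

```python
# Illustrative arithmetic only. Assumes cost is proportional to compute used,
# which is an assumption of this sketch, not a claim made by Clarifai.
def fraction_saved(reduction_factor: float) -> float:
    """Fraction of compute no longer needed after an N-fold reduction."""
    if reduction_factor <= 0:
        raise ValueError("reduction_factor must be positive")
    return 1.0 - 1.0 / reduction_factor


saving = fraction_saved(3.7)
print(f"Compute saved at 3.7x: {saving:.1%}")
```

Under that assumption the saving works out to roughly 73%, which sits inside the 60-90% range the company quotes.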
Capabilities of the compute orchestration platform include:
- Cost optimisation through automated resource management, including model packing, dependency simplification, and customisable auto-scaling options that can scale to zero for model replicas and compute nodes,
- Deployment flexibility on any hardware vendor, including cloud, on-premise, air-gapped, and Clarifai SaaS infrastructure,
- Integration with Clarifai’s AI platform for data labeling, training, evaluation, workflows, and feedback,
- Security features that allow deployment into customer VPCs or on-premise Kubernetes clusters without requiring open inbound ports, VPC peering, or custom IAM roles.
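To make "model packing" and "scale to zero" more concrete, here is a minimal sketch of the general ideas. It is not Clarifai's implementation, which the announcement does not describe: the packing step uses a textbook first-fit decreasing heuristic, and the model names, memory sizes, node capacity, and function names are all invented for illustration.

```python
# Hypothetical illustration of model packing and scale-to-zero.
# NOT Clarifai's algorithm; all names and numbers below are made up.
import math
from typing import Dict, List


def pack_models(model_mem_gb: Dict[str, float], node_capacity_gb: float) -> List[List[str]]:
    """Assign models to nodes using first-fit decreasing bin packing."""
    nodes: List[List[str]] = []  # models placed on each node
    free: List[float] = []       # remaining memory on each node
    # Place the largest models first so smaller ones can fill leftover gaps.
    for name, mem in sorted(model_mem_gb.items(), key=lambda kv: kv[1], reverse=True):
        if mem > node_capacity_gb:
            raise ValueError(f"{name} does not fit on a single node")
        for i, remaining in enumerate(free):
            if mem <= remaining:
                nodes[i].append(name)
                free[i] -= mem
                break
        else:
            # No existing node has room, so open a new one.
            nodes.append([name])
            free.append(node_capacity_gb - mem)
    return nodes


def desired_replicas(requests_per_second: float, capacity_per_replica: float) -> int:
    """Scale-to-zero rule: no traffic means no replicas; otherwise round up."""
    if requests_per_second <= 0:
        return 0
    return math.ceil(requests_per_second / capacity_per_replica)


models = {"detector": 10.0, "embedder": 6.0, "classifier": 4.0, "ocr": 3.0, "tagger": 1.0}
placement = pack_models(models, node_capacity_gb=12.0)
print(f"{len(models)} models on {len(placement)} nodes: {placement}")
print("replicas when idle:", desired_replicas(0, capacity_per_replica=50))
```

In this toy example the five models fit on three nodes rather than five, and an idle model is assigned zero replicas. A production scheduler would also have to weigh latency, cold-start time, and accelerator type, none of which this sketch models.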
The platform emerged from Clarifai customers’ issues with AI performance and cost. “If we had a way to think about it holistically and look at our on-prem costs compared to our cloud costs, and then be able to orchestrate across environments with a cost basis, that would be incredibly valuable,” noted a customer, as cited in Clarifai’s announcement.
The compute orchestration capabilities build on Clarifai’s existing AI platform that, the company says, has processed over 2 billion operations in computer vision, language, and audio AI. The company reports maintaining 99.99%+ uptime and 24/7 availability for critical applications.
The compute orchestration capability is currently available in public preview. Organisations interested in testing the platform should contact Clarifai for access.