Intel and Inflection AI have launched what they claim is an industry-first – Inflection for Enterprise – an enterprise-grade AI system powered by Intel Gaudi and Intel Tiber AI Cloud.

The companies say Inflection for Enterprise will accelerate the adoption and impact of AI for enterprises and developers – delivering empathetic, conversational, employee-friendly AI capabilities and providing the control, customisation, and scalability required for complex, large-scale deployments.

The system is available presently through the AI Cloud and will be shipping to customers as an industry-first AI appliance powered by Gaudi 3 in Q1 2025.

“Through this strategic collaboration with Inflection AI we are setting a new standard with AI solutions that deliver immediate, high-impact results,” says Justin Hotard, Intel executive vice-president and GM of the Data Centre and AI Group. “With support for open-source models, tools, and competitive performance per watt Intel Gaudi 3 solutions make deploying GenAI accessible, affordable, and efficient for enterprises of any size.”

Building an AI system typically demands substantial infrastructure -extensive model development and training, and collaboration among engineers, data scientists and application developers. With Inflection for Enterprise, built on Inflection 3.0, enterprise customers can now harness a comprehensive AI solution that empowers their workforce with a virtual AI co-worker specifically trained on their unique company data, policies, and culture.

“Every CEO and CTO we speak to is frustrated that existing AI tools on the market aren’t truly enterprise-grade,” says Inflection AI COO, Ted Shelton. “Enterprise organisations need more than generic off-the-shelf AI, but they don’t have the expertise to fine-tune a model themselves. We’re proud to offer an AI system that solves these problems – and with the performance gains we see from running on Intel Gaudi, we know it can scale to meet the needs of any enterprise.”

Inflection AI fine-tunes its model to be native to each individual organisation expediting user adoption and improving the usefulness of use cases through alignment with the company’s tone, purpose, and unique product, service, and operating information.

Inflection 3.0 enables enterprise customers with faster time-to-value through employee-friendly generative AI experiences, while offering price, performance, and security/compliance advantages.

  • Removing barriers to GenAI – Built on AI Cloud, Inflection for Enterprise provides application templates designed to let businesses skip hardware testing and model building, and avoid capital expenses to scale quickly. In Q1 2025 customers will also have the option to purchase Inflection for Enterprise on a complete turnkey AI appliance. Leveraging Gaudi 3, customers of this appliance can benefit from up to 2x improved price performance as well as 128GB of high-bandwidth memory capacity further optimising their GenAI performance compared with current competitive offerings.
  • Optimised price/performance – While Inflection AI’s Pi consumer application was previously run on Nvidia GPUs, Inflection 3.0 will be powered by Gaudi 3 with instances on-premises or in the cloud powered by AI Cloud. This not only cuts down on time to deploy, but also total cost of ownership.
  • Fine-tuned for enterprises – Leveraging the fine-tuning and reinforcement learning from human feedback (RLHF) expertise that powered Inflection AI’s Pi, Inflection for Enterprise models are unique to each business’ ethos and way of operating.  Modeled on data and insights from a company’s history, policies, content, tone, products, and operating information Inflection AI helps drive productivity and alignment across an organisation.
  • Enhanced ownership and security – Inflection for Enterprise allows enterprises to own their intelligence in its entirety. Fine-tuned models are the customer’s alone and are never shared outside their organisation. Additionally, customers can host and run the model on their preferred architecture whether hosted on-premises, in the cloud, or hybrid.