Gen-AI-Today

GenAI TODAY NEWS

Free eNews Subscription

Intel Launches AI Double Whammy: Xeon 6 and Gaudi 3

By Greg Tavarez

From automating tasks to improving decision-making capabilities, AI applications are valuable across industries. To keep up with this growing trend, businesses need infrastructure that supports the development and deployment of AI applications quickly and cost-effectively.

Look back at traditional IT infrastructure. For example, traditional data centers are expensive to build and maintain, and they may not have the flexibility needed to scale resources up or down as needed. The simple fact is that traditional IT infrastructure cannot meet the infrastructure demands from businesses today.

Industry giant Intel, on its mission to create world-changing technology that enables global progress and enriches lives, has a solution to tackle those challenges:

Xeon 6 with Performance-cores (P-cores) and Gaudi 3 AI accelerators.

Intel Xeon 6 with P-cores is designed to handle compute-intensive workloads with exceptional efficiency. Xeon 6 delivers twice the performance of its predecessor. It features increased core count, double the memory bandwidth and AI acceleration capabilities embedded in every core. This processor is engineered to meet the performance demands of AI from edge to data center and cloud environments.

Intel Gaudi 3 AI Accelerator is optimized for large-scale generative AI. Gaudi 3 boasts 64 Tensor processor cores and eight matrix multiplication engines to accelerate deep neural network computations. It includes 128 GB of HBM2e memory for training and inference, and 24 200 gigabit Ethernet ports for scalable networking. Gaudi 3 also offers seamless compatibility with the PyTorch framework and advanced Hugging Face transformer and diffuser models.

Intel recently announced a collaboration with IBM to deploy Intel Gaudi 3 AI accelerators as a service on IBM Cloud. Through this collaboration, Intel and IBM aim to lower the total cost of ownership to leverage and scale AI, while enhancing performance.

“Demand for AI is leading to a massive transformation in the data center, and the industry is asking for choice in hardware, software and developer tools,” said Justin Hotard, Intel Executive Vice President and General Manager of the Data Center and Artificial Intelligence Group. “With our launch of Xeon 6 with P-cores and Gaudi 3 AI accelerators, Intel is enabling an open ecosystem that allows our customers to implement all of their workloads with greater performance, efficiency and security.”

Xeon 6 and Gaudi 3 solidify the company’s commitment to deliver powerful AI systems with optimal performance per watt and lower TCO. Intel is also committed to providing enterprises with the infrastructure they need to deploy AI at scale. By leveraging its x86 architecture and open ecosystem, Intel offers flexible deployment options, competitive pricing and accessible AI technologies. This approach helps businesses build high-value AI systems with optimal TCO and performance per watt. In fact, according to Intel, 73% of GPU-accelerated servers use Intel Xeon as the host CPU3.

Also, to address the challenges of transitioning GenAI prototypes to production, Intel has collaborated with OEMs like Dell Technologies and Supermicro. Through co-engineering efforts, they develop tailored solutions that address real-time monitoring, error handling, logging, security, and scalability concerns. These solutions, based on the Open Platform Enterprise AI platform, integrate OPEA-based microservices into scalable retrieval-augmented generation systems, optimized for Xeon and Gaudi AI systems. This enables customers to easily integrate applications from Kubernetes, Red Hat OpenShift AI and Red Hat Enterprise Linux AI.

Furthermore, Intel's Tiber portfolio offers a range of business solutions to overcome challenges such as access, cost, complexity, security, efficiency and scalability in AI, cloud and edge environments. The Intel Tiber Developer Cloud now provides preview systems of Intel Xeon 6 for tech evaluation and testing.

Additionally, select customers will gain early access to Intel Gaudi 3 for validating AI model deployments, with Gaudi 3 clusters to begin rolling out next quarter for large-scale production deployments.

The recent news provided by Intel only shows its strong commitment to providing accessible and cost-effective AI solutions. The company is positioned as a valuable partner for businesses looking to get the most out of AI.

Be part of the discussion about the latest trends and developments in the Generative AI space at Generative AI Expo, taking place February 11-13, 2025 in Fort Lauderdale, Florida. Generative AI Expo covers the evolution of GenAI and feature conversations focused on the potential for GenAI across industries and how the technology is already being used to create new opportunities for businesses to improve operations, enhance customer experiences, and create new growth opportunities.




Edited by Alex Passett
Get stories like this delivered straight to your inbox. [Free eNews Subscription]

GenAIToday Editor

SHARE THIS ARTICLE
Related Articles

Building Personalized AI Agents

By: Special Guest    4/4/2025

It's tempting to build an AI Agent that can do everything, but that's a recipe for a diluted and, ultimately, less effective generic workflow.

Read More

Salad Redefines AI Transcription with Unmatched Accuracy and Ultra-Low Pricing

By: Erik Linask    3/31/2025

Salad looks to upend the AI transcription market with its low-cost, highly accurate artificial intelligence-driven Salad Transcription API.

Read More

The Human-AI Partnership: Elevating Customer Service Without Losing the Personal Touch

By: Special Guest    3/26/2025

How businesses can leverage AI to improve customer experiences without losing the human touch of customer interactions.

Read More

Boomi AI Studio Launched to Centralize Control and Governance of Enterprise AI Agents

By: Erik Linask    3/10/2025

Boomi AI Studio allows businesses to harness the power of AI-driven automation by delivering the necessary oversight and guardrails to enable scaling …

Read More

IBM Strengthens GenAI Portfolio with DataStax Acquisition

By: Erik Linask    2/25/2025

Bolstering its Generative AI portfolio, IBM announced its plan to acquire AI and data solutions provider DataStax.

Read More

-->