Numenta Accelerates Large Language Models with Intel® Xeon® CPU Max Series

Numenta, known for brain-inspired AI innovation, achieved major performance gains by running custom large language models on Intel® Xeon® CPU Max Series processors with high bandwidth memory (HBM). Their models showed up to 20X faster inference for large documents compared to AMD Milan CPUs, significantly reducing the cost of running NLP workloads in production. These advancements enable broader adoption of AI across use cases like natural language processing and computer vision, offering customers breakthrough efficiency and unlocking new capabilities with Intel’s powerful CPU architecture.
