Case Study
Numenta Accelerates Large Language Models with Intel® Xeon® CPU Max Series
Numenta, known for brain-inspired AI innovation, achieved major performance gains by running custom large language models on Intel® Xeon® CPU Max Series processors with high bandwidth memory (HBM). Their models showed up to 20X faster inference for large documents compared to AMD Milan CPUs, significantly reducing the cost of running NLP workloads in production. These advancements enable broader adoption of AI across use cases like natural language processing and computer vision, offering customers breakthrough efficiency and unlocking new capabilities with Intel’s powerful CPU architecture.